# Aj Davis Course Project Parts a and B

**Topics:**Statistical hypothesis testing, Statistics, Arithmetic mean

**Pages:**9 (1935 words)

**Published:**October 20, 2012

The first variable considered is Location, a categorical variable. The three subcategories are Urban, Suburban and Rural. The frequency distribution and pie chart are included. Measures of central tendency and descriptive statistics are not calculated due to the categorical nature of the variable.

Frequency Distribution:

LOCATION| FREQUENCY|

Urban| 22|

Suburban| 15|

Rural| 13|

The largest number of customers belong to the Urban Location category (44%), followed by those in the Suburban Location category (30%). The least number of customers belong in the Rural Location category (26%).

The next individual variable considered is Household Size, meaning the number of people living in the household. Size is a quantitative variable. The measures of central tendency and variation along with other descriptive statistics have been calculated for this variable.

Descriptive Statistics: AJ DAVIS Customer Data - Household Size

Total

Variable Count N N* CumN Percent CumPct Mean SE Mean TrMean StDev C1 50 50 0 50 100 100 3.420 0.246 3.341 1.739

Sum of

Variable Variance CoefVar Sum Squares Minimum Q1 Median Q3 C1 3.024 50.85 171.000 733.000 1.000 2.000 3.000 5.000

N for

Variable Maximum Range IQR Mode Mode Skewness Kurtosis MSSD C1 7.000 6.000 3.000 2 15 0.53 -0.72 3.214

FREQUENCY DISTRIBUTION:

SIZE| FREQUENCY|

1| 5|

2| 15|

3| 8|

4| 9|

5| 5|

6| 5|

7| 3|

The mean household size of the customers is found to be 3.42. The median of the data is 3 and the mode is 2. The standard deviation is approximately 1.74. The largest number of customers have a household size of 2 as is displayed in the frequency distribution and the bar graph. The 3rd individual variable considered is Credit Balance. It is a quantitative variable. The measures of central tendency, variation and other descriptive statistics have been calculated for this variable.

Descriptive Statistics: Credit Balance

Total

Variable Count N N* CumN Percent CumPct Mean SE Mean TrMean StDev C1 50 50 0 50 100 100 3970 132 3980 932

Sum of

Variable Variance CoefVar Sum Squares Minimum Q1 Median Q3 C1 868430 23.47 198523 830780693 1864 3109 4090 4748

N for

Variable Maximum Range IQR Mode Mode Skewness Kurtosis MSSD C1 5678 3814 1638 3890 2 -0.15 -0.72 373840

RELATIVE FREQUENCY DISTRIBUTION

CREDIT BALANCE| FREQUENCY| RELATIVE FREQUENCY|

$1500-$2000| 1| 0.02|

$2000-$2500| 2| 0.04|

$2500-$3000| 6| 0.12|

$3000-$3500| 6| 0.12|

$3500-$4000| 8| 0.16|

$4000-$4500| 12| 0.24|

$4500-$5000| 7| 0.14|

$5000-$5500| 6| 0.12|

$5500-$6000| 2| 0.04|

The mean credit balance of the customers is given as $3980. The standard deviation is approximately 932. The credit balance of the customers is more or less normally distributed with the peak of the bell shaped distribution falling in the range $4000 - $4500. The most customers have a credit balance within this range.

The relationship between the variables Income and Size is illustrated in the following scatter plot:

It is clear from the scatter plot that no relationship exists between size and income. The points are scattered with no pattern. The relationship between the variables Income and Credit Balance is illustrated in the following scatter plot:...

Please join StudyMode to read the full document