Assignment #3
Constructing a Methodology for Statistical Analysis
Christian Diener
998029324
Anthony Chum

Research Problem
As pollution continues to rise in our cities due to various human activities, the incidence of cancer also seems to be increasing. The study of cancer is important because it is a major health problem in today’s society. Not only is it a significant contributor to deaths all around the world but it also affects our economy as a whole. It affects the economy because people who develop cancer are forced to take off work in order to go through the many treatments of radiation and chemotherapy, which in turn has a severe impact on the economy due to the millions of dollars lost in productivity. It is also a financial burden on people due to them losing their job which makes it harder for them to pay the bills and provide themselves with basic needs. So, overall cancer affects a lot of aspects in people’s lives and not just their own lives. Therefore it is an important issue to study and my research question therefore is: Is there significant difference in health related problems, specifically cancer, amongst residents in different regions of the Greater Toronto Area? Specific Research Questions

The following are more specific questions based on my general research topic: Is there a significant difference in the prevalence of cancer between men and women? * Null Hypothesis: There is no significant difference in the prevalence of cancer between men and women. * Alternate Hypothesis: There is a significant difference in the prevalence of cancer in men and women. Is there a significant difference in the occurrence of cancer between people living in urban and rural areas? * Null Hypothesis: There is a significant difference in the occurrence of cancer between people living in urban and rural areas * Alternate Hypothesis: There is no significant difference in the occurrence of cancer between people living in urban and rural areas...

...
UNIVERSITY OF LA VERNE
COLLEGE OF BUSINESS AND PUBLIC MANAGEMENT
BUS 500C
QUANTITATIVE & STATISTICALANALYSIS
COMPREHENSIVE FINAL EXAMINATION
1. The personnel director for a business organization has identified 10 individuals as qualified candidates for 3 managerial training positions her firms seeks to fill. Use the appropriate rule to give the number of different combinations of the 10 individuals who could be chosen for the 3 positions.
As discussed in class we would use the combination approach. The primary reason is that they are all identical (qualified) positions and thus the order would not matter.
Data: The variable is the total number of elements = n (3) and r (10) designates the number of groups and is expressed by the following formula from our textbook (Keller, 2012).
The formula that produces the result of 120 combinations is.
= 10! = 10x9x8x7x6x5x4x3x2x1 3628800 = 120
3(10-3)! (3x2x1)(7x6x5x4x3x2x1) = 6(5040)
2. The president, vice president, secretary, and treasurer are to be selected from a group of 10 candidates. Use the appropriate rule to give the number of ways the positions may be filled.
Based on class discussion and my studies, and the facts provided, the permutation rule would be used to determine the answer. Applying this formula there would be 5040 permutations.
Order would matter, because there is a order or value to the four positions and each of the candidates could be combined...

...to create a multiple regression anlysis for this problem. Please provide as much explanation as you can. Please see attached files.
My research is based on this topic below. The data is attached in the spreadsheet. This is a multiple regression analysis. I have attached a PDF file that explains the case and the spreadsheet version with all the data recorded from the PDF file. Pleas emae sure you include all the graphs, plots and please use megastat software.
Topic:
We want to determine the primary factors that affect property crime rates in the United States. The statisticalanalysis of the data involves multiple-regression analysis.
Questions to answer are:
1. What are the primary determinants of property crimes in the United States?
2. What would you like to know about property crime rates that cannot be answered by this data set?
3. How does population density affect property crime rates? Is this expected?
You will want to prepare a summary of your findings to present to a management team from a national crime department. You will find and explain the regression model using a non technical discussion to explain the important factors affect on the property crime rate.
Answer
Multiple regression analysis can be used to model property crime in United States . The regression model suggested is of the form.
Crimes = b0+b1Pincome + b2Dropout +b3Pubaid+b4density+ b5Kids+...

...MBA Business Statistics
Homework 1
Reminders:
1. Due date: Jan-14-2012 (Saturday) in class.
2. Please submit only the hardcopy.
3. Please show the names and ID numbers of all your group members on the cover page. Please also
indicate your session (DSME5110W).
1.
Problem 2.1 (p. 33)
The file P02_01.xlsx indicates the gender and nationality of the MBA incoming class in two
successive years at the Kelley School of Business at Indiana University.
a. For each year, create tables of counts of gender and of nationality. Then create column charts of
these counts. Do they indicate any noticeable change in the composition of the two classes?
b. Repeat part a for nationality, but recode this variable so that all nationalities that have counts of 1
or 2 are classified as Other.
2.
Problem 2.5 (p. 33)
The file DJIA Monthly Close.xlsx contains monthly values of the Dow Jones Industrial Average
from 1950 through 2009. It also contains the percentage changes from month to month. (This file will
be used for an example later in this chapter.) Create a new column for recoding the percentage
changes into six categories: Large negative (< -3%), Medium negative (< -1%, ≥ -3%), Small
negative (< 0%, ≥ -1%), Small positive (< 1%, ≥ 0%), Medium positive (< 3%, ≥ 1%), and Large
positive (≥ 3%). Then create a column chart of the counts of this categorical variable. Comment on its
shape.
3.
Problem 2.6 (p. 55)
The file P02_06.xlsx lists the average time (in...

...StatisticalAnalysis
The analysis of the data from the study of Barnes & Noble stores is in two stages, the descriptive study and inferential statistical study. Initially, the Team will distribute and collect the questionnaires. The use of classification will summarize the data and express it in the tabular form for better understanding of the data. For example, if the questionnaires consist of information from males and females, the data is putinto two categories and expressed in a table form. A proper graphical method can display the data summation. As an example, the display of information can be in a pie chart or simple bar chart of different categories like people who prefer to read electronic books and people who prefer to read hard copy books. Finally, a proper statistical hypothesis testing method can answer the research questions in the questionnaire. In the Barnes & Noble case study, the hypothesis test that proportion of people who prefer to read the electronic copy of books is larger than the proportion of people who prefer to read hard copy books. The hypothesis test consists of equality of proportions of two populations. The hypothesis test can assist in determining if the emergence of electronic copy of books is reducing the prices of hard bound and paperback books.
Analyzing the Data
The selection of the analysis is based on two things: the way...

...Elementary Concepts in Statistics. In this introduction, we will
briefly discuss those elementary statistical concepts that provide the necessary
foundations for more specialized expertise in any area of statistical data analysis. The
selected topics illustrate the basic assumptions of most statistical methods and/or have
been demonstrated in research to be necessary components of one's general
understanding of the "quantitative nature" of reality (Nisbett, et al., 1987). Because of
space limitations, we will focus mostly on the functional aspects of the concepts
discussed and the presentation will be very short. Further information on each of those
concepts can be found in the Introductory Overview and Examples sections of this
manual and in statistical textbooks. Recommended introductory textbooks are:
Kachigan (1986), and Runyon and Haber (1976); for a more advanced discussion of
elementary theory and assumptions of statistics, see the classic books by Hays (1988),
and Kendall and Stuart (1979).
• What are variables?
• Correlational vs.
experimental research
• Dependent vs. independent
variables
• Measurement scales
• Relations between variables
• Why relations between
variables are important
• Two basic features of every
relation between variables
• What is "statistical
significance" (p-value)
• How to determine that a
result is "really" significant
•...

...Gender Discrimination: A StatisticalAnalysis
Gender discrimination, or sex discrimination, may be characterized as the unequal treatment of a person based solely on that person's sex. .
It is apparent that gender discrimination is pervasive in the modern workplace, however, its presence and effects are often misrepresented and misunderstood. Statistical testing plays an important role in cases where the existence of discrimination is a disputed issue and has been used extensively to compare expected numbers of members of a protected group, to the actual number of members of that protected group that have been involved in a significant employment action. This paper will use statistical testing and analysis, including a multiple regression model, to estimate the effects that various independent variables have upon the dependent variable, salary level.
This analysis utilized a data sample consisting of 46 employees and variables relating to each of those employees. These variables include: gender, age, level of education, length of employment, job type, and weekly salary. Each of these variables is further broken as follows: gender was divided between males and females; age was listed as the age of the employee; education was broken down to reflect the last level of education obtained by the employee, some high school, high school, college, and graduate school; employment length was...

...Problems Chapter 7
1. A population of 1,000 students spends an average of $10.50 a day on dinner. The standard deviation of the expenditure is $3. A simple random sample of 64 students is taken.
a. What are the expected value, standard deviation, and shape of the sampling distribution of the sample mean?
b. What is the probability that these 64 students will spend a combined total of more than $715.21?
c. What is the probability that these 64 students will spend a combined total between $703.59 and $728.45?
ANS:
a. 10.5 0.363 normal
b. 0.0314
c. 0.0794
2. The life expectancy in the United States is 75 with a standard deviation of 7 years. A random sample of 49 individuals is selected.
a. What is the probability that the sample mean will be larger than 77 years?
b. What is the probability that the sample mean will be less than 72.7 years?
c. What is the probability that the sample mean will be between 73.5 and 76 years?
d. What is the probability that the sample mean will be between 72 and 74 years?
e. What is the probability that the sample mean will be larger than 73.46 years?
ANS:
a. 0.0228
b. 0.0107
c. 0.7745
d. 0.1573
e. 0.9389
3. A simple random sample of 8 employees of a corporation provided the following information.
Employee 1 2 3 4 5 6 7 8
Age 25 32 26 40 50 54 22 23
Gender M M M M F M M F
a. Determine the point estimate for the average age of all employees.
b. What is...