1. What is the difference between R2 and adjusted R2?

R2 is a statistic that will give some information about the goodness of fit of a model. In regression, the R2 coefficient of determination is a statistical measure of how well the regression line approximates the real data points. An R2 of 1.0 indicates that the regression line perfectly fits the data. Adjusted R2 is a modification of R2 that adjusts for the number of explanatory terms in a model. Unlike R2, the adjusted R2 increases only if the new term improves the model more than would be expected by chance. The adjusted R2 can be negative, and will always be less than or equal to R2. Adjusted R2 does not have the same interpretation as R2. As such, care must be taken in interpreting and reporting this statistic. Adjusted R2 is particularly useful in the Feature selection stage of model building. Adjusted R2 is not always better than R2: adjusted R2 will be more useful only if the R2 is calculated based on a sample, not the entire population. For example, if our unit of analysis is a state, and we have data for all counties, then adjusted R2 will not yield any more useful information than R2.

2. How does testing the significance of the entire multiple regression models differ from testing the contribution of each independent variable?

When testing the significance of the entire multiple regression, we are testing the jointly affect of the regressors (predictors) all together. On the other hand, when testing the contribution of each independent variable, we are testing the affect of that specific variable on the dependent variable.

3. Why and how do you use dummy variables?

The use of dummy variables allows you to include categorical independent variables as

part of the regression model. If a given categorical independent variable has two categories, then you need only one dummy variable to represent the two categories.

...Trajico, Maria Liticia D.
BSEd III-A2
REFLECTION
The first thing that puffs in my mind when I heard the word STATISTIC is that it was a very hard subject because it is another branch of mathematics that will make my head or brain bleed of thinking of how I will handle it. I have learned that statistic is a branch of mathematics concerned with the study of information that is expressed in numbers, for example information about the number of times something happens. As I examined on what the statement says, the phrase “number of times something happens” really caught my attention because my subconscious says “here we go again the non-stop solving, analyzing of problems” and I was right. This course of basic statistic has provided me with the analytical skills to crunch numerical data and to make inference from it. At first I thought that I will be alright all along with this subject but it seems that just some part of it maybe it is because I don’t pay much of my attention to it but I have learned many things. I have learned my lesson.
During our every session in this subject before having our midterm examination I really had hard and bad times in coping up with this subject. When we have our very first quiz I thought that I would fail it but it did not happen but after that, my next quizzes I have taken I failed. I was always feeling down when in every quiz I failed because even though I don’t like this...

...1. A density curve consists of a straight line segment that begins at the origin (0, 0) and has
slope of 1.
a. Sketch this density curve. What are the coordinates of the right endpoint of the segment?
(Note that the right endpoint should be fixed so that the total area under the curve is 1.
This is required for a valid density curve.)
b. Determine the median, the first quartile (Q1), and the third quartile (Q3).
c. Relative to the median, where would you expect the mean of the distribution to lie?
Explain briefly.
The distribution is skewed left, so the mean will be left of the median.
d. What percent of the observations lie below 0.5? Above 1.5?
2. The following are the salaries of employees in a small business:
15,000 15,000 17,500 23,500 23,750 25,000 26,000 27,500 29,000 45,000
a. Before using your calculator, do you think that the mean or the median for these values
will be higher? Why do you think so?
b. What are the mean and median of the salaries? Was your answer to (a) correct?
c. Find the standard deviation and the IQR of the data
d. Describe the shape of the distribution terms of its overall shape, including outliers and
skewness. Be sure to give the locations of the main features
e. Speculate on the reasons for why the distribution is shaped as it is.
3. The following represents the outcomes of rolling a die 600 times (actually, it was
simulated on the TI83 as follows: RandInt (1,6,600)_L1).
Face
1
2
3 ...

...QUESTION 21
The finishing process on new furniture leaves slight blemishes. The table below displays a manager's probability assessment of the number of blemishes on one piece of new furniture.
Number of Blemishes
0
1
2
3
4
5
Probability
0.34
0.25
0.19
0.11
0.07
0.04
1. On average, how many blemishes do we expect on one piece of new furniture?
2. What is the variance of blemishes on one piece of new furniture? (round to the nearest hundredth) QUESTION 22
The probability that a person catches a cold during the cold-and-flu season is 0.4. Assume that 10 people are chosen at random.
On average, how many of these ten people would you expect to catch a cold?
What is the standard deviation of the number of people who catch a cold? (round to the nearest hundredth)
QUESTION 23
The number of nails in a five-pound box is normally distributed with a mean of 566 and a standard deviation of 33.
What is the probability that there are less than 500 nails in a randomly-selected five-pound box of nails? (express as a decimal, not a percentage)
The probability is 0.99 that a randomly-selected five-pound box of nails contains at least how many nails approximately?
QUESTION 24
You are the owner of a small casino in Las Vegas and you would like to reward the high-rollers who come to your casino. In particular, you want to give free accommodations to no more than 10% of your patrons....

...cans and found that the arithmetic mean weight was 16.05 ounces. At the 5 percent level of significance, can we conclude that the mean weight is greater than 16 ounces? Determine the p-value.
38. A recent article in the Wall Street Journal reported that the 30-year mortgage rate is now less than 6 percent. A sample of eight small banks in the Midwest revealed the following 30- year rates (in percent):
4.8 5.3 6.5 4.8 6.1 5.8 6.2 5.6
At the .01 significance level, can we conclude that the 30-year mortgage rate for small banks is less than 6 percent? Estimate the p-value.
Chapter 11
27. A recent study focused on the number of times men and women who live alone buy take-out dinner in a month. The information is summarized below.
Statistic Men Women
Sample mean 24.51 22.69
Population standard deviation 4.48 3.86
Sample size 35 40
At the .01 significance level, is there a difference in the mean number of times men and women order take-out dinners in a month? What is the p-value?
46. Grand Strand Family Medical centre is specifically set up to treat minor medical emergencies for visitors to the Myrtle Beach area. There are two facilities, on in the Little River area, and the other in Murrells Inlet. The Quality Assurance Department wishes to compare mean waiting times for patients in two locations. Samples of waiting times, reported in minutes follow:
Location Waiting time
Little River 31.73 28.77 29.53 22.08 29.47 18.60 32.94 25.18 29.82 26.49...

...MAT 308 Name:___________________________
Chp 1-2 Test
100 points
In order to receive full credit, please show all work for every problem. You are welcome to use your calculator, book and notes, but please take this test on your own.
1. In your own words, explain what parameters and statistics are and what the difference is between them. Give an example to clarify. (10 points)
2. According to Consumer Reports, a random sample of 35 new cars gave an average of 21.1 mpg.
What is the variable being studied?
What is the implied population?
What is the sample?
Is the variable qualitative or quantitative?
Is the variable discrete, continuous or neither?
(10 points)
3. A national survey asked 1261 US adult fast-food customers which meal (breakfast, lunch, dinner or snack) they ordered.
What is the variable being studied?
What is the implied population?
What is the sample?
Is the variable qualitative or quantitative?
Is the variable discrete, continuous or neither?
(10 points)
4. Consider a completely randomize experiment in which a control group is given a placebo for congestions relief and a treatment group is given a new drug for congestion relief.
What does it mean that the control group was given a placebo? (4 points)
How could this experiment be made into a double-blind experiment? (4 points)
5. Which technique for gathering data (observational study or...

...Practice Problems 1-KEY 1. The closing stock price of Ahmadi, Inc. for a sample of 10 trading days is shown below. Day Stock Price 1 84 2 87 3 84 4 88 5 85 6 90 7 91 8 83 9 82 10 86
For the above sample, compute the following measures. a. b. c. The mean = ∑X/n = 860/10 = 86 The median = (85+86)/2 = 85.5 The variance = ∑ X - X 2/ n-1 = {(84-86)2 + (87-86)2 + (84-86)2 + (88-86)2 + (85-86)2 + (90-86)2 + (91-86)2 + (83The standard deviation = √8.89 = 2.98 The coefficient of variation = 2.98/86 * 100% = 3.47%
86)2 + (82-86)2 + (86-86)2 } / (10 -1) = 8.89 d. f.
2. In 2008, the average age of students at GUST was 22 with a standard deviation of 3.96. In 2009, the average age was 24 with a standard deviation of 4.08. In which year do the ages show a more dispersed distribution? Show your complete work and support your answer. CV2008 = 3.96/22 * 100% = 18% CV2009 = 4.08/24 * 100% = 17% So, 2008 shows more dispersed distribution 3. A local university administers a comprehensive examination to the recipients of a B.S. degree in Business Administration. A sample of examinations are selected at random and scored. The results are shown below. Grade For the above data, determine a. The mean = ∑X/n = 664/8 = 83 b. c. The median = (85+87)/2 = 86 The standard deviation = √variance ariance = ∑ X - X 2/ n-1 = {(93-83)2 + (65-83)2 + (80-83)2 + (97-83)2 + (85-83)2 + (87-83)2 + (97-83)2 + (60 - 83)2 } / (8 -1) = 196.29 S0, standard deviation = √196.29 = 14.01 d. The coefficient of...

...DETERMINANTS IN A BUSINESS STATISTICS COURSE AT A LARGE URBAN INSTITUTION
CIS 3300
November 30, 2012
INTRODUCTION
This research paper discusses the effects of several different factors on a student’s success in a Business Statistics course. The different variables include areas related to the student’s academic factors as well as factors related to the student’s personal life. The academic related variables are: course of study, study hours per week, semester credit hours, GPA, class year, semester and class time. The personal life variable is: work hours per week. All of the above listed variables are highly related to a students’ ability to succeed in a Business Statistics course.
The main purpose of this research paper is to determine which factors show the greatest significance in predicting a students’ success in a Business Statistics course. This study will provide valuable information to both students and professors by helping both to modify certain factors to produce a higher success rate in this course. This information could be used by students to decide what the best time of day would be to take a statistics course, how many study hours are needed or to decide which school year is best to enroll in the course. Professors could use this information to schedule their statistics courses at the peak hours of the day that are the best times to take a...

...Organization of Terms
Experimental Design
Descriptive
Inferential
Population
Parameter
Sample
Random
Bias
Statistic
Types of
Variables
Graphs
Measurement scales
Nominal
Ordinal
Interval
Ratio
Qualitative
Quantitative
Independent
Dependent
Bar Graph
Histogram
Box plot
Scatterplot
Measures of
Center
Spread
Shape
Mean
Median
Mode
Range
Variance
Standard deviation
Skewness
Kurtosis
Tests of
Association
Inference
Correlation
Regression
Slope
y-intercept
Central Limit Theorem
Chi-Square
t-test
Independent samples
Correlated samples
Analysis-of-Variance
Glossary of Terms
Statistics - a set of concepts, rules, and procedures that help us to:
organize numerical information in the form of tables, graphs, and charts;
understand statistical techniques underlying decisions that affect our lives and well-being; and
make informed decisions.
Data - facts, observations, and information that come from investigations.
Measurement data sometimes called quantitative data -- the result of using some instrument to measure something (e.g., test score, weight);
Categorical data also referred to as frequency or qualitative data. Things are grouped according to some common property(ies) and the number of members of the group are recorded (e.g., males/females, vehicle type).
Variable - property of an object or event that can take on different values. For example, college major...