# Regression Analysis

Mortality Rates
Regression Analysis of Multiple Variables

Neil Bhatt
993569302
Sta 108 P. Burman
11 total pages

This study asks whether pollution has an impact on the mortality rate. The data come from 60 cities (n = 60). The response variable Y is the mortality rate per 100,000 population. The candidate predictors are education, the percentage of the population that is nonwhite, the percentage of the population deemed poor, precipitation, the amount of sulfur dioxide, and the amount of nitrogen dioxide.
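The excerpt does not write out the model being fitted. As a sketch only, assuming the standard first-order multiple linear regression form with all six predictors entered linearly, it could be stated as follows. The subscripted variable names and the label SO2 are illustrative assumptions, not notation taken from the report.

$$
Y_i = \beta_0 + \beta_1\,\mathrm{EDUC}_i + \beta_2\,\mathrm{NONWHITE}_i + \beta_3\,\mathrm{POOR}_i + \beta_4\,\mathrm{PRECIP}_i + \beta_5\,\mathrm{SO2}_i + \beta_6\,\mathrm{NOX}_i + \varepsilon_i, \qquad i = 1, \dots, 60
$$

Under this form, the question of whether pollution affects mortality corresponds to asking whether $\beta_5$ and $\beta_6$ differ from zero once the other predictors are accounted for.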
Data:
The data cover 60 Standard Metropolitan Statistical Areas (SMSAs) in the United States and were obtained for the years 1959-1961. [Source: GC McDonald and JS Ayers, “Some applications of the ‘Chernoff Faces’: a technique for graphically representing multivariate data”, in Graphical Representation of Multivariate Data, Academic Press, 1978.]
From these data we construct a scatter plot matrix to check visually, before any calculation, whether the variables appear to be correlated.

Data Distribution:

Scatter Plot Matrix
The scatter plot matrix suggests that the points cluster along what appears to be a relationship between Y (mortality rate) and several of the candidate predictors X.
To quantify these pairwise relationships, we next construct a correlation matrix.
Correlation Matrix

| | EDUC | MORTALITY | NONWHITE | NOX | POOR | PRECIP |
|---|---|---|---|---|---|---|
| EDUC | 1.0000000 | -0.51098130 | -0.2087739 | 0.22440191 | -0.40333845 | -0.4904252 |
| MORTALITY | -0.5109813 | 1.00000000 | 0.6437364 | -0.07738105 | 0.41045399 | 0.5094924 |
| NONWHITE | -0.2087739 | 0.64373637 | 1.0000000 | 0.01838530 | 0.70491501 | 0.4132045 |
| NOX | 0.2244019 | -0.07738105 | 0.0183853 | 1.00000000 | -0.10254386 | -0.4873207 |
| POOR | -0.4033385 | 0.41045399 | 0.7049150 | -0.10254386 | 1.00000000 | |
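The report does not say which software produced this matrix. As a minimal sketch of how such a matrix can be computed, the following pure-Python code evaluates the Pearson correlation for every pair of columns. The function names `pearson` and `correlation_matrix` are hypothetical, and the numbers in `toy` are made up for illustration; they are not the SMSA data.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(columns):
    """Return a nested dict: result[a][b] is the correlation of columns a and b."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


# Made-up illustrative values, NOT the 60-city data used in the report.
toy = {
    "EDUC": [9.0, 10.5, 11.0, 12.0, 12.5],
    "MORTALITY": [1020.0, 980.0, 950.0, 900.0, 910.0],
    "PRECIP": [45.0, 40.0, 38.0, 30.0, 33.0],
}

corr = correlation_matrix(toy)
for row_name in toy:
    cells = "  ".join(f"{corr[row_name][col]: .4f}" for col in toy)
    print(f"{row_name:<10} {cells}")
```

The result is symmetric with ones on the diagonal, which is why each off-diagonal value in the table above appears twice.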
