Top-Rated Free Essay
Preview

Statistics

Satisfactory Essays
580 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Statistics
ASSIGNMENT SUBMISSION FORM Course Name: SMMD Assignment Title: Assignment 1 (HousePrices.jmp)
Submitted by: Garima Agrawal (Section D)
(Student name or group name) Group Member Name | PG ID | Garima Agrawal | 61410506 |

Question1: The data for home values has a considerable wide range (429578) as compared to the inter-quartile range (93522). This means the data has a huge spread and the same can be verified from coefficient of variation which is even more than 41%. Besides, as can be seen from graphical plot and the positive skewness (0.87) measure, the data is skewed towards right. Also, the outliers present towards the right end indicate the presence of few extremely high valued houses, due to which average price of houses is higher than the median price. The highest density of data is present in two lower quartiles, as can be seen from box plot. This shows that low valued houses are present in bulk, and thus must available in the market easily. | |
Question 2: Though normal distribution model is not an absolutely apt for the data set of prices, the data can still be analyzed by assuming normality owing to the fact that data points hover around the diagonal line of normal Quantile plot. Some data points also cross the permissible range, but the density of data (high in the middle, and low at the ends ) allows for the usage of normal distribution model.The same can be verified from the measure of Kurtosis (0.7) which is well in permissible range for usage of normal distribution model. | |

Question 3:

MEAN = 164K; STANDARD DEVIATION = 68K A.Z1 (@ x as 92.8K) = (92.8 – 164)/68 = -1.04Z2 (@ x as 255.5K) = (255.5 – 164)/68 = 1.34P(Z1 < Z < Z2) = 0.9099 – 0.1492 = 0.7607Percentage probability is 76.07, which seems to be more than the actual value, basis what can be seen via boxplot. | B.Z1 (@ x as 232K) = (232 – 164)/68 = 1P( Z < Z1) = 0.8413Percentage probability is 84.13, which is consistent with what can be seen via data distribution. | C.Prob (Z < Z1) = 0.75Z1 = 0.6745Price, X1 = (68)*(0.6745) + 164 = 209.86So, Theoretical value of house at 75th quartile is 209.9K as compared to the actual value of 205.3K. |

Question 4: On the analysis of data set, histogram clearly depicts the presence of white spaces in the data, which are values of living area unavailable in the market. Same is not directly evident from box plot.And the box plot on the other hand aptly shows the position of median and quartiles instantly. As can be seen from histogram as well as box plot, the data set is skewed towards right. The same can be verified from the measure of Skewness (0.807) which being positive indicates a right skewed data set. | |

Question 5:

On the analysis of data set, histograms clearly depict that original variable is a better fit for normal distribution variable as compared to the logarithmic one. The value of kurtosis for the new variable is -0.47 as compared to that of original at 0.392. Closer to zero is the value of Kurtosis, more is the normality in data.In my opinion, this change in normality is due to the fact that logarithm has scaled down the range and thus increased the number of bars relatively. This has caused data to deviate a little from normality. | Plot for living area | Plot for Log(Living Area) |

You May Also Find These Documents Helpful

  • Satisfactory Essays

    Mat 540 Quiz 4

    • 761 Words
    • 4 Pages

    P(-2.25 < z < 1.25) = F(1.25) - (1 - F(2.25)) = 0.89435 - (1 - 0.987776) = 0.882126…

    • 761 Words
    • 4 Pages
    Satisfactory Essays
  • Satisfactory Essays

    MATH533 Week 2 Project

    • 495 Words
    • 2 Pages

    Size is the most understandable data to me because it clearly shows that most of the customers sampled had a household size of 2. As you can see from the graph below size 2 is over double the size of most of the other data. The mean of the household size data is 3.42 and the standard deviation is 1.739. According to the histogram the data is mostly skewed to the right.…

    • 495 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Maths Paper Notingham Uni

    • 333 Words
    • 2 Pages

    of jar fills is normally distributed, what percentage of jar fills will be (i) greater than 202.5…

    • 333 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    QNT351 Week5 MyStatsLab

    • 2853 Words
    • 9 Pages

    What is the probability that a Type I error will be made for z > 2.575?…

    • 2853 Words
    • 9 Pages
    Satisfactory Essays
  • Powerful Essays

    0.003 x 100 / 0.040 = 7.5... ≈ ±8% ---------------------> 1.5% + 8% = ±9.5%…

    • 638 Words
    • 3 Pages
    Powerful Essays
  • Good Essays

    Math 116

    • 1192 Words
    • 5 Pages

    Find the z-score for having area 0.07 to its right under the standard normal curve, that is, find [pic].…

    • 1192 Words
    • 5 Pages
    Good Essays
  • Powerful Essays

    In the world of real estate, location sometimes determines the future of a property, residential or non-residential. Revere Street, a residential property located in the heart of downtown Boston, Massachusetts, provides a wonderful location for residents who also like to enjoy the convenience and diversity the city could offer. Revere Street shares Boston's unique New England historical city view and modernized financial district landscape. The pin point with the letter A on the map below shows a geographic center of Revere Street inside the City of Boston.…

    • 742 Words
    • 4 Pages
    Powerful Essays
  • Good Essays

    Stat study guide 101

    • 278 Words
    • 3 Pages

    ex: probability of getting between 270 and 310 successes inclusive = 269.5 < x 157.5…

    • 278 Words
    • 3 Pages
    Good Essays
  • Good Essays

    A histogram shows the distribution of data within the Income. In this Histogram graph of Income, it shows that the graph is not symmetrical. This histogram graph has a wider bell shape form. The graph shows that this graph is more like two graph because there is a clear difference between income generating from 20-40 and from 50-above. There are two separated cluster; therefore, the skewness of this graph is skewed right. Income has a lower value of kurtosis which indicates a lower, less distinct peak. The following table shows the numerical summary of Income:…

    • 1166 Words
    • 5 Pages
    Good Essays
  • Powerful Essays

    Week 4 Ilab

    • 813 Words
    • 4 Pages

    * Finally, we will change the probability of a success to ¾. In column C4 enter the words ‘three fourths’ as the variable name. Again, use similar steps to that given above in order to calculate the probabilities for this column. The only difference is in Event probability: use 0.75.…

    • 813 Words
    • 4 Pages
    Powerful Essays
  • Satisfactory Essays

    week 2 quiz

    • 401 Words
    • 2 Pages

    Y ~ N(98.2 , 0.62 / root(50) ) P(Y < 97.98) = P(Z < (97.98 - 98.2) / 0.0876...) = P(Z < -2.509...) = 1 - phi(2.509...) = 1 - 0.99395 = 0.0060 = 6% correct…

    • 401 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    Lightlab

    • 470 Words
    • 9 Pages

    Percent error=> +/- 0.51% Percent error= | (theoretical-experimental) / (theoretical) |*100 Percent error= | (195-196) / (195) |*100 Percent error=> +/- 0.51% Percent error= | (theoretical-experimental) / (theoretical)…

    • 470 Words
    • 9 Pages
    Powerful Essays
  • Satisfactory Essays

    exercise 18

    • 496 Words
    • 3 Pages

    Assuming that the distribution is normal for weight relative to the ideal and 99% of the male participants scored between (-53.68,64.64), Where did 95% of the values for weight relative to the ideal lie? Round your answer to two decimal places.…

    • 496 Words
    • 3 Pages
    Satisfactory Essays
  • Satisfactory Essays

    cheat sheet

    • 266 Words
    • 2 Pages

    To determine Z using the given area from the left of the bell curve and standard deviation, use invNorm(probability, mean, standard deviation). If using the right, use invNorm(1 – probability, mean, standard deviation).…

    • 266 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    statistics exam 2

    • 404 Words
    • 2 Pages

    Probability 0.1746 0.15 0.10 0.05 0.00 0 P(X=5)=0.175 5 X 10 What is the probability that Mary will get a score of no more than 35% on this exam? n=20, p=0.20 (0.35)(20)=7 P(X≤7) Distribution Plot Binomial, n=20, p=0.2 0.25 0.9679 Probability 0.20 0.15 0.10 0.05 0.00 X P(X≤7)=0.968 7 10 The following information is available on the number of calls received at the telephone switchboard of…

    • 404 Words
    • 2 Pages
    Good Essays

Related Topics