Statistics

Statistics 101 Report: The Kentucky Milk Case Study

Preliminary Analysis
2a)

Figure 1: X as a Data Object
X is a data frame, as the R output in Figure 1 shows. It contains 274 observations of 11 variables: the number of observations is the number of rows, and the number of variables is the number of columns.
2b)

Figure 2: Creating a sub-data frame from X

Figure 3: Sub-data frame from X
Figure 2 shows the commands entered into R to create a sub-data frame from X containing the observations of the 7 selected variables. Figure 3 shows the resulting sub-data frame.
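The R commands themselves survive only as screenshots, so the following is a minimal Python sketch of the same two ideas: reading the dimensions of a data frame and keeping only selected variables. The toy values, the helper names shape and select_columns, and every column name other than WWBID are assumptions made for illustration, not the study's data or code.

```python
# Toy stand-in for the data frame X, stored as a mapping of column -> values.
# All values and all column names except WWBID are hypothetical.
X = {
    "YEAR": [1984, 1985, 1986],
    "MARKET": ["TRI-COUNTY", "SURROUND", "SURROUND"],
    "WWBID": [0.1425, None, 0.1510],
    "WINNER": ["MEYER", "TRAUTH", "MEYER"],
}


def shape(frame):
    """Return (number of observations, number of variables)."""
    n_vars = len(frame)
    n_obs = len(next(iter(frame.values()))) if frame else 0
    return n_obs, n_vars


def select_columns(frame, names):
    """Return a sub-data frame holding only the named variables."""
    return {name: list(frame[name]) for name in names}


sub = select_columns(X, ["YEAR", "MARKET", "WWBID"])
print(shape(X))    # rows give the observations, columns give the variables
print(shape(sub))  # same observations, fewer variables
```

Selecting columns leaves the number of observations unchanged, which is why the sub-data frame has as many rows as X.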
2c)

Figure 4: Missing values of WWBID
The variable WWBID has 36 missing values, and the fraction of missing values out of the total number of cases is 12/91.
2d)
The percentage of cases in the dataset that contain one or more missing values, as calculated using R, is 13.19%, which can also be seen in Figure 4.
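As a rough illustration of the two counts in 2c) and 2d), the Python sketch below counts missing values in one variable and then the share of cases with at least one missing value. The rows are invented, LFCBID is a hypothetical second column, and the helper names are assumptions. The last lines check only the arithmetic, assuming the fraction in 2c) reads 12/91.

```python
from fractions import Fraction

# Invented rows; None marks a missing value. LFCBID is a hypothetical column.
rows = [
    {"WWBID": 0.14, "LFCBID": 0.16},
    {"WWBID": None, "LFCBID": 0.15},
    {"WWBID": 0.13, "LFCBID": None},
    {"WWBID": 0.15, "LFCBID": 0.17},
]


def missing_count(rows, name):
    """Number of cases where the named variable is missing."""
    return sum(1 for row in rows if row[name] is None)


def incomplete_fraction(rows):
    """Fraction of cases with one or more missing values."""
    incomplete = sum(1 for row in rows if any(v is None for v in row.values()))
    return Fraction(incomplete, len(rows))


wwbid_missing = missing_count(rows, "WWBID")
share = incomplete_fraction(rows)
print(wwbid_missing, share)

# Arithmetic check under the assumption that 2c) reports 12/91:
reported = Fraction(12, 91)
print(round(float(reported) * 100, 2))  # percentage to two decimals
```

Under that reading, 12/91 rounds to the 13.19% reported in 2d).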

2e)

Figure 5: Bid price variable for markets Tri-County & Surround in 1984

Figure 6: Bid price variable for markets Tri-County & Surround in 1985

Figure 7: Bid price variable for markets Tri-County & Surround in 1986

Figure 8: Bid price variable for markets Tri-County & Surround in 1987

Figure 9: Bid price variable for markets Tri-County & Surround in 1988

Figures 5-9 above show the box plots obtained using R for Tri-County and Surround for the years 1984 to 1988. Examining these plots of the combined data for Meyer and Trauth shows potential outliers in every year.
Potential outliers are present in all 5 years for Surround, but only in 1985 and 1988 for Tri-County.
The potential outliers in Surround are all maximum values of the bid price variable. Tri-County, however, has a potential outlier in 1988 that is the minimum value of the bid price variable.
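
The report does not state which rule marks a point as a potential outlier. A common box-plot convention, assumed here, flags values more than 1.5 times the interquartile range beyond the quartiles. The Python sketch below applies that rule to invented bid prices. The function name is hypothetical, and quartile conventions differ between tools (R's box plots use hinges), so borderline points may be classified differently.

```python
from statistics import quantiles


def potential_outliers(values, coef=1.5):
    """Flag values outside [Q1 - coef*IQR, Q3 + coef*IQR].

    Assumes the 1.5*IQR box-plot convention; the quartiles come from
    statistics.quantiles with the 'inclusive' method, which may differ
    slightly from the hinges used by other software.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - coef * iqr, q3 + coef * iqr
    return [v for v in values if v < low or v > high]


# Invented bid prices for one market and year, not the study's data.
bids = [0.130, 0.132, 0.135, 0.136, 0.138, 0.140, 0.141, 0.190]
flagged = potential_outliers(bids)
print(flagged)
```

A flagged value that is also the largest observation corresponds to the Surround pattern described above, while a flagged smallest observation corresponds to Tri-County in 1988.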
