Multi Regression Problem for Wine Quality

The purpose of this regression analysis was to test wine quality. An evaluation like this would help assure quality for the wine market. We collected our data from the “Machine Learning Repository,” a data mining website. The data set records fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, density, pH, sulphate, and alcohol, which together help identify the quality of the wine.

The first step in our regression analysis was to use SAS to run stepwise and backward elimination procedures in order to remove any unneeded variables. The summary of these procedures indicated that pH, total sulfur dioxide, volatile acidity, density, alcohol, and sulphate could all be removed from the models we were comparing.

Once the unneeded variables were eliminated, three models were created and compared against one another to determine which was best.

Model one used color, fixed acidity, citric acid, residual sugar, and free sulfur dioxide:
u = 5.8255 + 0.2117x1 - 0.1104x2 + 1.4832x3 - 0.0597x4 + 0.0183x5

Model two used color, citric acid, residual sugar, and free sulfur dioxide:
u = 5.0404 + 0.3279x1 + 1.1687x2 - 0.0607x3 + 0.0183x4

Model three used citric acid, residual sugar, and free sulfur dioxide:
u = 4.9968 + 1.6035x1 - 0.0577x2 + 0.02188x3

Once the models were set up, we compared their t-values and p-values and found that model three had the best p-values and also the lowest variance inflation factors, so model three was chosen as the best model. After running model three (citric acid, residual sugar, and free sulfur dioxide) in SAS, the variance inflation factors showed no signs of multicollinearity. The next step was to run a complete regression analysis of model three. The residual by
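
The multicollinearity check described above can be illustrated with a short sketch. The essay ran its analysis in SAS; the Python below is not that procedure. It is only an assumed stand-in that computes variance inflation factors from the identity that the VIF of predictor j equals the j-th diagonal entry of the inverse of the predictors' correlation matrix. The function names are hypothetical, and the sample values are made up for illustration; they are not the wine data. A common rule of thumb treats VIF values above roughly 5 to 10 as a sign of multicollinearity, while values near 1 suggest little.

```python
from math import sqrt


def correlation_matrix(columns):
    """Pearson correlation matrix for a list of equal-length predictor columns."""
    n = len(columns[0])
    means = [sum(col) / n for col in columns]
    centered = [[v - m for v in col] for col, m in zip(columns, means)]
    norms = [sqrt(sum(v * v for v in col)) for col in centered]
    k = len(columns)
    return [
        [
            sum(a * b for a, b in zip(centered[i], centered[j])) / (norms[i] * norms[j])
            for j in range(k)
        ]
        for i in range(k)
    ]


def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination with partial pivoting."""
    k = len(matrix)
    # Augment the matrix with the identity; the right half becomes the inverse.
    aug = [
        list(row) + [1.0 if i == j else 0.0 for j in range(k)]
        for i, row in enumerate(matrix)
    ]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("predictors are perfectly collinear")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(k):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[k:] for row in aug]


def variance_inflation_factors(columns):
    """Return one VIF per predictor: the diagonal of the inverse correlation matrix."""
    inverse = invert(correlation_matrix(columns))
    return [inverse[j][j] for j in range(len(columns))]


# Made-up illustrative values, NOT taken from the wine data set.
citric_acid = [0.00, 0.04, 0.56, 0.06, 0.36, 0.08, 0.02, 0.52]
residual_sugar = [1.9, 2.6, 2.3, 1.8, 6.1, 2.0, 1.6, 3.8]
free_so2 = [11.0, 25.0, 15.0, 17.0, 13.0, 9.0, 16.0, 52.0]

vifs = variance_inflation_factors([citric_acid, residual_sugar, free_so2])
for name, vif in zip(["citric acid", "residual sugar", "free sulfur dioxide"], vifs):
    print(f"{name}: VIF = {vif:.2f}")
```

Passing the three model-three predictors as columns gives one VIF per variable, which is the quantity the essay compares across models. The inverse-correlation form was chosen here only because it keeps the sketch free of third-party libraries.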
