Preview

How to use dummy variables

Better Essays
Open Document
Open Document
1267 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
How to use dummy variables
How to Work with Dummy Independent Variables

Chapter 8 is devoted to dummy (independent) variables. This How To answers common questions on working with and interpreting dummy variables.

Questions:

1) How to include dummy variables in a regression?

2) How to interpret a coefficient on a dummy variable?

3) How to test hypotheses with dummy variables and interaction terms?

4) How to create a double-log functional form with dummy variables?

5) How to interpret a coefficient on a dummy variable with a log dependent variable?

1) How to include dummy variables in a regression?

Example:
You want to include Region of the United States in your earnings function regression. You obtain the variable GMREG from the Current Population Survey (CPS), and it has four possible values that the codebook maps to a region like this:

The data are sitting in an Excel file column like this:

Obviously, you CANNOT use GMREG directly in a regression.

To incorporate region as a dummy variable, follow these steps:

1) Create Number of Categories – 1 new variables. (In this example, 4 – 1 = 3 new variables)

In a new column, enter Northeast as the label for the variable.

Use an IF statement to create a 1 is GMREG is 1; otherwise a 0.
=IF(B2=1,1,0)
(where B2 has the value of GMREG for that observation)

Repeat for Midwest and South.

2) Include the Number of Categories – 1 variables in the regression.

The choice of which category to leave out (in this example, West) is totally arbitrary and has no effect on the final results. The actual coefficients of the regression equation do, of course, depend on the category left out (called the base case), but because you interpret a dummy variable coefficient relative to the base case, the predicted values end up the same. See Section 8.2 for more.
2) How to interpret a coefficient on a dummy variable?

For a single dummy variable without an interaction term, the value of the

You May Also Find These Documents Helpful

  • Good Essays

    Math203

    • 385 Words
    • 2 Pages

    Doucette wants to decide whether or not to put an employee retention program in place. But first, he wants Sarah Jenkins to check whether manager tenure and crew tenure are related to store profit. Accordingly, run the three regression models per instructions given below; data for these 3 models is in the worksheet labeled Data for Case A.…

    • 385 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    Exercise3statistics

    • 657 Words
    • 2 Pages

    3. Which variables in Table I are measurement at the interval/ratio level? Which ones are measured at the nominal level? Provide a rationale for your answer.…

    • 657 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Econ2206 Assignment

    • 457 Words
    • 2 Pages

    to find the answers to questions (iii)-(vii), where s are parameters to be estimated. In addition to answering the questions (i)-(vii), you are encouraged to comment on the adequacy of this model for analyzing the questions. You have access to a data set from a recent national health survey of Luckland, which can be regarded as a random sample. The data description is in the file “NHS.des” and data are in the file “NHS.raw”. Read “NHS.des” carefully and make sure that you understand the meaning of each variable in…

    • 457 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    lab 1

    • 636 Words
    • 3 Pages

    Before beginning, set up a data table similar to this Data Table 1. Fill in the names of the numbered structures.…

    • 636 Words
    • 3 Pages
    Powerful Essays
  • Good Essays

    c. a cross classification of data where categories of one variable are presented in rows…

    • 2113 Words
    • 14 Pages
    Good Essays
  • Satisfactory Essays

    Soci

    • 780 Words
    • 4 Pages

    2. Find the multiple regression equation. Interpret its meaning and the meaning of its slopes and constant.…

    • 780 Words
    • 4 Pages
    Satisfactory Essays
  • Good Essays

    Extraneous Variables

    • 477 Words
    • 2 Pages

    A well-designed experiment copes with the potential effects of extraneous variables by using random assignment to experimental conditions and sometimes also by incorporating direct control and/or blocking into the design of the experiment. Each of these strategies—random assignment, direct control, and blocking—is described as follows;…

    • 477 Words
    • 2 Pages
    Good Essays
  • Good Essays

    3. Identify the broad activity categories and create cost pools by assigning the costs from Table 2 to the pools.…

    • 436 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Historical Process

    • 1214 Words
    • 6 Pages

    Using what you learned in this unit, examine the sources provided to answer these questions:…

    • 1214 Words
    • 6 Pages
    Good Essays
  • Good Essays

    Basic Statistics

    • 907 Words
    • 4 Pages

    2) Definition of Variables For each variable, write a single definition paragraph talking about the variable. Paragraphs should be in this order: dependent variable, primary independent variable, and three independent variables.The primary independent variable for single mothers is whether an individual is raised in a single parent home. This independent variable mostly affects teenagers. “75% of teenage pregnancies are adolescents from single parent homes” (A Generation at Risk).…

    • 907 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    |Environment Category |[X] A [ ] B [ ] C [ ] FI [ ] TBD (to be determined) |…

    • 1450 Words
    • 6 Pages
    Powerful Essays
  • Good Essays

    (Hagan, 2012). Dependent variables are the outcome of a variable to predict outcomes of certain concepts of crime and recidivism. Dependent variables are usually the subject of one’s study. Independent or also known as predictor variable which have the concepts of causes, determines, or precedes in time of the dependent variable. Independent variable is usually a demographic variable or treatment. Theories are described as attempts to develop plausible explanation of reality. It is usually a broad and general statement that is regarding relationships between variables. Hypotheses are specific statements regarding the relationship between two variable and derived from more general theories. To approach research involved formulation of hypotheses, operationalization or measurement of variables and testing of bringing evidence to be brought upon…

    • 858 Words
    • 4 Pages
    Good Essays
  • Good Essays

    Lab: process design

    • 993 Words
    • 4 Pages

    3. You have the log file with you which has all the variable values. Use…

    • 993 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    Chapter 1 includes the following subtopics, namely: 1) Rationale; 2) Theoretical Framework; 3) Conceptual Framework/Paradigm; 4) Statement of the problem; 5) Hypothesis (Optional); 6) Assumption (Optional); 7) Scope and Delimitation; Importance of the study; 9) Definition of terms.…

    • 316 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Chapter 1 includes the following subtopics, namely: 1) Rationale; 2) Theoretical Framework; 3) Conceptual Framework/Paradigm; 4) Statement of the problem; 5) Hypothesis (Optional); 6) Assumption (Optional); 7) Scope and Delimitation; Importance of the study; 9) Definition of terms.…

    • 660 Words
    • 3 Pages
    Good Essays