Top-Rated Free Essay
Preview

STATS TCO 1

Powerful Essays
1632 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
STATS TCO 1
STATISTICS
CHAPTER 1 NOTES

DATA: consists of information coming from observations, counts, measurements, or responses.

STATISTICS: is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.

DATA SETS:
-POPULATION: is the collection of all outcomes, responses, measurements or counts that are of interest -SAMPLE: is a subset or part of a population
EXAMPLE OF POPULATION:
The age of each resident in an apartment building

EXAMPLE OF SAMPLE:
The temperature in 4 state capitals out of 50

PARAMETER: is a numerical description of a population characteristic
EXAMPLE OF PARAMETER: the 2182 students who accepted admission offers to Northwestern University in 2008 have an average SAT score of 1442

STATISTIC: is a numerical description of a sample characteristic
EXAMPLE OF STATISTIC: a study of 6076 adults in public restrooms found that 23% did not wash their hands before exiting.

BRANCHES OF STATISTICS -DESCRIPTIVE STATISTICS: is the branch of statistics that involves the organization, summarization, and display of data -INFERENTIAL STATISTICS: is the branch of statistics that involves using a sample to draw conclusions about a population. A basic tool in the study of inferential statistics is probability

TYPES OF DATA -QUALITATIVE DATA: consists of attributes, labels, or nonnumeric entries -QUANTIATIVE DATA: consists of numerical measurements or counts
EXAMPLE OF QUALITATIVE DATA: favorite musical band
EXAMPLE OF QUANTITATIV DATA: number of flights leaving an airport each year

LEVELS OF MEASUREMENT (LOW TO HIGH) -NOMINAL: qualitative data only. Data at this level are categorized using names, labels or quantities. No mathematical computations can be made at this level
EXAMPLE OF NOMINAL: social security numbers, numbers on sports jerseys. (it would not make sense to add sport jerseys together for the Chicago bulls team)

ORDINAL: are qualitative or quantitative. Data at this level can be arranged in order, or ranked, but differences between entries are not meaningful.
EXAMPLE OF ORDINAL: top 5 TV programs: 1. American idol on tuesday 2. American idol on wednesday 3. Dancing with the stars 4. NCIS 5. The mentalist

INTERVAL: data this level can be ordered, and meaningful differences between data entries can be calculated. At this level, a zero entry simply represents a position on a scale, the entry is not an inherent zero.
EXAMPLE OF INTERVAL: temperature outside is 0 celcius. A position on the Celsius scale.
2 degrees is not twice as warm as 1 degree.

RATIO: similar to interval with the added property that a zero entry is an inherent zero. A ratio of two data values can be formed so that the one data value can be meaningful expressed as a multiple of another. (an inherent zero implies “none”)
EXAMPLE OF RATIO: you have zero dollars in your account, meaning you have NO money.
$2.00 is twice as $1.00

LEVEL OF MEASUREMENT
PUT DATA IN CATEGORIES
ARRANGE DATA IN ORDER
SUBTRACT DATA VALUES
DETERMINE IF ONE DATA VALUE IS A MULTIPLE OF ANOTHER
NOMINAL
YES
NO
NO
NO
ORDINAL
YES
YES
NO
NO
INTERVAL
YES
YES
YES
NO
RATIO
YES
YES
YES
YES

LEVEL
EXAMPLE OF A DATA SET
MEANINGFUL CALCULATIONS
NOMINAL LEVEL (QUALITATIVE DATA)
TYPES OF SHOWS TELEVISED BY A NETWORK
COMEDY SPORTS
DRAMA COOKING
REALITY SHOWS SOAPS
DOCUMENTARIES TALK SHOWS
PUT IN A CATEGORY
ORDINAL LEVEL (QUALITATIVE OR QUANTITATIVE DATA)
MOTION PICTURE ASSOCIATION OF AMERICA RATINGS DESCRIPTION
G = GENERAL AUDIENCES
PG = PARENTAL GUIDENACE SUGGESTED
PG-13 PARENTS STRONGLY CAUTIONED
R = RESTRICTED
NC-17 NOBODY UNDER 17 YEARS OF AGE
PUT IN A CATEGORY AND PUT IN ORDER. FOR INSTANCE PG RATING HAS A STRONGER RESTRICTION THAN A G RATING.
INTERVAL LEVEL (QUANTIATIVE DATA)
AVERAGE MONTHLY TEMPS
IN DENVER, COLORADO
JAN 29.2 JULY 73.4
FEB 33.2 AUG 71.7
MAR 39.6 SEP 62.4
APRIL 47.6 OCT 51
MAY 57.2 NOV 37.5
JUN 67.6 DEC 30.3
PUT IN A CATEGORY, PUT IN ORDER, AND FIND DIFFERENCES BETWEEN VALUES. MAY IS WARMER THAN APRIL
RATIO LEVEL (QUANTITATIVE DATA)
AVERAGE MONTHLY PRECIPITATION IN INCHES FOR ORLANDO FLORIDA
PUT IN A CATEGORY, PUT IN ORDER, FIND DIFFERENCES BETWEEN VALUES AND RATIOS
DATA COLLECTIONS AND EXPERIMENTAL DESIGN
-DESIGNING A STATISTICAL STUDY 1. IDENTIFY THE VARIABLE(S) OF INTEREST (THE FOCUS) AND THE POPULATION OF STUDY
2. DEVELOP A DETAILED PLAN FOR COLLECTING DATA. IF YOU USE A SAMPLE MAKE SURE THE SAMPLE IS REPRESENTTIVE OF THE POPULATION
3. COLLECT THE DATA
4. DESCRIBE THE DATA, USING DESCRIPTIVE STATISTICS TECHNIQUES
5. INTERPRET THE DATA AND MAKE DECISIONS ABOUT THE POPULATION USING INFERENTIAL STATISTICS
6. IDENTIFY ANY POSSIBLE ERRORS
-DATA COLLECTION -OBERSERVATIONAL STUDY -PERFORM AN EXPERIMENT -SIMULATION -SURVEY
DATA COLLECTION METHOD
EXAMPLE
OBSERVATIONAL
A STUDY OF HOW 4TH GRADE STUDENTS SOLVE A PUZZLE
EXPERIMENTAL STUDY
A STUDY OF THE EFFECT OF EATING OATMEAL ON LOWERING BLOOD PRESSURE
SIMULATION STUDY
A STUDY OF THE EFFECT OF CHANGING FLIGHT PATTERNS ON THE NUMBER OF AIRPLANE ACCIDENTS
SURVEY STUDY
A STUY OF U.S. RESIDENTS APPROVAL RATING OF THE U.S. PRESIDENT

-EXPERIMENTAL DESIGN -THREE KEY ELEMENTS OF A WELL-DESIGNED EXPERIEMNT ARE CONTROL, RANDOMIZATION, AND REPLICATION
-CONFOUNDING VARIABLE: occurs when an experimenter cannot tell the difference between the effects of different factors on a variable
-PLACEBO EFFECT: when a subject reacts favorably to a placebo when in fact the subject has been given no medicated treatment at all.
-BLINDING: technique where subjects do not know whether they are receiving a treatment or a placebo
-DOUBLE-BLIND EXPERIMENT: neither the subject or experimenter know who has taken the placebo and who has taken the treatment (this is preferred)
-RANDOMIZATION: a process of randomly assigning subjects to different treatment groups
-COMPLETELY RANDOMIZED DESIGN: subjects are assigned to different treatment groups through random selection
-RANDOMIZED BLOCK DESIGN: divide subjects with similar characteristics into blocks, and then, within each block, randomly assign subjects to treatment groups
-MATCHED PAIR DESIGN: where subjects are paired up according to a similarity. One subject in the pair receives one treatment while the other receives another.
-SAMPLE SIZE: which is the number of subjects, is another important part of experimental design. To improve validity of experiment results, replication is required
-REPLICATION: is the repetition of an experiment under the same or similar conditions

SAMPLING TECHNIQUES -CENSUS: is a count or measure of an entire population -SAMPLING: is a count or measure of part of a population
-SAMPLE ERROR: is the difference between the results of a sample and the results of the population
-RANDOM SAMPLE: is one in which every member of the population has an equal chance of being selected
-SIMPLE RANDOM SAMPLE: is a sample in which every possible sample of the same size has the same chance of being selected.
-STRATIFIED SAMPLE: when it is important to have members from each segment of the population
-CLUSTER SAMPLE: when the population falls into naturally occurring subgroups, each having a similar characteristics. Divide the population into clusters
-SYSTEMATIC SAMPLE: in which every member of the population is assigned a number. The membres of the population are ordered in some way, a starting number is selected, and then sample members are selected at regular intervals from the starting number.
-CONVIENCE SAMPLE: consist of only available members of the population

CHAPTER 2 NOTES
FREQUENCY DISTRIBUTIONS AND THEIR GRAPHS
FREQUENCY DISTRIBUTION: is a table that shows classes or intervals of data entries with a count of the number of entries in each class.
EXAMPLE:
CLASS
FREQUENCY f
1-5
5
6-10
8
11-15
6
16-20
8
21-25
5
26-30
4

Upper class limit: 5, 10 15, 20 25, and 30
Lower class limit: 1, 6, 11, 16, 21, and 26
Class width: distance between upper or lower class limits of consecutive classes 6-1=5
Range: difference between max and minimum data entries

CONSTRUCTING A FREQUENCY DISTRIBUTION FROM A DATA SET
1. DECIDE ON THE NUMBER OF CLASSES. SHOULD BE BETWEEN 5 – 20
2. FIND THE CLASS WIDTH AS FOLLOWS. DETERMINE THE RANGE OF DATA, DIVIDE THE RANGE BY THE NUMBER OF CLASSES, AND ROUND UP TO THE NEXT CONVIENT NUMNER
3. FIND THE CLASS LIMITS
4. MAKE A TALLY MARK FOR EACH DATA ENTRY IN THE ROW OF THE APPROPRIATE CLASS
5. COUNT THE TALLY MAKRS TO FIND THE FREQUENCY

EXAMPLE: THE FOLLOWING LISTS THE PRICES IN DOLLARS OF 30 PORTABLE GLOBAL POSITIONING SYSTEM (GPS) NAVIGATORS. CONSTRUCT A FREQUENCY DISTRIBUTION TABLE THAT HAS SEVEN CLASSES

90 130 400 200 350 70 325 250 150 250
275 270 150 130 59 200 160 450 300 130
220 100 200 400 200 250 95 180 170 150

MINIMUM DATA ENTRY = 59
MAXIMUM DATA ENTRY = 450
RANGE= 450 – 59 = 391
DIVIDE RANGE BY NUMBER OF CLASSES 391/7 = 55.86 ROUND UP TO 56
Ef= sum of frequencies (30)
59 + 56 = 115 -1 = 114
59-114 FOR THE FIRST CLASS

CLASS
F
59-114
5 IIIII
115-170
8 IIIIIIII
171-226
6 IIIIII
227-282
5 IIIII
283-338
2 II
339-394
1 I
395-450
3 III
Ef = sum of frequencies (tallys)
Midpoint = Lower class limit + upper class limit divided by 2
Relative Frequency = Class frequency divided by sample size (n) f/n
Cumulative frequency = sum of frequencies of that class and all previous classes. The cumulative frequency of the last class is equal to the sample size n

FREQUENC HISTOGRAM
FRQUENCY HISTORGRAM: is a bar graph that represents the frequency distribution of a data set.
-A historgram as the following properties: 1. The horizontal scale is quantitative and measures the data values 2. the vertical scale measures the frequency of the classes 3. Consecutive bars must touch
-because consecutive bars must touch, bars must begin and end at class boundaries instead of class limits. CLASS BOUNDARIES are the numbers that separate classes without forming gaps between them.
-if data entries are integers, subtract 0.5 from each lower limit to find the lower class boundaries. To find the upper class boundaries add 0.5 to each upper limit. The upper boundary limit of a class will equal the lower boundary of the next higher class

RELATIVE FREQUENCY HISTORGRAM
-HAS THE SAME SHAPE AND THE SAME HORIZONTAL SCALE AS THE CORRESPONDING FREQUENCY HISTORGRAM. THE DIFFERENCE IS THAT THE VERTICAL SCALE MEASURES RELATIVE FREQUENCIES, NOT FREQUENCIES.

GRAPHING QUANTIATIVE DATA SETS
-NEWER WAY TO DISPLAY QUANTITATIVE DATA SETS: STEM AND LEAF PLOT
-In a stem and leaf plot, each number is sepeated by a stem and a leaf. You should have as many leaves as there are entries in the original data set and the leaves should be single digits

MEASURE OF CENTRAL TENDENCY
-MEAN: SUM OF ENTRIES DIVIDED BY NUMBER OF ENTRIES
-MEDIAN: FIRST ORDER THE DATA, THE MEDIAN IS THE MIDDLE NUMBER.
-MODE: NUMBER THAT REPEATS THE MOST. ORDERING DATA HELPS

You May Also Find These Documents Helpful

  • Satisfactory Essays

    ITT Tech MA3110 Vocab 1

    • 539 Words
    • 3 Pages

    Statistics – the science of planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on the data.…

    • 539 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    My description of Descriptive statistics is that they are the numerical elements that make up a data that can refer to an amount of a categorized description of an item such as the percentage that asks the question, “How many or how much does it take to “ and the outcome…

    • 729 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Hcs/438 Dq's

    • 1323 Words
    • 6 Pages

    The nominal level of measurement is the simplest level of variables such as hair color or gender.…

    • 1323 Words
    • 6 Pages
    Good Essays
  • Good Essays

    QNT 351 week 1 paper

    • 650 Words
    • 2 Pages

    There are two major types of statistics, descriptive and inferential. Descriptive statistics is defined by Lind (2011) as “methods of organizing, summarizing, and presenting data in an informative way” (p.6). An example of descriptive statistics would be a high school report showing that it had 300 graduates in 1990 and 450 graduates on 1991. The information that they provided described the amount of graduates that they had for each year. Inferential statistics is defined by Lind (2011) as “the methods used to estimate a property of a population on the basis of a sample” (p.7). If the same high school sent out a report showing the graduate numbers for 1999- the present to estimate the number of graduates that they would have for this school year, those statistics would be inferential because they are used to estimate future outcomes.…

    • 650 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    c. Nominal: Variable with values that are categories (that is, they are names rather than numbers). Also called categorical variables (Aron, 2013).…

    • 1224 Words
    • 5 Pages
    Powerful Essays
  • Good Essays

    Quiz 2

    • 1178 Words
    • 4 Pages

    The nominal level of measurement is defined by the text as the characteristics of an outcome that fits into one and only one class or category or by their names solely. An example of this would be polling people on their political affiliation. You can measure yourself as only one affiliation, (Republican, Democrat, or Independent), but not more than one.…

    • 1178 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    Descriptive statistics are used to organize and describe the characteristics of a collection of data. Inferential statistics are used to make inferences from a smaller group of data. Descriptive statistics are used primaraly for larger groups and used to collect more amounts of information while inferential is used more for a smaller group. Less information because there is less people and or information that is being refered to. Inferential statistics bring information which is often called a sample. This is a portion or a subset of a population.…

    • 385 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Identify the type of data (quantitative - discrete, quantitative - continuous, or qualitative) that would be used to describe a response.…

    • 673 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    research studies use two different categories of statistics to analyze the data collected: descriptive and inferential. Descriptive statistics are simply numerical or graphical summaries of data, and may include charts, graphs, and simple summary statistics such as means and standard deviations to describe characteristics of a population…

    • 1951 Words
    • 8 Pages
    Powerful Essays
  • Powerful Essays

    Stats Chapter 1 Notes

    • 3943 Words
    • 16 Pages

    Population: any complete collection of individuals (people, animals, plants or things) from which we may…

    • 3943 Words
    • 16 Pages
    Powerful Essays
  • Good Essays

    Week 2 Paper

    • 887 Words
    • 4 Pages

    According to Bennett, Briggs & Triola (2009), descriptive statistics transforms data into a picture of information that is readily understandable using measures such as mean, median, mode, variation and standard deviation. Inferential statistics help researchers decide whether their outcomes are a result of factors planned within design of the study or determined by chance referencing probability values (P) to indicate significance of the change in results (Bennett, Briggs & Triola, 2009). “The two approaches are often used sequentially in that first, data are described with descriptive statistics, and then additional statistical manipulations are done to make inferences about the likelihood that the outcome was due to chance through inferential statistics” (Streiner & Norman, 1996).…

    • 887 Words
    • 4 Pages
    Good Essays
  • Good Essays

    TEAM REFLECTION WEEK 4

    • 707 Words
    • 3 Pages

    Statistics refers to the use of numerical information in everyday life to calculate facts and figures in limitless circumstances. In addition, statistics refers to the scientific collecting, classifying, summarizing, organizing, analyzing, and interpreting numerical data. The steps in testing a research hypothesis, to compare the means of two or more groups, and to calculate the correlation between two variables.…

    • 707 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    7. Business intelligence refers to collecting, storing, accessing, and analyzing data on the company's operations in order to make better business decisions.…

    • 944 Words
    • 4 Pages
    Powerful Essays
  • Satisfactory Essays

    qms 102 test banks

    • 1801 Words
    • 8 Pages

    items into separate categories or groups. These are often attributes or nonnumerical categories. Because these are non-numerical items the only statistics…

    • 1801 Words
    • 8 Pages
    Satisfactory Essays
  • Powerful Essays

    Churn in Telecom Sector

    • 2887 Words
    • 12 Pages

    Nominal A variable can be treated as nominal when its values represent categories with no intrinsic ranking (for example, the…

    • 2887 Words
    • 12 Pages
    Powerful Essays