Athenax.ugent.be ; eerst citrix downloaden ; gebruik SPSS 19 Helpdesk.ugent.be
Les 1
1e opgave: zwembad Neptunus
1) Bezoek
2) Prijs
3) Sauna
4) Zonnen
5) Afstand
6) Cijfer
* Ingeven bij variabelen (Var)
* Dubbelklikken op Var: variabelen staan in rijen
* Bij measure:
* scale = continue variabele (zoals # bezoeken
* Ordinal en nominal = categorische variabele
* Label: beschrijving van de variabele
* Value: 1 komt overeen met label laag,…
* Doe da analyze; reports: case summaries: sleep “mening over prijs’ in variable; ok gemakkelijk om betekenis van variabele te gaan weergeven Crtl A : alles toevoegen.
* Frequentietabel maken: analyze; descriptive statistics; frequencies * Median berekenen van mening over prijs: terug naar frequencies en display uitschakelen; statistics klikken en median aanklikken * Met tabel: charts aanklikken…

* Beoordelingscijfer: er zit een fout in, nul = geen mening tonen aan SPSS dat dit een ontbrekende waarde is Opl.: missing: waarde aangeven die verkeerd is: hier 0
* Klassen voor afstand:
* =1 als afstand <= 500m
* =2 als 500m<afstand< 1500m
* =3 als afstand >= 1500m
* Maand erbij voegen (kijk welk oefn dit is)

Les 2
Analyse, descriptive, cross tabs, statistics chi square test

P waarde 0,021 : Ho verwerpen op 5% sign niveau
Niet in elke cel groter dan 5 enkel 4 cellen niet, dus vwen niet voldaan

Oefening 3
Gebruik T-test voor gemiddelden te vgl.
Controleren of beide groepen normaal verdeeld zijn!

Vink bij plots, normality plots with test
Groep1: te weinig info om normaliteit aan te tonen
Groep2: sig. = .68: H0 niet verwerpen

...STAT 600 Statistics and Quantitative Analysis
PROJECT: Stock return estimation
The project must be done by 6-15 a.m. October, 16th. You should submit your projects before the class begins. This is a group project. Read the course outline for general guidelines. Good luck!
The project is closely related to Lectures 1-5 of the class.
Today is September 15, 2013 and you have just started your new job with a financial planning firm. In addition to studying for all your license exams, you have been asked to review a portion of a client’s stock portfolio to determine the risk/return profiles of 12 stocks in the portfolio. Unfortunately, your small firm cannot afford the expensive databases that would provide all this information with a few simple keystrokes, but that’s why they hired you. Specifically, you have been asked to determine the monthly average returns and standard deviations for the 12 stocks for the past five years.
The stocks (with their symbols in parentheses) are:
Apple Computer (AAPL) Hershey (HSY)
Archer Daniels Midland (ADM) Motorola (MOT)
Boeing (BA) Procter and Gamble (PG)
Citigroup (C) Sirius XM radio (SIRI)
Caterpilar (CAT) Wal-Mart (WMT)
Deere&Co. (DE)...

...
MBA 501A – [STATISTICS]
ASSIGNMENT 4
INSTRUCTIONS: You are to work independently on this assignment. The total number of points possible is 50. Please note that point allocation varies per question. Use the Help feature in MINITAB 16 to read descriptions for the data sets so that you can make meaningful comments.
[10 pts] 1. Use the data set OPENHOUSE.MTW in the Student14 folder. Perform the Chi
Square test for independence to determine whether style of home and location are are related. Use α = 0.05. Explain your results.
Pearson Chi-Square = 37.159, DF = 3, P-Value = 0.000
Likelihood Ratio Chi-Square = 40.039, DF = 3, P-Value = 0.000
The P value associated with out chi square is 0.00 and the Alpha level is 0.05 so we reject the null hypothesis. The P- value is less than the alpha level. So, we conclude that style of homes and locations are not related.
[10 pts] 2. Use the data set TEMCO.MTW in the Student14 folder. Perform the Chi
Square test for independence to determine whether department and gender are related. Use α = 0.05. Explain your results.
Pearson Chi-Square = 1.005, DF = 3, P-Value = 0.800
Likelihood Ratio Chi-Square = 1.012, DF = 3, P-Value = 0.798
The P-value associated with out chi square is 0.800 and the Alpha level is 0.05 we can see that we are unable to reject the null hypothesis. The P- value is greater than the alpha level. So, we conclude that departments and gender are related..
[30 pts] 3. Use the data set...

...criterion we get the best model: y = -124.382 + 0.296X1 + 0.048X2 + 1.306X3 + 0.5198X4. This model contains all four predictor variables X1, X2, X3 and X4. This model is selected as best model by the MaxR criterion because it has the largest R-Square 0.9629, which is larger than 0.9615(model containing 3 variables), 0.9330(model containing 2 variables) and 0.8047(model containing 1 variable).
Below is a SAS output of the MaxR criterion.
Obviously, the “best” model obtained from MaxR criterion differs from that obtained from Stepwise and Backward Elimination Method. It is not hard to understand this phenomenon: Since for the Stepwise/Backward Elimination method, F-statistic plays an important role in selecting a variable: the F-statistic for a variable to be added must be significant at the SLENTRY level, the F-statistic for a variable to be removed must be significant at the SLSTAY level. While the MaxR method selects variables depending on which variable or variable combination can produce the largest R square. MaxR makes the switch that produces the largest increase in R square.
Appendix |
Code:
data job;
infile "C:\Users\sandra\Desktop\CH09PR10.txt";
input y x1 x2 x3 x4;
run;
proc reg data=job;
model y=x1 x2 x3 x4/selection=stepwise slstay=.10 slentry=.05;
title "Stepwise Selection";
run;
proc reg data=job;
model y=x1 x2 x3 x4/selection=adjrsq;
run;
proc reg data=job;
model y=x1 x2 x3...

...INTRODUCTION
A. Importance of Statistics
Statistical methods have been applied to problems ranging from business to medicine to agriculture. A review of the professional literature in almost any field will substantiate the extent of statistical analysis.
Accounting: Public accounting firms use statistical sampling procedures when conducting audits for their clients.
Economics: Economists use statistical information in making forecasts about the future of the economy or some aspect of it.
Marketing: Electronic point-of-sale scanners at retail checkout counters are used to collect data for a variety of marketing research applications.
Finance: Financial managers have routine contact with information in numerical form. Financial forecasts, break-even analyses, and investment decisions under uncertainty are but part of their activities.
Production: A variety of statistical quality control charts are used to monitor the output of a production process.
Statistics
the collection, organization, presentation, analysis, or interpretation of numerical data, especially as a branch of mathematics in which deductions are made on the assumption that the relationship between a sufficient sample of numerical data are characteristic of those between all such data.
it is a science which deals with the collection, organization, presentation, analysis, and interpretation of data.
B. Fields of Statistics
Descriptive...

...Lecture Notes on Introductory Statistics, I
(P.P. Leung)
Lecture notes are based on the following textbook:
N.A. Weiss (2012), Introductory Statistics, 9th edition, Pearson.
Chapter 1 The Nature of Statistics 統計本質
§1.1 Two kinds of Statistics
§1.4 Other Sampling Designs (其他抽樣方法)
Chapter 1 The Nature of Statistics 統計本質
What is Statistics? 何謂統計?
From Wikipedia, the free encyclopaedia:
Statistics is a mathematical science pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It is applicable to a wide variety of academic disciplines, from the natural and social sciences to the humanities. Statistics is also used for making informed decisions in government and business.
Statistical methods can be used to summarize or describe a collection of data; this is called descriptive statistics. In addition, patterns in the data may be modeled in a way that accounts for randomness and uncertainty in the observations, and then used to draw inferences about the process or population being studied; this is called inferential statistics. Both descriptive and inferential statistics comprise applied statistics. There is also a discipline called mathematical statistics, which is concerned with the theoretical basis of the subject....

...Descriptive Statistics and Probability Distribution Problem Sets
Emily Noah
QNT561
Anthony Matias
December 24, 2012
Descriptive Statistics and Probability Distribution Problems Sets
Descriptive statistics and probability distribution is two ways to find information with certain data giving. In Descriptive statistics the data can give a mode, mean, median, and range by the numerical information, which is giving to find the information. In probability distribution the data is collected and this is the way to determine the outcome of the information.
Descriptive Statistics
In descriptive statistics this is where the mean, mode, median, and range can be found of different number to find the center of the information and to find the information, which is not the center. The mean stands for the total of the number information added together and divided by the number of the numbers. For example, a student has a 86%, 96%, 85%, and 90%, so the first thing is to add the percentages together which will give the student 357%, so the third step is to divide 357% by four because there were four different percentages. So the total of the student’s grade will be 89.25%, but the teacher would round the grade to an 89%. Median is the middle numbers added together after the numbers are arrange in order after you divide it by the number of numbers that are in the middle. So using the same numbers...

...
Statistical Analysis
BU 510 601
2 Credit Hours
Fall 2013
Instructor: Shrikant Panwalkar Office phone: (410) 234 9456
Office Hours: By appointment panwalkar@jhu.edu
Required Text and Learning Materials
Business Statistics in Practice; 6th Edition, McGraw-Hill Higher Education,
ISBN-13 978-0-07-340183-6 (There are other ISBN numbers)
Authors: Bowerman, Bruce; O'Connell, Richard. (the cover shows a third author – Murphree)
Please note: 7th edition is available, however, we will NOT be using the 7th edition – please purchase the 6th edition
Additional learning material may be posted from time to time
Blackboard Site
A Blackboard course site is set up for this course. Each student is expected to check the site throughout the semester as Blackboard will be the primary venue for outside classroom communications between the instructors and the students. Students can access the course site at https://blackboard.jhu.edu. Support for Blackboard is available at 1-866-669-6138.
Course Evaluation
As a research and learning community, the Carey Business School is committed to continuous improvement. The faculty strongly encourages students to provide complete and honest feedback for this course. Please take this activity seriously because we depend on your feedback to help us improve so you and your colleagues will benefit. Information on how to complete the evaluation will be provided towards the end of the course....

...typically have? You take a random sample of 51 reduced-fat cookies and test them in a lab, finding a mean fat content of 4.2 grams. You calculate a 95% confidence interval and find that the margin of error is ±0.8 grams. A) You are 95% confident that the mean fat in reduced fat cookies is between 3.4 and 5 grams of fat. B) We are 95% confident that the mean fat in all cookies is between 3.4 and 5 grams. C) We are 95% sure that the average amount of fat in the cookies in this study was between 3.4 and 5 grams. D) 95% of reduced fat cookies have between 3.4 and 5 grams of fat. E) 95% of the cookies in the sample had between 3.4 and 5 grams of fat. Determine the margin of error in estimating the population parameter. 12) How tall is your average statistics classmate? To determine this, you measure the height of a random sample of 15 of your 100 fellow students, finding a 95% confidence interval for the mean height of 67.25 to 69.75 inches. A) 1.5 inches B) 0.25 inches C) 1.06 inches D) 1.25 inches E) Not enough information is given. 12) 11) 10)
3
Construct the indicated confidence interval for the difference between the two population means. Assume that the assumptions and conditions for inference have been met. 13) The table below gives information concerning the gasoline mileage for random samples of trucks of two different types. Find a 95% confidence interval for the difference in the means m X - m Y. Brand X Brand Y 50 50 20.1 24.3 2.3 1.8 13)...