Spreadsheet simulation and trial and error methods in statistics Michael Wood University of Portsmouth, UK

Slides:



Advertisements
Similar presentations
Chapter 12: Inference for Proportions BY: Lindsey Van Cleave.
Advertisements

Econ 140 Lecture 81 Classical Regression II Lecture 8.
Simple Linear Regression and Correlation
T-Tests.
t-Tests Overview of t-Tests How a t-Test Works How a t-Test Works Single-Sample t Single-Sample t Independent Samples t Independent Samples t Paired.
T-Tests.
Statistics for Business and Economics
Mean for sample of n=10 n = 10: t = 1.361df = 9Critical value = Conclusion: accept the null hypothesis; no difference between this sample.
Chapter 9 Hypothesis Testing 9.4 Testing a Hypothesis about a Population Proportion.
Final Review Session.
Correlation. Two variables: Which test? X Y Contingency analysis t-test Logistic regression Correlation Regression.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Simple Linear Regression Analysis
Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.
Correlation & Regression
4.1 Introducing Hypothesis Tests 4.2 Measuring significance with P-values Visit the Maths Study Centre 11am-5pm This presentation.
Chapter 13: Inference in Regression
Chapter 11 Simple Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Inference for 2 Proportions Mean and Standard Deviation.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Academic Research Academic Research Dr Kishor Bhanushali M
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
1 CHAPTER 4 CHAPTER 4 WHAT IS A CONFIDENCE INTERVAL? WHAT IS A CONFIDENCE INTERVAL? confidence interval A confidence interval estimates a population parameter.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Inference for Proportions Section Starter Do dogs who are house pets have higher cholesterol than dogs who live in a research clinic? A.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
1 Guess the Covered Word Goal 1 EOC Review 2 Scientific Method A process that guides the search for answers to a question.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Problems with statistical methods in management research … and a few solutions Presentation for the INFORMS Annual Meeting, Austin, USA, November 2010.
Review Statistical inference and test of significance.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
MATH Section 7.2.
Samples, Surveys, and Results UNIT SELF-TEST QUESTIONS
Inference for Two-Samples
CHAPTER 10 Comparing Two Populations or Groups
One-Sample Inference for Proportions
MATH 2311 Section 8.2.
Chapter 9 Hypothesis Testing
BA 275 Quantitative Business Methods
Simulation: Sensitivity, Bootstrap, and Power
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Inferences and Conclusions from Data
CHAPTER 10 Comparing Two Populations or Groups
SIMPLE LINEAR REGRESSION
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
AP Statistics Chapter 12 Notes.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

Spreadsheet simulation and trial and error methods in statistics Michael Wood University of Portsmouth, UK

Computer-intensive, “crunchy” methods Crunch out answers without dependence on sophisticated maths Rationale transparent Sometimes more robust in the sense of no dependence on unrealistic assumptions Often more general—can tackle problems with no convenient formula-based method Detailed step through for beginners Very brief overview of three possibilities

An aside … I am fairly critical of many applications of statistics: not the subject of this talk except that transparency makes problems more obvious

Regression and least squares models Spreadsheet to calculate MSE (mean square error), then use Solver to find parameters for least squares model Identical answers to standard formulae Obvious what’s going on and can be modified if required Single variable: pred1var.xls Multiple regression: predmvar.xls Can easily adjust method—e.g. ExerciseCurve.xls

Test of null hypothesis that two variables are unrelated Randomization test: Spreadsheet simulates no relationship hypothesis Obvious what’s going on with no technical statistical concepts Assumptions less restrictive and more obvious than t-test, etc Flexible – test difference between two means or two proportions, or correlation, etc Difference of two means: diffofmeanstest.xls General spreadsheet: resamplenrh.xls

Bootstrap confidence intervals One method for many different statistics –Use sample to set up a “guessed” population –Experiment drawing samples from guessed population to assess sampling error (resampling with replacement) Obvious what’s going on and when it’s not sensible! Confidence intervals are a subtle concept: simple bootstrapping avoids the mathematical problems but not the conceptual ones Many more complex methods - not simple!

Conclusion Active learning in that learners don’t have to take formulae on trust, but can act out methods and see how they work. And can often adapt to new problems.

References and website All spreadsheets files mentioned at (These all have a Read this sheet for a brief explanation) Approach explained in more detail in Wood, M (2003), Making sense of statistics: a non- mathemical approach, Basingstoke, UK: Palgrave.