Math Stat Course: Making Incremental Changes
Mary Parker, University of Texas at Austin

Presentation transcript:

Intro to Mathematical Statistics (M378K at University of Texas)
Prerequisite: Probability course (which is required for all math majors).
Students: Math majors, actuarial students, other science majors.
Previous statistics courses? Some took an applied statistics course, either as a freshman course or after the probability course. Some didn’t.

Math Stat Topics
Sampling distributions of statistics.
Estimation of parameters: confidence intervals, method of moments estimation, maximum likelihood estimation, comparison of estimators using mean square error and efficiency, sufficient statistics.
Hypothesis tests: p-values, power, likelihood ratio tests.
Distributions used include normal, binomial, Poisson, uniform, gamma, beta, t, F, chi-squared, and other standard distributions.
Other topics as time permits.

Some students took this course: M358K Applied Statistics
New course in the last five years.
Prerequisite: Probability course.
Taken by math majors with a concentration in secondary school teaching or statistics, and by some others.
If they take both M358K and M378K, they are encouraged to take M358K first.
Introduction of this course has not decreased enrollment in M378K, and some students who take this new course and didn’t plan to take more statistics do go on to M378K.

Questions
MAIN: How can a teacher who doesn't have the time/inclination to completely revamp her course make incremental changes that will better prepare students to understand and use contemporary statistics techniques?
Preliminary:
  - What aspects of the reform of the first course are also appropriate for the math stat course?
  - What should we preserve in the current math stat course so that it continues to give mathematically sophisticated students a strong foundation in statistics?
  - What additional tools and techniques of theoretical statistics should be introduced at this level?
  - Within twenty years, when all students will be using the equivalent of a Mathematica-level program, what can/should we be teaching in theoretical statistics courses?

Incrementally changing Math Stat
Focus on assumptions throughout.
  - Check assumptions.
  - Mention alternative techniques if assumptions are not met.
  - Discuss robustness of methods.
  - Briefly introduce nonparametric statistics and Bayesian inference to illustrate different assumptions / frameworks.
Have students do explorations.

What explorations?
Main idea: Simulate and explore sampling distributions of various statistics. Use them to illustrate theoretical ideas and to check the robustness of procedures.
Preliminary idea 1: Have students create a complete sampling distribution themselves and check that its properties agree with the theoretical results.
Preliminary idea 2: Think of some interesting estimators to investigate. (See that there are more possible estimators for a parameter than the sample mean.)

Why explorations?
Explorations help make the theory concrete.
Robustness of statistical techniques: The concept seems strange to math students, and they appreciate tools to explore it on their own.

Simulate and explore a sampling distribution
1. The population is the number of potatoes in a 5-lb sack of potatoes from a certain company. Assume the counts follow a discrete uniform distribution from 12 potatoes to 18 potatoes. Choose a reasonable sampling method and construct the sampling distribution of the sample mean for samples of size [...].
2. Find the mean and variance of the population and then find the mean and variance of the sampling distribution.
3. Comment on the results, based on your theoretical understanding from the formulas we proved about the mean and variance of a sample mean.
4. Discuss what would be different for samples of size [...]. Investigate the sampling distribution of the sample range.
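
The enumeration this assignment asks for can be sketched in a few lines of Python. This is only an illustration, not the course's own solution: the sample size is missing from the slide text, so `N = 2` is an assumption, as is the choice to sample with replacement and to treat ordered samples as distinct outcomes. The function names are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

POPULATION = range(12, 19)  # potato counts 12..18, each equally likely
N = 2  # ASSUMED sample size; the value is missing from the slide text


def sampling_distribution(statistic, n=N):
    """Exact distribution of `statistic` over all ordered samples of size n,
    drawn with replacement (so every ordered sample is equally likely)."""
    samples = list(product(POPULATION, repeat=n))
    weight = Fraction(1, len(samples))
    dist = Counter()
    for s in samples:
        dist[statistic(s)] += weight
    return dict(dist)


def mean_and_variance(dist):
    """Mean and variance of a distribution given as {value: probability}."""
    mu = sum(x * p for x, p in dist.items())
    var = sum((x - mu) ** 2 * p for x, p in dist.items())
    return mu, var


def sample_mean(s):
    return Fraction(sum(s), len(s))


def sample_range(s):
    return max(s) - min(s)


pop_dist = {x: Fraction(1, 7) for x in POPULATION}
pop_mu, pop_var = mean_and_variance(pop_dist)

mean_dist = sampling_distribution(sample_mean)
xbar_mu, xbar_var = mean_and_variance(mean_dist)

range_dist = sampling_distribution(sample_range)

print("population mean, variance:", pop_mu, pop_var)
print("sample mean: mean, variance:", xbar_mu, xbar_var)
print("distribution of the sample range:", range_dist)
```

Under these assumptions the population variance is 4 and the variance of the sample mean is 2, which agrees with the formula σ²/n. The variances here use the probability weights directly, so no n versus n-1 choice arises.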

Strategy
Given very early in the semester.
Student groups of 2-3.
Grading and instructions encourage students to think about it over a couple of weeks without spending much time on it at first, because for many students this assignment is not as well-defined as it looks.

Difficulties often encountered
Should (13,14) be a different element of the sample space from (14,13)?
Should I sample with replacement or without replacement? Why?
When computing the standard deviations here, is the denominator n or n-1?

Extensions
Sampling without replacement: what changes?
What does that tell us about the language/formulas of our text? (independence of samples)
Where could we find the equivalent formulas to those in our text for sampling without replacement? What’s different?
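
One way to see what changes is to enumerate both sampling schemes and compare the variance of the sample mean. The sketch below is illustrative only and makes two assumptions that are not in the slides: the seven values 12..18 are treated as a finite population of seven sacks, and the sample size is 2. The comparison uses the standard finite population correction (N - n)/(N - 1).

```python
from fractions import Fraction
from itertools import permutations, product

VALUES = list(range(12, 19))  # ASSUMED finite population: one sack per value
n = 2                         # ASSUMED sample size


def var_of_sample_mean(samples):
    """Variance of the sample mean over a list of equally likely samples."""
    means = [Fraction(sum(s), len(s)) for s in samples]
    mu = sum(means) / len(means)
    return sum((m - mu) ** 2 for m in means) / len(means)


N = len(VALUES)
pop_mu = Fraction(sum(VALUES), N)
sigma2 = sum((x - pop_mu) ** 2 for x in VALUES) / N  # population variance

# With replacement: draws are independent, so Var = sigma^2 / n.
var_with = var_of_sample_mean(list(product(VALUES, repeat=n)))

# Without replacement: draws are dependent, so the variance shrinks.
var_without = var_of_sample_mean(list(permutations(VALUES, n)))

fpc = Fraction(N - n, N - 1)  # finite population correction

print("with replacement:   ", var_with, "vs sigma^2/n =", sigma2 / n)
print("without replacement:", var_without, "vs sigma^2/n * fpc =", sigma2 / n * fpc)
```

The textbook formula σ²/n matches only the with-replacement case, which is the point about independence of samples.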

Constructing various estimators
“German Tank Problem”
Assume German tanks had consecutive ID numbers from 001 to ???. We need to estimate the number of tanks in the population (the max ID in the population), based on the IDs from the sample of tanks we have captured.
In groups, think of at least three different reasonable estimators.
Then draw a sample of size 5 from my “population of German tank IDs” in the envelope. Give your three estimates.
Use a computer to simulate the three sampling distributions.

Strategy
Done in class before beginning to talk about estimation.
Usually students will use (1) two times the mean, (2) the maximum, and then, after a bit of time, will come up with something else.
Students will need help simulating the sampling distributions.
Again, arrange the timing/grading to encourage them to think about it and discuss it before spending a lot of time doing it.

Difficulties in simulating sampling distributions
How do you describe the original population to the computer? (Discrete uniform on 1 to 600, maybe)
Is it fairly easy to obtain a random sample from that distribution in your software? (If not, find other software!)
Distinguish between the sample size and the number of points from the sampling distribution.
What should you do with the sampling distribution?
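
As a rough sketch of such a simulation (the course itself uses MINITAB; Python is substituted here), the code below keeps the sample size and the number of simulated samples as separate constants. The population maximum of 600 comes from the slide's "maybe"; the number of replications, the seed, and the third estimator (the maximum plus the average gap) are assumptions chosen for illustration, and tanks are drawn without replacement.

```python
import random
import statistics

N_TANKS = 600          # population maximum, per the slide's "1 to 600, maybe"
SAMPLE_SIZE = 5        # tanks captured in each sample
REPLICATIONS = 10_000  # ASSUMED number of points in each sampling distribution


def twice_mean(ids):
    return 2 * statistics.mean(ids)


def sample_max(ids):
    return max(ids)


def adjusted_max(ids):
    # One possible "something else": the maximum plus the average gap.
    k = len(ids)
    return max(ids) * (1 + 1 / k) - 1


def simulate(estimator, rng):
    """Draw REPLICATIONS samples without replacement; return the estimates."""
    population = range(1, N_TANKS + 1)
    return [
        estimator(rng.sample(population, SAMPLE_SIZE))
        for _ in range(REPLICATIONS)
    ]


rng = random.Random(2024)  # fixed seed so the run is repeatable
results = {}
for est in (twice_mean, sample_max, adjusted_max):
    values = simulate(est, rng)
    results[est.__name__] = (statistics.mean(values), statistics.stdev(values))

for name, (avg, sd) in results.items():
    print(f"{name:>12}: mean = {avg:7.1f}, sd = {sd:6.1f}")
```

Comparing each estimator's simulated mean with 600 speaks to bias, and comparing the standard deviations speaks to efficiency.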

Looking at sampling distributions
What should you look at to summarize a sampling distribution? (histogram, summary statistics)
Is it close to normally distributed? (Discuss normal scores plots.)
(More advanced) Is it close to a __ distribution? (Make information about probability plots available in more generality.)
If the statistic is unbiased, what characteristic will the sampling distribution have? (If yours doesn’t have exactly the mean it’s supposed to, is that because you made an error? Why or why not?)
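
The last question can be explored numerically: a simulated sampling distribution of an unbiased statistic will rarely have exactly the theoretical mean, and the gap can be judged against its Monte Carlo standard error. The following is a minimal sketch, assuming the potato population (discrete uniform on 12..18) and hypothetical choices of sample size, number of replications, and seed.

```python
import math
import random
import statistics

rng = random.Random(7)  # fixed seed so the run is repeatable
SAMPLE_SIZE = 5         # ASSUMED sample size
REPLICATIONS = 5_000    # ASSUMED number of simulated samples
TRUE_MEAN = 15          # mean of the discrete uniform on 12..18

# Simulated sampling distribution of the sample mean.
sample_means = [
    statistics.mean([rng.randint(12, 18) for _ in range(SAMPLE_SIZE)])
    for _ in range(REPLICATIONS)
]

sim_mean = statistics.mean(sample_means)
# Standard error of the simulated mean: how far from TRUE_MEAN
# chance alone is expected to put it.
mc_se = statistics.stdev(sample_means) / math.sqrt(REPLICATIONS)
z = (sim_mean - TRUE_MEAN) / mc_se

print(f"simulated mean = {sim_mean:.4f}, theoretical mean = {TRUE_MEAN}")
print(f"Monte Carlo standard error = {mc_se:.4f}, z = {z:.2f}")
```

A difference of a couple of standard errors is what simulation noise alone produces. A difference of many standard errors suggests a mistake or a biased statistic.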

Focus on Assumptions
Checking assumptions for typical normal-theory techniques
  - Already discussed normal probability plots.
  - Discuss what types of deviations from assumptions cause problems for a particular technique and why.
  - In two-sample t procedures, help them see exactly why the equal variance assumption is more popular among theorists than among those working in applications.
Robustness
  - Central Limit Theorem. Explorations of various types of distributions: how large must n be?
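
One possible form for the "how large must n be?" exploration is to track the skewness of simulated sample means as n grows. The sketch below is an illustration under assumptions not taken from the slides: the parent distribution is exponential with rate 1, and the sample sizes, number of replications, and seed are arbitrary choices. For this parent the skewness of the sample mean is known to be 2/sqrt(n), which gives a reference value.

```python
import math
import random
import statistics


def skewness(values):
    """Moment-based sample skewness (third standardized moment)."""
    mu = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mu) / sd) ** 3 for v in values) / len(values)


def simulated_means(n, reps, rng):
    """Means of `reps` samples of size n from an exponential(1) population."""
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(reps)
    ]


rng = random.Random(11)  # fixed seed so the run is repeatable
REPS = 4_000             # ASSUMED number of simulated samples per n
skew_by_n = {}
for n in (2, 10, 50):    # ASSUMED sample sizes to compare
    skew_by_n[n] = skewness(simulated_means(n, REPS, rng))
    print(f"n = {n:3d}: simulated skewness = {skew_by_n[n]:.3f}, "
          f"theory 2/sqrt(n) = {2 / math.sqrt(n):.3f}")
```

Swapping `rng.expovariate` for another generator lets students compare how quickly different parent distributions approach normality.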

Focus on Assumptions II
Nonparametric techniques
  - Sign test, signed rank test, and rank-sum test.
  - Compare results with those from the t-test for some examples to further illustrate conditions for robustness of t-tests.
Bayesian statistics
  - Very brief introduction, contrasting assumptions of frequentist and Bayesian approaches.
  - Do examples from binomial or normal with conjugate priors and indicate that choosing the prior mean and variance gives quite a lot of flexibility.
  - Mention that using more general, non-conjugate priors leads to the need for more computationally-intensive methods.
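
For the binomial case, the conjugate-prior example can be sketched as follows. This is an illustration, not the course's own material: the helper names are hypothetical, and the prior mean, prior variance, and data (7 successes in 10 trials) are made up. The conversion uses the standard Beta(a, b) facts that the mean is a/(a+b) and the variance is mean(1 - mean)/(a+b+1).

```python
from fractions import Fraction


def beta_from_mean_variance(mean, variance):
    """Return (a, b) for the Beta prior with the given mean and variance."""
    if not 0 < mean < 1 or not 0 < variance < mean * (1 - mean):
        raise ValueError("need 0 < mean < 1 and 0 < variance < mean*(1-mean)")
    total = mean * (1 - mean) / variance - 1  # this is a + b
    return mean * total, (1 - mean) * total


def update(prior, successes, failures):
    """Conjugate update: Beta(a, b) prior plus binomial data."""
    a, b = prior
    return a + successes, b + failures


def beta_mean(params):
    a, b = params
    return a / (a + b)


# Prior mean 1/2 and variance 1/20 correspond to Beta(2, 2).
prior = beta_from_mean_variance(Fraction(1, 2), Fraction(1, 20))
posterior = update(prior, successes=7, failures=3)  # hypothetical data

print("prior (a, b):    ", prior)
print("posterior (a, b):", posterior)
print("posterior mean:  ", beta_mean(posterior))
```

Rerunning with a smaller prior variance shows the posterior mean pulled more strongly toward the prior mean, which is the flexibility the slide refers to.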

Actual assignments
Construct a sampling distribution
German tank problem
Simulating sampling distributions in MINITAB
Find the actual assignments and supporting material at the website listed on the handout for this session.