Introduction to Statistics

Slides:

Advertisements

Similar presentations

Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.

Advertisements

Statistics: Purpose, Approach, Method. The Basic Approach The basic principle behind the use of statistical tests of significance can be stated as: Compare.

Sampling Distributions (§ )

Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.

Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.

Sampling Distributions

Inferences About Process Quality

Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.

One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.

STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.

Random Sampling, Point Estimation and Maximum Likelihood.

Chapter 6 Lecture 3 Sections: 6.4 – 6.5.

Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.

Estimating a Population Proportion

Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.

Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.

Revisiting Sampling Concepts. Population A population is all the possible members of a category Examples: the heights of every male or every female the.

Math b (Discrete) Random Variables, Binomial Distribution.

Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.

Chapter 10: Introduction to Statistical Inference.

Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.

Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.

Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.

Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.

Section 10.4: Hypothesis Tests for a Population Mean.

MATH Section 4.4.

Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.

Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.

Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.

Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.

Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.

Christopher, Anna, and Casey

Topic 8: Sampling Distributions

Experiments vs. Observational Studies vs. Surveys and Simulations

Supplemental Lecture Notes

STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample

Chapter 5 Joint Probability Distributions and Random Samples

Chapter 17 Sampling Distribution Models and the Central Limit Theorem

Sampling Distributions

Elementary Statistics

Chapter 5 Sampling Distributions

Elementary Statistics

Elementary Statistics

Chapter 5 Sampling Distributions

Chapter 5 Sampling Distributions

Probability and the Sampling Distribution

What does it mean to say that the results of an experiment are (or are not) statistically significant? The significance level,  (conventionally set to.

From Simulations to the Central Limit Theorem

Estimation Point Estimates Industrial Engineering

Sampling Distribution Models

Estimating a Population Proportion

ASV Chapters 1 - Sample Spaces and Probabilities

MATH 2311 Section 4.4.

Chapter 5 Sampling Distributions

From Simulations to the Central Limit Theorem

WARM – UP 1. Phrase a survey or experimental question in such a way that you would obtain a Proportional Response. 2. Phrase a survey or experimental.

Chapter 5 Sampling Distributions

CHAPTER 6 Statistical Inference & Hypothesis Testing

Sampling Distributions

Chapter 7: The Distribution of Sample Means

Continuous Probability Distributions

Sampling Distributions (§ )

Statistical Inference

Discrete Random Variables: Joint PMFs, Conditioning and Independence

Interval Estimation Download this presentation.

1/2555 สมศักดิ์ ศิวดำรงพงศ์

Daniela Stan Raicu School of CTI, DePaul University

Introductory Statistics

How Confident Are You?.

MATH 2311 Section 4.4.

Presentation transcript:

Introduction to Statistics Elan Ding Clemson University

What is statistics? Statistics answers questions about our world Data collection Data summarization (descriptive statistics) Data analysis (inferential statistics) Interpretation of results

Example Your friend claimed that among 1000 tosses of a fair coin, she obtained 535 heads. Is her claim plausible?

List of Topics 1. Descriptive Statistics 2. Probability Overview 3. Sampling Distributions 4. Hypothesis Testing 5. Common Statistical Tests

1. Descriptive Statistics Graphical Methods

Perceived Risk in Smoking

Comparative Bar Graph

Comparative Bar Graph

Life Insurance for Cartoon Character

Should doctors get auto insurance discount?

Stem-leaf plot

Math SAT score in 2005

Frequency Histogram

GPA report errors

GPA report errors

1. Descriptive Statistics Numerical Methods

Centrality

Variability

Quartiles

Boxplot

2003-2004 NBA Salaries

The Central Limit Theorem 2. Probability The Central Limit Theorem

Example Two players play a game until one player wins two games in a row. Identify the following: Experiment Outcome Event Sample Space

Probability Distribution

Probability Density

Probability Density

Mean of discrete random variable

Mean of continuous random variable

Variance of discrete random variable

Variance of continuous random variable

Binomial Distribution The probability of k success in n Bernoulli trials is: What does it look like? Web app

Normal Distribution

Standard Normal Distribution

Z-Table

Standardization

The Empirical Rule

Example Suppose 𝑋= the height of a randomly selected 5-year old child follows a normal distribution with 𝜇=100 cm and 𝜎=6 cm. What proportion of height is between 94 cm and 112 cm?

The Central Limit Theorem Web app

3. Sampling Distribution The Central Limit Theorem

Population vs sample

What is a statistic?

What is a sampling distribution? Suppose a random variable 𝑋 has a Bernoulli distribution. Such that 𝑋=1 with probability 0.4 and 𝑋=0 with probability 0.6.

What is a sampling distribution? Suppose we take a sample of 2 from the population, and call them 𝑋 1 and 𝑋 2 . Define the statistic 𝑇= 𝑋 1 + sin 𝑋 2 . What is the distribution of 𝑇? We draw 1000 random samples of size 2 and obtain an approximation:

Why sampling distribution? Populations parameters such as 𝜇 and 𝜎 2 are often unknown. Sample mean 𝑋 and sample variance 𝑆 2 are called the unbiased estimator for 𝜇 and 𝜎 2 : 𝑋 = 1 𝑛 𝑖=1 𝑛 𝑋 𝑖 𝑆 2 = 1 𝑛−1 𝑖=1 𝑛 ( 𝑋 𝑖 − 𝑋 )

Why is sampling distribution useful? The Central Limit Theorem! The most important statistic is 𝑋 , which is approximately normal when the sample size is large! The CLT can be safely applied when 𝑛 exceeds 30. Web app

Example Revisited Your friend claimed that among 1000 tosses of a fair coin, she obtained 535 heads. Is her claim plausible?

Solution

Solution

Solution

Application of sampling distribution 4. Hypothesis Testing Application of sampling distribution

Forming Hypothesis Let’s look at the previous problem in a different light. Suppose now your friend INDEED got 535 heads in 1000 tosses. Can you say something about whether the coin is fair or not? To do that we set up the following hypothesis:

Forming Hypothesis Let’s look at the previous problem in a different light. Suppose now your friend INDEED got 535 heads in 1000 tosses. Can you say something about whether the coin is fair or not? To do that we set up the following hypothesis:

SUPPOSE 𝐻 0 is true

5. Common Statistical Tests Application of hypothesis testing

1. Z-test for sample proportion Researchers at the University of Luton conducted a survey of 321 faculty members at a variety of academic institutions. It was reported that 36% of those surveyed said they occasionally used online searches with key words from student work to check for plagiarism. Assuming it is reasonable to regard this sample as representative of university faculty members, does the sample provide convincing evidence that more than one-third of faculty members occasionally use key word searches to check student work?

1. Z-test for sample proportion

1. Z-test for sample proportion

1. Z-test for sample proportion

1. Z-test for sample proportion

1. Z-test for sample proportion

𝐻 1 :𝑝≠ 𝑝 0 (two-tailed)

𝐻 1 :𝑝> 𝑝 0 (one-tailed)

2. Z-test for sample mean A study investigated whether time perception is impaired during nicotine withdrawal. After a 24-hr smoking abstinence, 20 smokers were asked to estimate how much time had passed during a 45-sec period. Suppose the resulting data on perceived elapsed time (in seconds) were as shown: Researchers want to know whether smoking abstinence can cause overestimation of elapsed time.

2. Z-test for sample mean

2. Z-test for sample mean

The t-distribution

t-table

Statistical Significance does not imply practical importance

3. t-test for two sample means (paired)

3. t-test for two sample means (paired)

4. t-test for two sample means (unpaired)

4. t-test for two sample means (unpaired)

Thank you! yirending@gmail.com