Unit 9: Statistics By: Jamie Fu and Neha Surapaneni.

Slides:



Advertisements
Similar presentations
UNIT 8: Statistical Measures
Advertisements

Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Introduction to Summary Statistics
Frequency Distribution and Variation Prepared by E.G. Gascon.
Math 20: Foundations FM20.6 Demonstrate an understanding of normal distribution, including standard deviation and z-scores. FM20.7 Demonstrate understanding.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
2-5 : Normal Distribution
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Analysis of Research Data
Data observation and Descriptive Statistics
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
12.3 – Measures of Dispersion
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics
Chapter 3 – Descriptive Statistics
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Methods for Describing Sets of Data
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
7.7 Statistics & Statistical Graphs p.445. What are measures of central tendency? How do you tell measures of central tendency apart? What is standard.
Statistics 1 Measures of central tendency and measures of spread.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Statistics Chapter 9. Day 1 Unusual Episode MS133 Final Exam Scores
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
Descriptive Statistics
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Skewness & Kurtosis: Reference
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
UTOPPS—Fall 2004 Teaching Statistics in Psychology.
Statistical Measures. Measures of Central Tendency O Sometimes it is convenient to have one number that describes a set of data. This number is called.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
MDM4U Chapter 3 Review Normal Distribution Mr. Lieff.
Chapter 9 Statistics.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
2 Kinds of Statistics: 1.Descriptive: listing and summarizing data in a practical and efficient way 2.Inferential: methods used to determine whether data.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Descriptive Statistics Review – Chapter 14. Data  Data – collection of numerical information  Frequency distribution – set of data with frequencies.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Lesson 25 Finding measures of central tendency and dispersion.
Statistics and Data Analysis
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
STATS DAY First a few review questions. Which of the following correlation coefficients would a statistician know, at first glance, is a mistake? A. 0.0.
Statistics 1: Statistical Measures
AP Biology Intro to Statistics
Statistics Principles of Engineering © 2012 Project Lead The Way, Inc.
Statistics Statistics Principles of EngineeringTM
Teaching Statistics in Psychology
Unit 4: A Brief Look at the World of Statistics
Ch. 18- Descriptive Statistics.
STATS DAY First a few review questions.
Statistics Statistics Principles of EngineeringTM
An Introduction to Statistics
Statistics Statistics- Inferential Statistics Descriptive Statistics
Statistics: The Interpretation of Data
Statistics Principles of Engineering © 2012 Project Lead The Way, Inc.
Presentation transcript:

Unit 9: Statistics By: Jamie Fu and Neha Surapaneni

Central Tendency ❖ Mean/average( ): sum of n numbers divided by n ❖ Median: The middle number of n numbers that are written in numerical order (If n is even, take the average of the middle two numbers) ❖ Mode: The number or numbers that occur most in a set of n numbers ❖ Range: The difference between the largest and smallest numbers of a set of n numbers ❖ Standard Deviation(σ): Describes the typical difference between a data value and mean N=number of values = mean X 1-n =each of the values of the set

Sample Problem You are competing in a hockey tournament. The winning scores for the first 10 games are: 11, 12, 13, 13, 14, 15, 15, 15, 15, 17 Find the mean, mode, median, range,standard deviation Mean: ( …+17)/10 = 14 Median: (14+15)/2= 14.5 Mode: 15 Range: = 6 Std. Dev.: σ= √[(11-14)^2 + (12-14)^2 …+ (17-14)^2]/10 ≈ 1.7

❖ X-axis is independent variable ❖ Y-axis is the dependent variable ❖ Correlation coefficient: How strong the relationship is between points Scatter plots

Box and Whisker plots ❖ Lower quartile=median between median and lowest value ❖ Upper quartile=median between median and highest value

Histograms ❖ The bars should be touching ❖ Bars should be the same width ❖ Width should represent a quantitative value not categorical ❖ The height indicates frequency

Sample Problem: Box & Whisker vs. Histogram The annual incomes for 6 professions are shown below: Farming: 19,630 Sales: 28,920 Architecture: 56,330 Professional Athlete: 2,476,590 Legal: 69,030 Teaching: 39, 130 Which graph will better represent the data? (histogram/box and whisker) Box and Whisker, due to the big outlier,the athlete Which graph will better represent the data when the outlier is removed? Histogram, because the data is closer together

Frequency Distribution ❖ Lower Class limit: lowest value in a class ❖ Upper Class Limit: Highest value in a class ❖ Class Mark: midpoint of a class ❖ Formula: (Lower class limit + upper class limit)/2 ❖ Class Width: the difference between the lower and upper class limits to the next highest whole number ❖ Formula: (high-low)/number of classes you want ❖ Class Boundaries: the endpoints of the bars in a histogram

Frequency tables ❖ Use frequency distribution to organize data ❖ Used in creating histograms Class Limits Class Boundaries Lower-Upper Tally Frequenc y Class Midpoint

❖ Sample mean: = [∑(x.f)]/n ❖ Sample Standard Deviation: ❖ Variance: sample std. dev. without the square root Sample Frequency Tables x= class mark n= # of entries f= frequency

∑=80 ∑=1290 ∑= Mean: 16.1 Standard Deviation:√119.9 Variance: Sample Frequency Tables Class LimitsMarks (class midpoint) Frequencyx✖fx✖f v ✖ f

Bell Curves ❖ Normal curve: A normal distribution is a frequency distribution where there are large number of values in a set of data: a symmetric, bell-shaped curve ❖ Middle value is the mean, which is 0 standard deviations away from the mean ❖ Probability: When randomly choosing a value from a set, the probability of choosing a number 1 S.D. away is 68% Mean

Choosing a Graph: Use...when... ❖ Scatter plot: Set of points to find correlation ❖ Box & Whisker: Outliers ❖ Histogram: numbers are close to each other, showing for frequency of #s in a certain range ❖ Bell Curve: Showing where majority of data is; finding probability of a certain # picked out of set

Data Classification Population: Group of people or objects you want info about Sample: Subset of population Types of Samples ❖ Self-selected sample: Members volunteer to be samples ❖ Systematic sample: A rule is used to select members ❖ Convenience sample: Easy-to-reach members ❖ Random sample: Each member has equal chance of selection Bias and Unbias ❖ Biased sample: Over or under represents parts of population ❖ Unbiased sample: Representative of population you want info on

Margin of Error ❖ Gives limit on how much the responses of your sample differ from responses from the whole population ❖ ±(1/√N) ❖ Percent of the population that responds in the same way as the sample responding a certain way(p) is likely to be between... ❖ P-(1/√N) and P+(1/√N)

Calculating Probability ❖ What is the probability that a number(x) will be ≥, ≤, +nσ (≤ and are the same) ❖ Example: P(x≥ +σ) ❖ The percent between +1 and +2 is 13.5% ❖ The percent between +2 and +3 is 2.35% ❖ After +3 is.15% ❖ Therefore, P(x≥ +σ) is =16% Mean

Calculating Probability Example: Normal distribution has a mean of 27 and a standard deviation of 5 What is P(22<x<32)? 22 is 27( )-5(σ) and 32 is 27+5 so it is one standard deviation away Therefore, the percentage should be in between -σ and +σ, which is 68% Mean

Calculating Probability using z- score ❖ The standard normal distribution is the normal distribution with mean 0 and standard deviation 1 ❖ The formula (x- )/σ can be used to transform x-values from a normal distribution with mean and standard deviation σ into z-values having a standard normal distribution with mean 0 and standard deviation 1 ❖ The z-value for a particular x-value is called the z-score for the x-value and is the number of standard deviations the x- value lies above or below the mean

Calculating Probability using z- score A normal distribution has a mean of 75 and a standard deviation of 10 Use standard normal table(find z-score first) to find P(x≤70) Answer: Use (x- )/σ to find z-score [ (70-75)/10 = -0.5 ] The z-score is -0.5; Choose row -0 and column.5 to form -0.5 The cross section shows the number.3085 which is 30.85% which is ≈31% The probability of P(x≤70) ≈31% When using z-score and it asks for P(x≥n), then take 1 MINUS the z-score

Standard Error of the Mean ❖ Standard Error( ) measures how well a sample mean estimates the true mean of a population ❖ Formula: Q: The mean weight of 36 boys on the wrestling team is 136.4lbs and the standard deviation is 4.1 what is the standard error of the mean? A: 4.1÷√36 = 4.1÷6 ≈.68 = 68%

Confidence Interval ❖ Lower Confidence Limit ❖ Upper Confidence Limit ❖ formula for confidence interval around pop, mean(μ): ❖ critical confidence value(z c )is found using this table

Confidence Interval Example Q: In a sample of 100 families a Tv reporter found that kids watch an avg. of 4.6hrs of TV a day. The σ is 1.4hrs. a) Find the ? 1.4÷√100 = 1.4÷10=.14 = 14% b) Find the 50% and 99% confidence intervals for the number of hours all kids in Texas watch T.V. 50%: 4.6±(.6745)(.14) = 4.51 ≤μ≤ %: 4.6±(.2.58)(.14) = 4.24 ≤μ≤4.96 c) Which interval has a wider range? 99%

Common Mistakes ❖ Finding the proper class limits: sometimes the formula doesn’t work ❖ Remember: when using z-score and the probability asks for ≥ a number, then you take 1 MINUS the z- score ❖ Deciding the formulas to use in real life situations ❖ Deciding on the classification of the data(which sample to use)

Terminology = mean σ = standard deviation f = frequency x = midpoint = standard error of the mean Z c = Critical confidence value N = number of participants, number of entries, etc. ∑ = summation, sum of all the numbers z = z-score, fractional part of data that lies in the interval ±tσ t = number of standard deviations away from the mean

Using a Graphing Calculator Correlation Coefficient: Stat, 1, 2nd, 0, D, diagnostics on, enter, enter. stat, calc, 4 (It is r) Central Tendency: Stat, 1, stat, calc Get numbers in order: stat, 1, 2nd, stat, ops, 1, 2nd, 1, stat, 1. Line of best fit: stat, 1, 2nd, y=, plot on, stat, calc, 4

Statistics Formulas Margin of Error: ± 1 /√N Class WIdth: (high - low) / # of classes Class Midpoint: (upper limit + lower limit) / 2 Mean of Frequency Distribution: [∑(x.f)]/n Sample standard deviation: Z-score: (x- )/σ Standard Error of the Mean: Confidence Interval: Standard Deviation mean