Published by Rafe Lyons. Modified over 9 years ago.
1
1 - 1 Statistics: An Introduction
2
1 - 2 Learning Objectives
1. Define Statistics
2. Describe the Uses of Statistics
3. Distinguish Descriptive & Inferential Statistics
4. Define Population, Sample, Parameter, & Statistic
5. Identify Data Types
3
1 - 3 What is Statistics?
The practice (science?) of data analysis
Summarizing data and drawing inferences about the larger population from which the data were drawn
4
1 - 4 Statistical Methods
   Descriptive Statistics
   Inferential Statistics
5
1 - 5 Descriptive Statistics
1. Involves
   Collecting Data
   Presenting Data
   Characterizing Data
2. Purpose
   Describe Data
[Figure: chart by quarter (Q1 to Q4, $ axis from 0 to 50), annotated with \( \bar{X} = 30.5 \) and \( S^2 = 113 \)]
6
1 - 6 Inferential Statistics
1. Involves
   Estimation
   Hypothesis Testing
2. Purpose
   Make Decisions About Population Based on Sample Characteristics
[Figure label: Population?]
7
1 - 7 Key Terms
1. Population (Universe): All Items of Interest
2. Sample: Portion of Population
3. Parameter: Summary Measure about Population
4. Statistic: Summary Measure about Sample
Mnemonic: P in Population & Parameter; S in Sample & Statistic
8
1 - 8 Data Types
Quantitative
   Discrete
   Continuous
Qualitative
   Nominal (categorical)
   Ordinal (rank ordered categories)
9
1 - 9 Sampling
Representative sample
   Same characteristics as the population
Random sample
   Every subset of a given size has an equal chance of being selected
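The key terms and the sampling idea can be tried out in a few lines. This is a minimal sketch, not part of the original slides: the population (the integers 1 to 100), the sample size of 10, and the seed are all made-up assumptions chosen only for illustration.

```python
import random
import statistics

# Hypothetical population: the integers 1..100 (illustrative only, not from the slides).
population = list(range(1, 101))

# Parameter: a summary measure computed from the whole population.
population_mean = statistics.mean(population)

# Simple random sample of size 10, drawn without replacement, so every
# subset of size 10 is equally likely. The fixed seed only makes the
# example repeatable.
rng = random.Random(0)
sample = rng.sample(population, 10)

# Statistic: the same summary measure computed from the sample only.
sample_mean = statistics.mean(sample)

print("parameter (population mean):", population_mean)
print("statistic (sample mean):    ", sample_mean)
```

The statistic will usually differ from the parameter; inferential statistics is about how much can be concluded despite that gap.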
10
1 - 10 Review
Descriptive vs. Inferential Statistics
Vocabulary
   Population
   (Random, representative) sample
   Parameter
   Statistic
Data types
11
1 - 11 Methods for Describing Data
12
1 - 12 Learning Objectives
1. Describe Qualitative Data Graphically
2. Describe Numerical Data Graphically
3. Create & Interpret Graphical Displays
4. Explain Numerical Data Properties
5. Describe Summary Measures
6. Analyze Numerical Data Using Summary Measures
13
1 - 13 Data Presentation
14
1 - 14 Presenting Qualitative Data
15
1 - 15 Data Presentation
16
1 - 16 Student Specializations
Specialization |      Freq.     Percent        Cum.
---------------+-----------------------------------
           HCI |          9       39.13       39.13
          IEMP |          9       39.13       78.26
           LIS |          3       13.04       91.30
     Undecided |          2        8.70      100.00
---------------+-----------------------------------
         Total |         23      100.00
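A frequency table like this one can be rebuilt from raw responses. The sketch below is one possible way to do it: the list of responses is reconstructed from the counts on the slide, and the function name `frequency_table` is a hypothetical helper, not something from the original course materials.

```python
from collections import Counter

# Raw responses reconstructed from the slide's counts
# (9 HCI, 9 IEMP, 3 LIS, 2 Undecided).
responses = ["HCI"] * 9 + ["IEMP"] * 9 + ["LIS"] * 3 + ["Undecided"] * 2

def frequency_table(values):
    """Return rows of (category, frequency, percent, cumulative percent)."""
    counts = Counter(values)
    total = sum(counts.values())
    rows = []
    running = 0
    for category in sorted(counts):
        running += counts[category]
        rows.append((category,
                     counts[category],
                     round(100 * counts[category] / total, 2),
                     round(100 * running / total, 2)))
    return rows

table = frequency_table(responses)
for category, freq, pct, cum in table:
    print(f"{category:>14} | {freq:5d} {pct:8.2f} {cum:8.2f}")
```

The cumulative column is computed from the running count rather than by adding rounded percentages, which avoids accumulating rounding error.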
17
1 - 17 Student Specializations
18
1 - 18 Undergrad Majors
                 UG major |      Freq.     Percent        Cum.
--------------------------+-----------------------------------
         American Studies |          1        4.76        4.76
                  Cog Sci |          1        4.76        9.52
                 Comp Sci |          3       14.29       23.81
                Economics |          3       14.29       38.10
                  English |          5       23.81       61.90
Environmental Engineering |          1        4.76       66.67
           Graphic Design |          1        4.76       71.43
                     Math |          2        9.52       80.95
   Mechanical Engineering |          1        4.76       85.71
                Nutrition |          1        4.76       90.48
      Sci and Tech Policy |          1        4.76       95.24
       Telecommunications |          1        4.76      100.00
--------------------------+-----------------------------------
                    Total |         21      100.00
19
1 - 19 Favorite Colors
      color |      Freq.     Percent        Cum.
------------+-----------------------------------
      black |          2        8.70        8.70
       blue |         12       52.17       60.87
      green |          1        4.35       65.22
     orange |          1        4.35       69.57
     purple |          1        4.35       73.91
        red |          5       21.74       95.65
      white |          1        4.35      100.00
------------+-----------------------------------
      Total |         23      100.00
20
1 - 20 Calculus Knowledge
  integrals |      Freq.     Percent        Cum.
------------+-----------------------------------
          1 |          3       13.04       13.04
          2 |          1        4.35       17.39
          3 |         11       47.83       65.22
          4 |          6       26.09       91.30
          5 |          2        8.70      100.00
------------+-----------------------------------
      Total |         23      100.00
21
1 - 21 Presenting Numerical Data
22
1 - 22 Data Presentation
23
1 - 23 Student Age (Reported) Data
Stem-and-leaf plot for age
2* | 22233444555777899
3* | 01257
4* |
5* |
6* |
7* | 6
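The stem-and-leaf display can be generated from the raw ages. The following is a rough sketch: the `ages` list is read back off the plot on this slide, and `stem_and_leaf` is a hypothetical helper that assumes two-digit whole-number values (tens digit as stem, ones digit as leaf).

```python
from collections import defaultdict

# Ages read back from the slide's stem-and-leaf plot (23 students).
ages = [22, 22, 22, 23, 23, 24, 24, 24, 25, 25, 25, 27, 27, 27, 28, 29, 29,
        30, 31, 32, 35, 37, 76]

def stem_and_leaf(values):
    """Return one text line per stem, including empty stems in between."""
    leaves = defaultdict(list)
    for v in sorted(values):
        leaves[v // 10].append(v % 10)   # stem = tens digit, leaf = ones digit
    lines = []
    for stem in range(min(leaves), max(leaves) + 1):
        digits = "".join(str(d) for d in leaves.get(stem, []))
        lines.append(f"{stem}* | {digits}".rstrip())
    return lines

lines = stem_and_leaf(ages)
print("\n".join(lines))
```

Keeping the empty stems (4, 5, 6) is what makes the gap before the single value in the 70s visible.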
24
1 - 24 Histogram
25
1 - 25 Starting Salaries (in $K)
3* | 8
4* | 000025
5* | 0000
6* | 0000005
7* | 5
8* | 0
26
1 - 26 Numerical Data Properties
27
1 - 27 Thinking Challenge
... employees cite low pay: most workers earn only $20,000.
... President claims average pay is $70,000!
[Figure: pay levels marked at $20,000, $30,000, $50,000, $70,000, and $400,000]
28
1 - 28 Standard Notation
Measure | Sample | Population
Mean | \( \bar{x} \) | \( \mu \)
Stand. Dev. | \( s \) | \( \sigma \)
Variance | \( s^2 \) | \( \sigma^2 \)
Size | \( n \) | \( N \)
29
1 - 29 Numerical Data Properties
   Central Tendency (Location)
   Variation (Dispersion)
   Shape
30
1 - 30 Numerical Data Properties & Measures
Central Tendency: Mean, Median, Mode
Variation: Range, Interquartile Range, Variance, Standard Deviation
Shape: Skew
31
1 - 31 Central Tendency
32
1 - 32 Numerical Data Properties & Measures
Central Tendency: Mean, Median, Mode
Variation: Range, Interquartile Range, Variance, Standard Deviation
Shape: Skew
33
1 - 33 What’s wrong with this?
Measurements: 1 4 2 9 8
Middle measurement is 2, so that’s the median
Mean: \[ \bar{X} = \frac{\sum_{i=1}^{n} X_i}{n} = \frac{X_1 + X_2 + \cdots + X_n}{n} \]
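The catch is that the median is the middle value of the ordered data, not of the data as recorded. A small sketch makes the difference concrete; `median_by_position` is a hypothetical helper written for this example, checked against Python's standard `statistics.median`.

```python
import statistics

measurements = [1, 4, 2, 9, 8]

def median_by_position(values):
    """Median as the middle value of the sorted data."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    # Even n: average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

# The mistake on the slide: taking the middle entry without sorting first.
naive_middle = measurements[len(measurements) // 2]

print("middle of unsorted data:", naive_middle)
print("median (sorted first):  ", median_by_position(measurements))
print("library median:         ", statistics.median(measurements))
print("mean:                   ", statistics.mean(measurements))
```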
34
1 - 34 Ages
Mean = 29, Median = 27
2* | 22233444555777899
3* | 01257
4* |
5* |
6* |
7* | 6
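The two figures on this slide can be checked directly. In the sketch below the `ages` list is read back off the stem-and-leaf plot; dropping the oldest student is an extra step added here only to show how differently the mean and the median react to one extreme value.

```python
import statistics

# Ages read back from the slide's stem-and-leaf plot (23 students).
ages = [22, 22, 22, 23, 23, 24, 24, 24, 25, 25, 25, 27, 27, 27, 28, 29, 29,
        30, 31, 32, 35, 37, 76]

mean_age = statistics.mean(ages)
median_age = statistics.median(ages)

# Same summaries with the single extreme value (76) left out.
without_oldest = [a for a in ages if a != 76]
mean_trimmed = statistics.mean(without_oldest)
median_trimmed = statistics.median(without_oldest)

print("all ages:      mean", mean_age, "median", median_age)
print("without 76:    mean", round(mean_trimmed, 2), "median", median_trimmed)
```

Removing one value moves the mean by about two years but the median by only one, which is the same effect behind the pay example in the Thinking Challenge.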
35
1 - 35 Summary of Central Tendency Measures
Measure | Equation | Description
Mean | \( \sum X_i / n \) | Balance Point
Median | Position \( (n+1)/2 \) | Middle Value When Ordered
Mode | none | Most Frequent
36
1 - 36 Shape
37
1 - 37 Numerical Data Properties & Measures
Central Tendency: Mean, Median, Mode
Variation: Range, Interquartile Range, Variance, Standard Deviation
Shape: Skew
38
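1 - 38 Shape
1. Describes How Data Are Distributed
2. Measures of Shape
   Skew = departure from symmetry
[Figure: three distributions, with the typical ordering of the measures from left to right]
   Left-Skewed: Mean, Median, Mode
   Symmetric: Mean = Median = Mode
   Right-Skewed: Mode, Median, Mean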
39
1 - 39 Variation
40
1 - 40 Numerical Data Properties & Measures
Central Tendency: Mean, Median, Mode
Variation: Range, Interquartile Range, Variance, Standard Deviation
Shape: Skew
41
1 - 41 Quartiles
1. Measure of Noncentral Tendency
2. Split Ordered Data into 4 Quarters (25% each, divided by Q1, Q2, Q3)
3. Position of i-th Quartile
   \[ \text{Positioning point of } Q_i = \frac{i(n+1)}{4} \]
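The positioning-point rule can be sketched as below. One detail is an assumption rather than something stated on the slide: when the position is fractional, this sketch averages the two neighboring ordered values. Other conventions exist (rounding, linear interpolation) and can give slightly different answers. The `ages` list is read back from the course's age data; `quartile` is a hypothetical helper.

```python
import math

def quartile(values, i):
    """i-th quartile (i = 1, 2, 3) using the positioning point i(n+1)/4."""
    ordered = sorted(values)
    n = len(ordered)
    position = i * (n + 1) / 4          # 1-based position in the ordered data
    lower = math.floor(position)
    upper = math.ceil(position)
    # ASSUMPTION: a fractional position is resolved by averaging the two
    # neighboring values; for a whole-number position both indices coincide.
    return (ordered[lower - 1] + ordered[upper - 1]) / 2

ages = [22, 22, 22, 23, 23, 24, 24, 24, 25, 25, 25, 27, 27, 27, 28, 29, 29,
        30, 31, 32, 35, 37, 76]

q1, q2, q3 = (quartile(ages, i) for i in (1, 2, 3))
print("Q1, Q2, Q3:", q1, q2, q3)
```

With n = 23 the positions are 6, 12, and 18, all whole numbers, so the averaging assumption does not come into play for the age data.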
42
1 - 42 Ages: Range, Quartiles
2* | 22233444555777899
3* | 01257
4* |
5* |
6* |
7* | 6
43
1 - 43 Box Plots - Age and Salary
Age
   Quartiles: 24, 27, 30
   Inner fences: (15, 39)
   Outer fences: (6, 48)
Salary
   Quartiles: 41K, 50K, 60K
   Inner fences: ??
   Outer fences: ??
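The age fences follow from the quartiles if the inner fences sit 1.5 interquartile ranges beyond Q1 and Q3 and the outer fences sit 3 interquartile ranges beyond them. Those multipliers are the usual box-plot convention and are inferred here from the slide's numbers, not stated on it. The sketch below reproduces the age fences; `fences` is a hypothetical helper that can be reused for the salary quartiles.

```python
def fences(q1, q3):
    """Return (inner, outer) fences as (low, high) pairs."""
    iqr = q3 - q1
    inner = (q1 - 1.5 * iqr, q3 + 1.5 * iqr)   # ASSUMED multiplier: 1.5 x IQR
    outer = (q1 - 3 * iqr, q3 + 3 * iqr)       # ASSUMED multiplier: 3 x IQR
    return inner, outer

# Ages read back from the course's stem-and-leaf plot.
ages = [22, 22, 22, 23, 23, 24, 24, 24, 25, 25, 25, 27, 27, 27, 28, 29, 29,
        30, 31, 32, 35, 37, 76]

inner, outer = fences(24, 30)   # Q1 and Q3 for age, from the slide
beyond_inner = [a for a in ages if a < inner[0] or a > inner[1]]
beyond_outer = [a for a in ages if a < outer[0] or a > outer[1]]

print("inner fences:", inner)
print("outer fences:", outer)
print("beyond inner fences:", beyond_inner)
print("beyond outer fences:", beyond_outer)
```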
44
1 - 44 Variance & Standard Deviation
1. Measures of Dispersion
2. Most Common Measures
3. Consider How Data Are Distributed
4. Show Variation About Mean (\( \bar{X} \) or \( \mu \))
[Figure: data on an axis from 4 to 12, with the mean marked at 8.3]
45
1 - 45 Sample Variance Formula
\[ S^2 = \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n - 1} = \frac{(X_1 - \bar{X})^2 + (X_2 - \bar{X})^2 + \cdots + (X_n - \bar{X})^2}{n - 1} \]
n - 1 in denominator! (Use N if Population Variance)
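The formula translates almost line by line into code. This sketch uses the five measurements from the earlier median slide as example data and compares the hand-written functions with Python's standard `statistics.variance` and `statistics.pvariance`; the function names are hypothetical.

```python
import math
import statistics

def sample_variance(values):
    """Sum of squared deviations from the sample mean, divided by n - 1."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)

def population_variance(values):
    """Same sum of squares, divided by N (data treated as the whole population)."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / n

measurements = [1, 4, 2, 9, 8]
s2 = sample_variance(measurements)
sigma2 = population_variance(measurements)

print("sample variance:    ", round(s2, 4))
print("population variance:", round(sigma2, 4))
print("sample std. dev.:   ", round(math.sqrt(s2), 4))
```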
46
1 - 46 Equivalent Formula
47
1 - 47 Another Equivalent Formula
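The formulas on the two "equivalent formula" slides are not reproduced in this transcript. One widely used equivalent form, which may or may not be the one the slides showed, is the computational shortcut \( S^2 = \frac{\sum X_i^2 - (\sum X_i)^2 / n}{n - 1} \). The sketch below checks numerically that it agrees with the definition; the data and function names are illustrative assumptions.

```python
def variance_definition(values):
    """Sample variance from squared deviations about the mean."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)

def variance_shortcut(values):
    """Sample variance from the sum of X and the sum of X squared."""
    n = len(values)
    sum_x = sum(values)
    sum_x2 = sum(x * x for x in values)
    return (sum_x2 - sum_x ** 2 / n) / (n - 1)

measurements = [1, 4, 2, 9, 8]
by_definition = variance_definition(measurements)
by_shortcut = variance_shortcut(measurements)

print("definition:", round(by_definition, 4))
print("shortcut:  ", round(by_shortcut, 4))
```

The shortcut needs only one pass over the data, but it subtracts two large numbers and can lose precision for values with a large mean and small spread.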
48
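1 - 48 Empirical Rule
If x has a “symmetric, mound-shaped” distribution, then approximately 68% of values fall within 1 standard deviation of the mean, 95% within 2, and 99.7% within 3
Justification: Known properties of the “normal” distribution, to be studied later in the course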
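The percentages usually quoted for the Empirical Rule (about 68%, 95%, and 99.7% within 1, 2, and 3 standard deviations) can be recovered from the normal distribution itself. The sketch below uses the identity P(|Z| < k) = erf(k / sqrt(2)) for a standard normal Z; `share_within` is a hypothetical helper name.

```python
import math

def share_within(k):
    """Probability that a normal variable lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {share_within(k):.4f}")
```

For real data that are only roughly mound-shaped, these figures are approximations, not guarantees.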
49
1 - 49 Preview of Statistical Inference
You observe one data point
Make a hypothesis about the mean and standard deviation of the distribution from which it was drawn
Empirical Rule tells you how (un)likely the data point is
   If very unlikely, you are suspicious of the hypothesis about mean and standard deviation, and reject it
50
1 - 50 Summary of Variation Measures
Measure | Equation | Description
Range | \( X_{largest} - X_{smallest} \) | Total Spread
Interquartile Range | \( Q_3 - Q_1 \) | Spread of Middle 50%
Standard Deviation (Sample) | \( \sqrt{\frac{\sum (X_i - \bar{X})^2}{n - 1}} \) | Dispersion about Sample Mean
Standard Deviation (Population) | \( \sqrt{\frac{\sum (X_i - \mu_X)^2}{N}} \) | Dispersion about Population Mean
Variance (Sample) | \( \frac{\sum (X_i - \bar{X})^2}{n - 1} \) | Squared Dispersion about Sample Mean
51
1 - 51 Z-scores Number of standard deviations from the mean
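As a formula, the z-score of a value x is (x - mean) / standard deviation. The sketch below applies it to the course's age data, using the sample mean and sample standard deviation as an assumption (the slide does not say whether sample or population values are meant); `z_score` is a hypothetical helper.

```python
import statistics

# Ages read back from the course's stem-and-leaf plot.
ages = [22, 22, 22, 23, 23, 24, 24, 24, 25, 25, 25, 27, 27, 27, 28, 29, 29,
        30, 31, 32, 35, 37, 76]

mean_age = statistics.mean(ages)
sd_age = statistics.stdev(ages)      # sample standard deviation (n - 1 denominator)

def z_score(x, mean, sd):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / sd

z_scores = [z_score(a, mean_age, sd_age) for a in ages]
z_oldest = z_score(76, mean_age, sd_age)

print("z-score of age 76:", round(z_oldest, 2))
print("z-score of age 22:", round(z_score(22, mean_age, sd_age), 2))
```

The oldest student sits more than 4 standard deviations above the mean, which under the Empirical Rule would be a very unusual observation.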
52
1 - 52 Conclusion
1. Described Qualitative Data Graphically
2. Described Numerical Data Graphically
3. Created & Interpreted Graphical Displays
4. Explained Numerical Data Properties
5. Described Summary Measures
6. Analyzed Numerical Data Using Summary Measures