An Introduction to Statistics

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Chapter 2: Frequency Distributions
Copyright © 2014 Pearson Education, Inc. All rights reserved Chapter 2 Picturing Variation with Graphs.
Introduction to Summary Statistics
Chapter 1 Displaying the Order in a Group of Numbers
Statistics.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Descriptive Statistics
Frequency Distribution Ibrahim Altubasi, PT, PhD The University of Jordan.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo,
Chapter 1: Introduction to Statistics
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Summarizing Scores With Measures of Central Tendency
CHAPTER 2 Percentages, Graphs & Central Tendency.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
An Introduction to Statistics
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chapter 2 Describing Data.
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Prof. Eric A. Suess Chapter 3
Graphing options for Quantitative Data
Analysis and Empirical Results
ISE 261 PROBABILISTIC SYSTEMS
Descriptive Statistics
Chapter 2: Methods for Describing Data Sets
Frequency Distributions
4. Interpreting sets of data
Chapter 6 ENGR 201: Statistics for Engineers
Summarizing Scores With Measures of Central Tendency
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Statistical Reasoning
IB Psychology Today’s Agenda: Turn in:
IB Psychology Today’s Agenda: Turn in:
Description of Data (Summary and Variability measures)
Descriptive Statistics
An Introduction to Statistics
Chapter 1 Displaying the Order in a Group of Numbers.
Topic 5: Exploring Quantitative data
Descriptive and inferential statistics. Confidence interval
DS4 Interpreting Sets of Data
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
Chapter 3: Central Tendency
Honors Statistics Review Chapters 4 - 5
CHAPTER 1 Exploring Data
Experimental Design Experiments Observational Studies
Advanced Algebra Unit 1 Vocabulary
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Displaying the Order in a Group of Numbers Using Tables and Graphs
Presentation transcript:

An Introduction to Statistics

Introduction to Statistics I. What are Statistics? Procedures for organizing, summarizing, and interpreting information Standardized techniques used by scientists Vocabulary & symbols for communicating about data A tool box How do you know which tool to use? (1) What do you want to know? (2) What type of data do you have? Two main branches: Descriptive statistics Inferential statistics

Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion Inferential statistics Drawing inferences based on data. Using statistics to draw conclusions about the population from which the sample was taken.

Descriptive vs Inferential A. Descriptive Statistics: Tools for summarizing, organizing, simplifying data Tables & Graphs Measures of Central Tendency Measures of Variability Examples: Average rainfall in SB last year Number of car thefts in IV last quarter Your college G.P.A. Percentage of seniors in our class B.Inferential Statistics: Data from sample used to draw inferences about population Generalizing beyond actual observations Generalize from a sample to a population

Populations and Samples A parameter is a characteristic of a population e.g., the average height of all Americans. A statistics is a characteristic of a sample e.g., the average height of a sample of Americans. Inferential statistics infer population parameters from sample statistics e.g., we use the average height of the sample to estimate the average height of the population

Symbols and Terminology: Parameters = Describe POPULATIONS Greek letters   2   Statistics = Describe SAMPLES English letters  s2 s r Sample will not be identical to the population So, generalizations will have some error Sampling Error = discrepancy between sample statistic and corresponding popl’n parameter

Statistics are Greek to me! Statistical notation: X = “score” or “raw score” N = number of scores in population n = number of scores in sample Quiz scores for 5 Students: X = Quiz score for each student X 4 10 6 2 8

Statistics are Greek to me! X = Quiz score for each student Y = Number of hours studying Summation notation: “Sigma” = ∑ “The Sum of” ∑X = add up all the X scores ∑XY = multiply X×Y then add X Y 4 2 10 5 6 1 8 3

Descriptive Statistics Numerical Data Properties Central Variation Shape Tendency Skewness Mean Range Modes Median Interquartile Range Mode Standard Deviation Variance

Ordering the Data: Frequency Tables Three types of frequency distributions (FDs): (A) Simple FDs (B) Relative FDs (C) Cumulative FDs Why Frequency Tables? Gives some order to a set of data Can examine data for outliers Is an introduction to distributions

Making a Frequency Table List each possible value, from highest to lowest Go one by one through the scores, making a mark for each score next to its value on the list Make a table showing how many times each value on your list was used Calculate the percentage of scores for each value

A. Simple Frequency Distributions QUIZ SCORES (N = 30) 10 7 6 5 3 9 7 6 5 3 9 7 6 4 3 8 7 5 4 2 8 6 5 4 2 8 6 5 4 1 Simple Frequency Distribution of Quiz Scores (X) X f 10 9 8 7 4 6 5 3 2 1 f = N = 30

B. Relative Frequency Distributions Each score as a proportion or % of the total N Create new column, p (proportion) p = f / N Create a new column, % (percent) % = p  100 Column of p sums to 1.0 Column of % sums to 100% Note: A proportion (p) is like a probability!

Relative Frequency Distribution Quiz Scores X f p % 10 1 9 2 8 3 7 4 .13 13% 6 5 .17 17% .10 10% .07 7% .03 3% f=N=30 = =

C. Cumulative Frequency Distribution Shows total f (or % or p) at each value and ALL LOWER ranked values Begin at bottom and add upwards Useful when relative standing is important Cumulative percents (c%) have special name percentile ranks = % of observations with the same or smaller value

Cumulative Frequency Distribution __________________________________________________ Quiz Score f p % cf c% 10 1 .03 3% 30 100% 9 2 .07 7% 29 97 8 3 .10 10% 27 90 7 4 .13 13% 24 80 6 5 .17 17% 20 67 15 50 33 __________________________________________________  = 30 =1.0 = 100%

Grouped Frequency Tables Assign fs to intervals Example: Weight for 194 people Smallest = 93 lbs Largest = 265 lbs X (Weight) f 255 - 269 1 240 - 254 4 225 - 239 2 210 - 224 6 195 - 209 3 180 - 194 10 165 - 179 24 150 - 164 31 135 - 149 27 120 - 134 55 105 - 119 90 - 104 7 f = N = 194

Graphs of Frequency Distributions A picture is worth a thousand words! Graphs for numerical data: Stem & leaf displays Histograms Frequency polygons Graphs for categorical data Bar graphs

Making a Stem-and-Leaf Plot Cross between a table and a graph Like a grouped frequency distribution on its side Easy to construct Identifies each individual score Each data point is broken down into a “stem” and a “leaf.” Select one or more leading digits for the stem values. The trailing digit(s) becomes the leaves First, “stems” are aligned in a column. Record the leaf for every observation beside the corresponding stem value

Stem and Leaf Display

Stem and Leaf / Histogram Stem Leaf 2 1 3 4 3 2 2 3 6 4 3 8 8 5 2 5 By rotating the stem-leaf, we can see the shape of the distribution of scores. 6 Leaf 4 3 8 3 2 8 5 1 2 3 2 Stem 2 3 4 5

Histograms Histograms

Histograms f on y axis (could also plot p or % ) X values (or midpoints of class intervals) on x axis Plot each f with a bar, equal size, touching No gaps between bars

Frequency Polygons Frequency Polygons Depicts information from a frequency table or a grouped frequency table as a line graph

Frequency Polygon A smoothed out histogram Make a point representing f of each value Connect dots Anchor line on x axis Useful for comparing distributions in two samples (in this case, plot p rather than f )

Shapes of Frequency Distributions Frequency tables, histograms & polygons describe how the frequencies are distributed Distributions are a fundamental concept in statistics Unimodal Bimodal One peak Two peaks

Typical Shapes of Frequency Distributions

Normal and Bimodal Distributions (1) Normal Shaped Distribution Bell-shaped One peak in the middle (unimodal) Symmetrical on each side Reflect many naturally occurring variables (2) Bimodal Distribution Two clear peaks Often indicates two distinct subgroups in sample

Symmetrical vs. Skewed Frequency Distributions Symmetrical distribution Approximately equal numbers of observations above and below the middle Skewed distribution One side is more spread out that the other, like a tail Direction of the skew Positive or negative (right or left) Side with the fewer scores Side that looks like a tail

Symmetrical vs. Skewed Symmetric Skewed Right Skewed Left

Positively Skewed Positively skewed distribution Cluster towards the low end of the variable

Skewed Frequency Distributions Positively skewed AKA Skewed right Tail trails to the right *** The skew describes the skinny end ***

Negatively Skewed Negatively skewed distribution Cluster towards the high end of the variable

Skewed Frequency Distributions Negatively skewed Skewed left Tail trails to the left

Bar Graphs For categorical data Like a histogram, but with gaps between bars Useful for showing two samples side-by-side

Homework Chapter 1 Chapter 2 1,2, 3, 5, 15,17,22 1, 6, 7, 9, 14, 17, 21, 22