Lecture 3 Dustin Lueker.  Simple Random Sampling (SRS) ◦ Each possible sample has the same probability of being selected  Stratified Random Sampling.

Slides:



Advertisements
Similar presentations
Displaying Data Objectives: Students should know the typical graphical displays for the different types of variables. Students should understand how frequency.
Advertisements

Chapter 2: Frequency Distributions
So What Do We Know? Variables can be classified as qualitative/categorical or quantitative. The context of the data we work with is very important. Always.
Chapter 3 Graphic Methods for Describing Data. 2 Basic Terms  A frequency distribution for categorical data is a table that displays the possible categories.
Chapter 3 Graphical Methods for Describing Data
Chapter Two Organizing and Summarizing Data
Copyright © 2014 Pearson Education, Inc. All rights reserved Chapter 2 Picturing Variation with Graphs.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.
LECTURE 5 5 FEBRUARY 2009 STA291 Fall Itinerary 2.3 Graphical Techniques for Interval Data (mostly review) 2.4 Describing the Relationship Between.
QMS 6351 Statistics and Research Methods Chapter 2 Descriptive Statistics: Tabular and Graphical Methods Prof. Vera Adamchik.
QM Spring 2002 Statistics for Decision Making Descriptive Statistics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Frequency Distributions and Graphs
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
9 - 1 Module 9: Frequency Distributions This module includes descriptions of frequency distributions, frequency tables, histograms and frequency polygons.
Welcome to Data Analysis and Interpretation
Objectives (BPS chapter 1)
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Presenting Data in Tables and Charts. 2.1 Tables and Charts for Categorical Data Mutual Funds –Variables? Measurement scales? Four Techniques.
Descriptive Statistics: Tabular and Graphical Methods
LECTURE 5 10 SEPTEMBER 2009 STA291 Fall Itinerary Graphical Techniques for Interval Data (mostly review) Describing the Relationship Between Two.
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter Two Organizing and Summarizing Data 2.2 Organizing Quantitative Data I.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
LECTURE 4 3 FEBRUARY 2008 STA 291 Fall Administrative (Review) 5.3 Sampling Plans 2.2 Graphical and Tabular Techniques for Nominal Data 2.3 Graphical.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Lecture 3 Dustin Lueker. 2  Suppose the population can be divided into separate, non-overlapping groups (“strata”) according to some criterion ◦ Select.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
LECTURE 6 15 SEPTEMBER 2009 STA291 Fall Review: Graphical/Tabular Descriptive Statistics Summarize data Condense the information from the dataset.
Chapter 2 Describing Data.
Statistics Unit 2: Organizing Data Ms. Hernandez St. Pius X High School
STA Lecture 51 STA 291 Lecture 5 Chap 4 Graphical and Tabular Techniques for categorical data Graphical Techniques for numerical data.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 3 Graphical Methods for Describing Data.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
When data is collected from a survey or designed experiment, they must be organized into a manageable form. Data that is not organized is referred to as.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Chapter 2, Part A Descriptive Statistics: Tabular and Graphical Presentations n Summarizing Categorical Data n Summarizing Quantitative Data Categorical.
Understanding Basic Statistics Chapter One Organizing Data.
BUSINESS STATISTICS I Descriptive Statistics & Data Collection.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Lecture 2.  A descriptive technique  An organized tabulation showing exactly how many individuals are located in each category on the scale of measurement.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Sampling ‘Scientific sampling’ is random sampling Simple random samples Systematic random samples Stratified random samples Random cluster samples What?
Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Two Organizing Data.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
1 Frequency Distributions. 2 After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
Copyright © 2009 Pearson Education, Inc. 3.2 Picturing Distributions of Data LEARNING GOAL Be able to create and interpret basic bar graphs, dotplots,
Descriptive Statistics – Graphic Guidelines Pie charts – qualitative variables, nominal data, eg. ‘religion’ Bar charts – qualitative or quantitative variables,
Descriptive Statistics: Tabular and Graphical Methods
Frequency Distributions and Graphs
Chapter 2 Frequency Distribution and Graph
3.2 Picturing Distributions of Data
Chapter 2 Descriptive Statistics
CHAPTER 1: Picturing Distributions with Graphs
Frequency Distributions and Graphs
3 2 Chapter Organizing and Summarizing Data
STA 291 Spring 2008 Lecture 3 Dustin Lueker.
Chapter 2 Organizing data
STA 291 Summer 2008 Lecture 4 Dustin Lueker.
STA 291 Spring 2008 Lecture 4 Dustin Lueker.
Organizing, Displaying and Interpreting Data
Presentation transcript:

Lecture 3 Dustin Lueker

 Simple Random Sampling (SRS) ◦ Each possible sample has the same probability of being selected  Stratified Random Sampling ◦ The population can be divided into a set of non-overlapping subgroups (the strata) ◦ SRSs are drawn from each strata  Cluster Sampling ◦ The population can be divided into a set of non-overlapping subgroups (the clusters) ◦ The clusters are then selected at random, and all individuals in the selected clusters are included in the sample  Systematic Sampling ◦ Useful when the population consists as a list ◦ A value K is specified. Then one of the first K individuals is selected at random, after which every Kth observation is included in the sample STA 291 Summer 2010 Lecture 32

3  Summarize data ◦ Condense the information from the dataset  Graphs  Table  Numbers  Interval data ◦ Histogram  Nominal/Ordinal data ◦ Bar chart ◦ Pie chart

 Difficult to see the “big picture” from these numbers ◦ We want to try to condense the data STA 291 Summer 2010 Lecture 34 Alabama 11.6Alaska 9.0 Arizona 8.6Arkansas 10.2 California 13.1Colorado 5.8 Connecticut 6.3Delaware 5.0 D C 78.5Florida 8.9 Georgia 11.4Hawaii 3.8 ……

STA 291 Summer 2010 Lecture 35  A listing of intervals of possible values for a variable  Together with a tabulation of the number of observations in each interval.

STA 291 Summer 2010 Lecture 36 Murder RateFrequency >211 Total51

 Conditions for intervals ◦ Equal length ◦ Mutually exclusive  Any observation can only fall into one interval ◦ Collectively exhaustive  All observations fall into an interval  Rule of thumb: ◦ If you have n observations then the number of intervals should approximately STA 291 Summer 2010 Lecture 37

8  Relative frequency for an interval ◦ Proportion of sample observations that fall in that interval  Sometimes percentages are preferred to relative frequencies

STA 291 Summer 2010 Lecture 39 Murder RateFrequencyRelative Frequency Percentage > Total511100

STA 291 Summer 2010 Lecture 310  Notice that we had to group the observations into intervals because the variable is measured on a continuous scale ◦ For discrete data, grouping may not be necessary  Except when there are many categories  Intervals are sometimes called classes ◦ Class Cumulative Frequency  Number of observations that fall in the class and in smaller classes ◦ Class Relative Cumulative Frequency  Proportion of observations that fall in the class and in smaller classes

STA 291 Summer 2010 Lecture 311 Murder RateFrequencyRelative Frequency Cumulative Frequency Relative Cumulative Frequency > Total511 1

STA 291 Summer 2010 Lecture 312  Use the numbers from the frequency distribution to create a graph ◦ Draw a bar over each interval, the height of the bar represents the relative frequency for that interval ◦ Bars should be touching  Equally extend the width of the bar at the upper and lower limits so that the bars are touching.

STA 291 Summer 2010 Lecture 313

STA 291 Summer 2010 Lecture 314

STA 291 Summer 2010 Lecture 315  Histogram: for interval (quantitative) data  Bar graph is almost the same, but for qualitative data  Difference: ◦ The bars are usually separated to emphasize that the variable is categorical rather than quantitative ◦ For nominal variables (no natural ordering), order the bars by frequency, except possibly for a category “other” that is always last

 First Step ◦ Create a frequency distribution STA 291 Summer 2010 Lecture 316 Highest Degree ObtainedFrequency (Number of Employees) Grade School15 High School200 Bachelor’s185 Master’s55 Doctorate70 Other25 Total550

 Bar graph ◦ If the data is ordinal, classes are presented in the natural ordering STA 291 Summer 2010 Lecture 317

 Pie is divided into slices ◦ Area of each slice is proportional to the frequency of each class STA 291 Summer 2010 Lecture 318

STA 291 Summer 2010 Lecture 319

20  Write the observations ordered from smallest to largest ◦ Looks like a histogram sideways ◦ Contains more information than a histogram, because every single observation can be recovered  Each observation represented by a stem and leaf  Stem = leading digit(s)  Leaf = final digit STA 291 Summer 2010 Lecture 320

21STA 291 Summer 2010 Lecture 321 Stem Leaf #

22  Useful for small data sets ◦ Less than 100 observations  Can also be used to compare groups ◦ Back-to-Back Stem and Leaf Plots, using the same stems for both groups.  Murder Rate Data from U.S. and Canada ◦ Note: it doesn’t really matter whether the smallest stem is at top or bottom of the table STA 291 Summer 2010 Lecture 322

23STA 291 Summer 2010 Lecture 323 PRESIDENTAGEPRESIDENTAGEPRESIDENTAGE Washington67Fillmore74Roosevelt60 Adams90Pierce64Taft72 Jefferson83Buchanan77Wilson67 Madison85Lincoln56Harding57 Monroe73Johnson66Coolidge60 Adams80Grant63Hoover90 Jackson78Hayes70Roosevelt63 Van Buren79Garfield49Truman88 Harrison68Arthur56Eisenhower78 Tyler71Cleveland71Kennedy46 Polk53Harrison67Johnson64 Taylor65McKinley58Nixon81 Reagan 93 Ford 93 StemLeaf

24  Discrete data ◦ Frequency distribution  Continuous data ◦ Grouped frequency distribution  Small data sets ◦ Stem and leaf plot  Interval data ◦ Histogram  Categorical data ◦ Bar chart ◦ Pie chart  Grouping intervals should be of same length, but may be dictated more by subject-matter considerations STA 291 Summer 2010 Lecture 324

25  Present large data sets concisely and coherently  Can replace a thousand words and still be clearly understood and comprehended  Encourage the viewer to compare two or more variables  Do not replace substance by form  Do not distort what the data reveal STA 291 Summer 2010 Lecture 325

26  Don’t have a scale on the axis  Have a misleading caption  Distort by using absolute values where relative/proportional values are more appropriate  Distort by stretching/shrinking the vertical or horizontal axis  Use bar charts with bars of unequal width STA 291 Summer 2010 Lecture 326

27  Frequency distributions and histograms exist for the population as well as for the sample  Population distribution vs. sample distribution  As the sample size increases, the sample distribution looks more and more like the population distribution ◦ This will be explored further later on in the course STA 291 Summer 2010 Lecture 327

28  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets finer and finer  Similar to the idea of using smaller and smaller rectangles to calculate the area under a curve when learning how to integrate  Symmetric distributions ◦ Bell-shaped ◦ U-shaped ◦ Uniform  Not symmetric distributions: ◦ Left-skewed ◦ Right-skewed ◦ Skewed STA 291 Summer 2010 Lecture 328

Symmetric Right-skewed Left-skewed STA 291 Summer 2010 Lecture 329