STA 291 - Lecture 51 STA 291 Lecture 5 Chap 4 Graphical and Tabular Techniques for categorical data Graphical Techniques for numerical data.

Slides:



Advertisements
Similar presentations
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Advertisements

LECTURE 5 5 FEBRUARY 2009 STA291 Fall Itinerary 2.3 Graphical Techniques for Interval Data (mostly review) 2.4 Describing the Relationship Between.
QBM117 Business Statistics Statistical Inference Sampling 1.
Chapter 2 Presenting Data in Tables and Charts
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Chapter 2 Describing Data Sets
QM Spring 2002 Statistics for Decision Making Descriptive Statistics.
Descriptive statistics (Part I)
Ka-fu Wong © 2003 Chap 2-1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
213Sampling.pdf When one is attempting to study the variable of a population, whether the variable is qualitative or quantitative, there are two methods.
Alok Srivastava Chapter 2 Describing Data: Graphs and Tables Basic Concepts Frequency Tables and Histograms Bar and Pie Charts Scatter Plots Time Series.
Basic Descriptive Statistics Healey, Chapter 2
The Stats Unit.
SECTION 12-1 Visual Displays of Data Slide
Objectives (BPS chapter 1)
1 Statistics This lecture covers chapter 1 and 2 sections in Howell Why study maths in psychology? “Mathematics has the advantage of teaching you.
Chapter 1: Introduction to Statistics
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
CHAPTER 2 Frequency Distributions and Graphs. 2-1Introduction 2-2Organizing Data 2-3Histograms, Frequency Polygons, and Ogives 2-4Other Types of Graphs.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-1 Visual Displays of Data.
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
Chapter 2 Presenting Data in Tables and Charts. 2.1 Tables and Charts for Categorical Data Mutual Funds –Variables? Measurement scales? Four Techniques.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Basic Descriptive Statistics Percentages and Proportions Ratios and Rates Frequency Distributions: An Introduction Frequency Distributions for Variables.
Chapter 1 Introduction and Data Collection
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
LECTURE 5 10 SEPTEMBER 2009 STA291 Fall Itinerary Graphical Techniques for Interval Data (mostly review) Describing the Relationship Between Two.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Lecture 2 Graphs, Charts, and Tables Describing Your Data
Basic Business Statistics Chapter 2:Presenting Data in Tables and Charts Assoc. Prof. Dr. Mustafa Yüzükırmızı.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
LECTURE 4 3 FEBRUARY 2008 STA 291 Fall Administrative (Review) 5.3 Sampling Plans 2.2 Graphical and Tabular Techniques for Nominal Data 2.3 Graphical.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Lecture 3 Dustin Lueker. 2  Suppose the population can be divided into separate, non-overlapping groups (“strata”) according to some criterion ◦ Select.
Lecture 3 Dustin Lueker.  Simple Random Sampling (SRS) ◦ Each possible sample has the same probability of being selected  Stratified Random Sampling.
LECTURE 6 15 SEPTEMBER 2009 STA291 Fall Review: Graphical/Tabular Descriptive Statistics Summarize data Condense the information from the dataset.
Chapter 2 Describing Data.
LECTURE 3 THURSDAY, 22 JANUARY STA 291 Fall
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 2-1 Chapter 2 Presenting Data in Tables and Charts Statistics For Managers 4 th.
Chap 2-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course in Business Statistics 4 th Edition Chapter 2 Graphs, Charts, and Tables.
Essential Statistics Chapter 11 Picturing Distributions with Graphs.
Applied Quantitative Analysis and Practices
Aim: How do we analyze data with a two-way table?
„There are three kinds of lies: lies, damned lies, and statistics.“ (B.Disraeli) Introduction To Statistics.
STA Lecture 41 STA 291 Lecture 4 Jan 26, 2010 Methods of Collecting Data Survey Experiment.
STATISTICS AND OPTIMIZATION Dr. Asawer A. Alwasiti.
Lecture 2 Dustin Lueker.  Parameter ◦ Numerical characteristic of the population  Calculated using the whole population  Statistic ◦ Numerical characteristic.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms.
Elementary Statistics (Math 145) June 19, Statistics is the science of collecting, analyzing, interpreting, and presenting data. is the science.
Frequency Distributions and Graphs. Organizing Data 1st: Data has to be collected in some form of study. When the data is collected in its’ original form.
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor
Chapter 2 Describing Data: Graphs and Tables
Frequency Distributions and Graphs
Lecture 3 part-2: Organization and Summarization of Data
STA 291 Spring 2008 Lecture 3 Dustin Lueker.
Descriptive Statistics
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Review for Exam 1 Ch 1-5 Ch 1-3 Descriptive Statistics
Constructing and Interpreting Visual Displays of Data
Gathering and Organizing Data
Displaying Data – Charts & Graphs
Presentation transcript:

STA Lecture 51 STA 291 Lecture 5 Chap 4 Graphical and Tabular Techniques for categorical data Graphical Techniques for numerical data

STA Lecture 52 Review: Stratified Sampling Suppose the population can be divided into non-overlapping groups (“strata”) according to some criterion. Example: All voters divided into male voters and female voters. Select a Simple Random Sample independently from each group.

how it is different from SRS? (SRS) = any possible selection equally likely Any selection got discriminated/eliminated here in stratified sampling? STA Lecture 53

4 Examples of Stratified Sampling The population is divided into male/female sub- populations (Two strata). Within each sub- population do an SRS. The population is divided into [Whites, Blacks, Hispanics, Asians, Others.] Five strata. Within each, do a SRS. Smaller groups may be over-sampled: For example: select from each group a SRS of same size n=500.

STA Lecture 55 How could stratification be useful? We may want to draw inference about population parameters for each subgroup When done right, estimators from stratified random samples are more precise than those from Simple Random Samples

STA Lecture 56 Important Sampling Plans: SRS and variations Simple Random Sampling (SRS) –Each possible sample has the same probability of being selected. Stratified Random Sampling –The population can be divided into a set of non- overlapping subgroups (the strata) –SRSs are drawn from each strata Systematic Sampling (eg. Digital music)

STA Lecture 57 Sampling Error Assume you take a SRS of 100 UK students and ask them about their political affiliation (Democrat, Republican, Independent) Now take another SRS of 100 UK students Will you get the same percentages?

STA Lecture 58 No, because of sampling variability. Also, the result will not be exactly the same as the population percentage, unless you take a “sample” consisting of the whole population of 30,000 students (this would be called a “census”) or if you are very lucky

STA Lecture 59 Sampling Error Sampling Error is the error that occurs when a statistic based on a sample estimates or predicts the value of a population parameter. In SRS, stratified RS, the sampling error can usually be quantified. In other sampling plans, there is also sampling variability, but its extent is not predictable.

STA Lecture 510 Nonsampling Error bias due to question wording, question order, nonresponse (people refuse to answer),

STA Lecture 511 Chapter 4 Display and Describe Categorical Data Summarize data using graphs, tables, and numbers. Condense the information from the dataset Bar chart, Pie chart, scatter plot

STA Lecture 512 Bar Graph features: –The bars are usually separated to emphasize that the variable is categorical rather than quantitative –For nominal variables (no natural ordering), order the bars by frequency, except possibly for a category “other” that is always last

STA Lecture 513 Pie Chart (Nominal/Ordinal Data) First Step: Create a Frequency Distribution

STA Lecture 514 We could display this data in a bar chart… Bar Graph: If the data is ordinal, classes are presented in the natural ordering.

STA Lecture 515

STA Lecture 516 Pie Chart Pie Chart: Pie is divided into slices; The area of each slice is proportional to the frequency of each class.

STA Lecture 517 Pie Chart for Highest Degree Achieved

Scatter plot Plots with two variables (reveal the relationship between the two variables) STA Lecture 518

STA Lecture 519

Dynamic graph: graph change over time – movie or animation. Try watch more of those movies at STA Lecture 520

Distribution of a (continuous, numerical) variable STA Lecture 521 Histogram Smoothed histogram – distribution

A distribution STA Lecture 522

Frequency Tables Suppose the variable can only take one of 5 possible values. We can condense a large sample (n=2000) to STA Lecture 523 value frequency

Contingency tables More complicated tables by rows and columns (cross tabulation) STA Lecture 524

STA Lecture 425 Homework 2 Due Tuesday next week (Feb 5,11 PM). Online homework assignment.

STA Lecture 526 Attendance Survey Question 5 On a 4”x6” index card (or little piece of paper) –Please write down your name and section number. –Today’s Question: What is “SRS” stands for in statistical observational study?

STA Lecture 627 Histogram of Numbers of Males vs. Females

STA Lecture 628 Histogram of Numbers of Males vs. Females

STA Lecture 629 Histogram of Numbers of Males vs. Females

STA Lecture 630 Histogram of Numbers of Males vs. Females

STA Lecture 631 Histogram of Numbers of Males vs. Females

STA Lecture 632 Histogram of Numbers of Males vs. Females

STA Lecture 633 Histogram of Numbers of Males vs. Females

STA Lecture 634 Histogram of Numbers of Males vs. Females

STA Lecture 635 Histogram of Numbers of Males vs. Females

STA Lecture 636 Histogram of Numbers of Males vs. Females

STA Lecture 637

Dynamic graph: graph changes over time talks/hans-rosling-ted-talk seemingly-impossible-is-possible/ STA Lecture 538

STA Lecture 539 Histogram (for continuous numerical type data) Divide the range of possible values into many (contiguous, non-overlap) intervals, then count how many times data falls into each interval. Plot based on this table is called histogram.

STA Lecture 540 Data Table: Murder Rates per1000 Alabama 11.6Alaska 9.0 Arizona 8.6Arkansas 10.2 California 13.1Colorado 5.8 Connecticut 6.3Delaware 5.0 D C 78.5Florida 8.9 Georgia 11.4Hawaii 3.8 ……………………………………………. Difficult to see the “big picture” from these numbers Try to condense the data…

STA Lecture 541 Frequency Distribution A listing of intervals of possible values for a variable Together with a tabulation of the number of observations in each interval.

STA Lecture 542 Frequency Distribution Murder RateFrequency >211 Total51

STA Lecture 543 Frequency Distribution Use intervals of same length (wherever possible) Intervals must be mutually exclusive: Any observation must fall into one and only one interval

STA Lecture 544 Relative Frequencies Relative frequency for an interval: The proportion of sample observations that fall in that interval Sometimes, percentages are preferred to relative frequencies

STA Lecture 545 Frequency and Relative Frequency and Percentage Distribution Murder RateFrequencyRelative Frequency Percentage > Total511100

STA Lecture 546 Frequency Distributions Notice that we had to group the observations into intervals because the variable is measured on a continuous scale For discrete data, grouping may not be necessary (except when there are many categories)

STA Lecture 547 Histogram (for continuous numerical Data) Use the numbers from the frequency distribution to create a graph Draw a bar over each interval, the height of the bar represents the (relative) frequency for that interval Bars should be touching; I.e., equally extend the width of the bar at the upper and lower limits so that the bars are touching.

STA Lecture 548 Histogram

STA Lecture 549 Histogram w/o DC

STA Lecture 550 Histogram Usually produced by software. We need to understand what they try to say. es/histogram/