Lecture 3. Data Compression for Two Variables: Scatterplots, Cross- Tabulations, and Correlation David R. Merrell 90-786 Intermediate Empirical Methods.

Slides:



Advertisements
Similar presentations
Exercise 19: Sample Size. Part One Explore how sample size affects the distribution of sample proportions This was achieved by first taking random samples.
Advertisements

Computer, Minitab, and other printouts. r 2 = 67.8% 67.8% of the variation in percent body fat can be explained by the variation in waist size.
Quiz 3 Results (Review for Midterm 1). Results Problem 1 Everyone scored 100% (4/4) (1)Are the following variables categorical (c) or quantitative (q)?
Essentials of Applied Quantitative Methods for Health Services Managers Class Slides.
Chicago Insurance Redlining Example Were insurance companies in Chicago denying insurance in neighborhoods based on race?
Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B.
By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
Lecture 10. Random Sampling and Sampling Distributions David R. Merrell Intermediate Empirical Methods for Public Policy and Management.
Lecture 1. Making Sense of Data: Data Variation David R. Merrell Intermediate Empirical Methods for Public Policy and Management.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Session 7.1 Bivariate Data Analysis
1 Business 260: Managerial Decision Analysis Professor David Mease Lecture 1 Agenda: 1) Course web page 2) Greensheet 3) Numerical Descriptive Measures.
Lecture 2. Data Compression for One Variable George Duncan Intermediate Empirical Methods for Public Policy and Management.
Multiple Choice Review
Quantitative Analysis
Mother and Child Health: Research Methods G.J.Ebrahim Editor Journal of Tropical Pediatrics, Oxford University Press.
Analyzing Data: Bivariate Relationships Chapter 7.
Data Handling GCSE coursework. Hypothesis Collection of Data Data Handling GCSE coursework Hypothesis.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
GCSE Data Handling Coursework 1 Examining the Data examine carefully the data you are given it’s important to get a feel for the raw data before you use.
Crosswalk Data Analysis Lynn White’s Stats Class Spring 2011 Add the names of all the team members to this first slide.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 3.
Confidence Intervals I 2/1/12 Correlation (continued) Population parameter versus sample statistic Uncertainty in estimates Sampling distribution Confidence.
(a.k.a: The statistical bare minimum I should take along from STAT 101)
Lecture 5. Basic Probability David R. Merrell Intermediate Empirical Methods For Public Policy and Management.
Sociological metodology Quantification Petr Soukup.
Descriptive Statistics becoming familiar with the data.
Basic data analysis. The basic analysis of SPSS that will be introduced in this class  Frequencies  This analysis produces frequency tables showing.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Chapter 2 Descriptive Statistics Section 2.3 Measures of Variation Figure 2.31 Repair Times for Personal Computers at Two Service Centers  Figure 2.31.
1 Chapter 10 Correlation. Positive and Negative Correlation 2.
Lecture 9. Continuous Probability Distributions David R. Merrell Intermediate Empirical Methods for Public Policy and Management.
EMPLOYMENT AND EARNINGS James and Clayton. Topic of Interest Describes the economic status of all businesses in Canada (trends) Helps with determining.
College and Career-Readiness Conference Summer 2015 FOR ALGEBRA 1 TEACHERS.
STAT 3120 Statistical Methods I Lecture 1 Introduction.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(1)-1 Chapter 2: Displaying and Summarizing Data Part 1: Displaying Data With Charts.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
I Source: U.S. Census Bureau Analysis by Kentucky Population Research, State Data Center, University of Louisville.
Ch. 3 Histograms In a histogram, the areas of the blocks represent percentages. –Example: Ex. Set B #3 Types of variables: –Qualitative vs. quantitative.
Does a Person’s Income Relate to Their Age?. Variables  Annual Income Measured in Dollars per Year  Age of Individual Measured in Years.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Data: Categorical vs. Quantitative Mr. Diaz Math 3.
Categorizing Data. Qualitative vs Quantitative Data Qualitative Deals with descriptions. Data can be observed but not measured. Colors, textures, smells,
Introduction A histogram is a graph that summarizes data.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
Agenda Introduction to Statistics Descriptive Statistics Measures of Center Measures of Spread - Variability.
BUS 308 Entire Course (Ash Course) FOR MORE CLASSES VISIT BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308 Week 1.
 Naïve Bayes  Data import – Delimited, Fixed, SAS, SPSS, OBDC  Variable creation & transformation  Recode variables  Factor variables  Missing.
Chapter 11 Summarizing & Reporting Descriptive Data.
Prof. Eric A. Suess Chapter 3
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Lecture #3 Tuesday, August 30, 2016 Textbook: Sections 2.4 through 2.6
Box Plots & Distributions
Review 1. Describing variables.
X AND R CHART EXAMPLE IN-CLASS EXERCISE
Statistical Analysis – Part 2
Descriptive Statistics:
Descriptive Statistics:
Chapter 4 – Numerical Descriptive Techniques
Essential Statistics (a.k.a: The statistical bare minimum I should take along from STAT 101)
Understanding Research Results: Description and Correlation
Summarising and presenting data - Bivariate analysis
EQ: How well does the line fit the data?
Treat everyone with sincerity,
Statistical Inference for Managers
Welcome to AP Statistics
MBA 510 Review for Exam 1 February 8
Introductory Statistics
Presentation transcript:

Lecture 3. Data Compression for Two Variables: Scatterplots, Cross- Tabulations, and Correlation David R. Merrell Intermediate Empirical Methods for Public Policy and Management

Lecture 3: Agenda Review of Lecture 2 Cross-Tabulations Comparison Bar Charts Parallel Box Plots Scatterplots Correlation Coefficients

Review of Lecture 2 Mean or Median Models for Data

Mean or Median Complaints have reached the city manager that Tardy City is taking too long to pay its bills. Data are days taken to pay seven bills: Calculate the mean and median. What do you conclude?

Models for Data Data = Fit + Residual Fit as a Center Mean Median Mode Example: Number of Stat Courses Taken by Students in

Summary Statistics (Excel)

Summary Statistics (Minitab) Descriptive Statistics Variable N Mean Median Tr Mean StDev SE Mean C Variable Min Max Q1 Q3 C

Measures of Error

Data Compression for Two Variables...And More Two-Variable Description Cross-Tabulations Comparison Bar Charts Parallel Box Plots Scatterplots Scatterplot Matrix Correlation Coefficients

Two-Variable Description

Structure of a Cross-Tabulation

Street Repair Practices Study street repair practices of local government Cities and counties handle street repairs: using their own public employees exclusively by contracting out part of the work contracting out all the work

Table 1. Street Repair: Counts Type of Local Government Street Repair Practices by Type of Government: Public Employees and Contracting by Cities and Counties in the United States

Table 2. Street Repair: Percents Type of Local Government Street Repair Practices by Type of Government: Public Employees and Contracting by Cities and Counties in the United States

Educational Achievement Residents of Allegheny County that are in labor force Random sample survey of Allegheny County residents in labor force in 199? Variables: gender and highest educational achievement

Educational Achievement: Coding of Ordinal Variables 1 if grade 4 or less 2 if grades if grade 8 4 if high school incomplete (9-11) 5 if high school graduate (12) 6 if technical, trade, or business after high school 7 if college/ university incomplete 8 if college/university graduate or more

Educational Achievement Table

Bar Chart

Job Satisfaction and Income for Postal Employees

Five Number Summary Age of Allegheny County residents by location: individuals in labor force in 199?.

Parallel Box Plots o o o o The Mon ValleyPittsburgh Other

Scatterplots Creating via Excel ChartWizard Transformation of Variables Scatterplot Matrices

Scatterplot 1 Salary Years employed

Scatterplot 2 Salary Years employed

Scatterplot 3 Salary Years employed

Scatterplot Matrix

Correlation Coefficient, r

Properties of r

International Adoption Visas: 1991 vs 1988 r:/academic/90-786/ Chatterjee/ Adopt.dat

International Adoption Visas Country Etc.

Excel Calculation of r Use statistical function, correl Eliminate missing data values Identify X data Identify Y data Finish Value: r = (.88)

Minitab Calculation of r Correlations (Pearson) Correlation of log 1988 and log 1992 = 0.873

Next Time... Ethics and the Value of Data Social Value of Data Privacy Issues Confidentiality Applications in Health Care