Organizing Data Proportions, Percentages, Rates, and rates of change.

Slides:



Advertisements
Similar presentations
Data, Tables and Graphs Presentation. Types of data Qualitative and quantitative Qualitative is descriptive (nominal, categories), labels or words Quantitative.
Advertisements

DEPICTING DISTRIBUTIONS. How many at each value/score Value or score of variable.
Basic Statistics Frequency Distributions & Graphs.
Quantitative Data Analysis: Univariate (cont’d) & Bivariate Statistics
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
PPA 415 – Research Methods in Public Administration Lecture 2 - Counting and Charting Responses.
Calculating & Reporting Healthcare Statistics
Quantitative Data Analysis Definitions Examples of a data set Creating a data set Displaying and presenting data – frequency distributions Grouping and.
MR2300: MARKETING RESEARCH PAUL TILLEY Unit 10: Basic Data Analysis.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Basic Descriptive Statistics Healey, Chapter 2
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
The Stats Unit.
Chapter 3: Graphic Presentation
Measures of Central Tendency
Frequency Distributions and Percentiles
Frequency Distributions and Graphs
Basic Descriptive Statistics Chapter 2. Percentages and Proportions Most used statistics Could say that 927 out of 1,516 people surveyed said that hard.
Chapter 2: Organization of Information: Frequency Distributions Frequency Distributions Proportions and Percentages Percentage Distributions Comparisons.
Frequency Table Frequency tables are an efficient method of displaying data The number of cases for each observed score are listed Scores that have 0 cases.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 2 BROOKLYN COLLEGE – CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Basic Statistics Standard Scores and the Normal Distribution.
1 Statistics This lecture covers chapter 1 and 2 sections in Howell Why study maths in psychology? “Mathematics has the advantage of teaching you.
Statistics 1 Course Overview
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
CHAPTER 2 Frequency Distributions and Graphs. 2-1Introduction 2-2Organizing Data 2-3Histograms, Frequency Polygons, and Ogives 2-4Other Types of Graphs.
Graphs of Frequency Distribution Introduction to Statistics Chapter 2 Jan 21, 2010 Class #2.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 2-1 What is a Frequency Distribution? A frequency distribution is a list or a.
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Data Presentation.
Chapter 2 Organizing the Data. Introduction Learn how to show variable relationship through diagrams Thematically cover graphs and maps Understand the.
Basic Descriptive Statistics Percentages and Proportions Ratios and Rates Frequency Distributions: An Introduction Frequency Distributions for Variables.
Statistics Sampling Distributions
Chapter 11 Descriptive Statistics Gay, Mills, and Airasian
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
FREQUENCY DISTRIBUTION
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Descriptive statistics I Distributions, summary statistics.
DISTRIBUTIONS. What is a “distribution”? One distribution for a continuous variable. Each youth homicide is a case. There is one variable: the number.
Chapter 2 Data Presentation Using Descriptive Graphs.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
TYPES There are several TYPES of variables that reflect characteristics of the data Ratio Interval Ordinal Nominal.
Chapter 11 Univariate Data Analysis; Descriptive Statistics These are summary measurements of a single variable. I.Averages or measures of central tendency.
GrowingKnowing.com © Frequency distribution Given a 1000 rows of data, most people cannot see any useful information, just rows and rows of data.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.
Organizing & Reporting Data: An Intro Statistical analysis works with data sets  A collection of data values on some variables recorded on a number cases.
DISTRIBUTIONS. What is a “distribution”? One distribution for a continuous variable. Each youth homicide is a case. There is one variable: the number.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Organizing the Data Levin and Fox Elementary Statistics In Social Research Chapter 2.
Chapter 2 Describing and Presenting a Distribution of Scores.
Statistics - is the science of collecting, organizing, and interpreting numerical facts we call data. Individuals – objects described by a set of data.
3.3 More about Contingency Tables Does the explanatory variable really seem to impact the response variable? Is it a strong or weak impact?
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Frequency Distributions and Graphs. Organizing Data 1st: Data has to be collected in some form of study. When the data is collected in its’ original form.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
Making Sense of Statistics: A Conceptual Overview Sixth Edition PowerPoints by Pamela Pitman Brown, PhD, CPG Fred Pyrczak Pyrczak Publishing.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
The Normal Distributions.  1. Always plot your data ◦ Usually a histogram or stemplot  2. Look for the overall pattern ◦ Shape, center, spread, deviations.
Chapter 2 Organizing the Data
DESCRIPTIVE STATISTICS
Frequency Distributions and Graphs
Copyright © Pearson Education, Inc., Allyn & Bacon 2010
Organizing and Visualizing Variables
Displaying Data – Charts & Graphs
Experimental Design Experiments Observational Studies
Chapter 3: Graphic Presentation
The Frequency Distribution
Data, Tables and Graphs Presentation.
Presentation transcript:

Organizing Data Proportions, Percentages, Rates, and rates of change.

Raw Data Often hard to interpret just a bunch of raw scores Often hard to interpret just a bunch of raw scores Raw scores can be transformed to show patterns and trends in the data Raw scores can be transformed to show patterns and trends in the data Most useful is the frequency distribution or table Most useful is the frequency distribution or table

Frequency Tables will have: Informative title Informative title Two columns for nominal data: Two columns for nominal data: (1) response and (1) response and (2) frequency (How often did certain responses occur?) (2) frequency (How often did certain responses occur?)

Standardizing data Proportion: compare the number of cases for each response (frequency, f) with the total number of cases (N). Proportion: compare the number of cases for each response (frequency, f) with the total number of cases (N). Proportion = frequency / number = f / N frequency / number = f / N In the previous example, 20 out of 45 students earned a B, so the proportion earning a B is 20/45 = , which (rounding to 2 decimals more than the original data) =.44 In the previous example, 20 out of 45 students earned a B, so the proportion earning a B is 20/45 = , which (rounding to 2 decimals more than the original data) =.44

Percentage is the frequency per 100 cases. (It is a special case of a proportion.) Percentage is the frequency per 100 cases. (It is a special case of a proportion.) Percentage = 100 (f / N) Percentage = 100 (f / N) People are “used” to thinking in percentages (such as in cents per dollar....). People are “used” to thinking in percentages (such as in cents per dollar....).

Example 20 our of 45 students earned a B in a course. 20 our of 45 students earned a B in a course. Proportion = f / N = 20/45 = 0.44 Proportion = f / N = 20/45 = 0.44 Percentage = 100 (20/35) = 44% Percentage = 100 (20/35) = 44% (Per cent means per 100, and we write it 0/0. Per thousand would be 0/00) (Per cent means per 100, and we write it 0/0. Per thousand would be 0/00)

Ratios A ratio of “a” to “b” is the frequency of “a” compared to the frequency of “b”, with the frequency of “a” coming first, or in the numerator, just as it does in the sentence. A ratio of “a” to “b” is the frequency of “a” compared to the frequency of “b”, with the frequency of “a” coming first, or in the numerator, just as it does in the sentence. a/b or sometimes expressed as a:b a/b or sometimes expressed as a:b

Comparisons using the Frequency Ratio: f1 / f2 In a certain class, there were 15 women and 30 men, in a class of 45. So, in the class, In a certain class, there were 15 women and 30 men, in a class of 45. So, in the class, Proportion of women = 15/45 = 0.33 Proportion of women = 15/45 = 0.33 Percentage of women = (100).33 = 33% Percentage of women = (100).33 = 33% (note this is not 0. 33%) (note this is not 0. 33%)

Ratio – depends on how the question is stated. Ratio of women to men = 15/30 = 1/2, or there was 1 woman for every 2 men. Ratio of women to men = 15/30 = 1/2, or there was 1 woman for every 2 men. However, the ratio of men to women would be 30/15 = 2 men for every woman. However, the ratio of men to women would be 30/15 = 2 men for every woman. Note ratio is used differently than is the proportion in the class. Note ratio is used differently than is the proportion in the class.

Rate A rate indicates the number of actual cases compared to the number of potential cases. Pretty subtle, eh? A rate indicates the number of actual cases compared to the number of potential cases. Pretty subtle, eh? For population studies, these are usually expressed as the number of actual cases per 1000 potential cases (usually per 1000 people in the population). For population studies, these are usually expressed as the number of actual cases per 1000 potential cases (usually per 1000 people in the population).

Example A town has 5000 people, of whom 450 have graduated from college. A town has 5000 people, of whom 450 have graduated from college. The town’s college graduation rate is: 450/5000 =.09 = 9% or The town’s college graduation rate is: 450/5000 =.09 = 9% or 90 per thousand. 90 per thousand. (Why might I express this a per thousand? I chose the “per” part so the number was something easily visualized.) (Why might I express this a per thousand? I chose the “per” part so the number was something easily visualized.)

What denominators to use? per 100 = percentage per 100 = percentage per 1000 = commonly used for birth and death rates, divorces, etc. per 1000 = commonly used for birth and death rates, divorces, etc. per 100,000 for lots of things determined in the U.S. census per 100,000 for lots of things determined in the U.S. census per 1,000,000 for things determined worldwide per 1,000,000 for things determined worldwide

Generalization Use the denominator that gives you the simplest whole number, easiest for you to grasp. Usually this is a number between 1 and 100. Use the denominator that gives you the simplest whole number, easiest for you to grasp. Usually this is a number between 1 and 100. It’s hard for people to visualize the meaning of very small or large numbers such as , or 132,431,000 It’s hard for people to visualize the meaning of very small or large numbers such as , or 132,431,000

Mortality Rates for example Mortality Rates per 1000 among blacks & whites in Baltimore in 1972 were Mortality Rates per 1000 among blacks & whites in Baltimore in 1972 were for whites, 15.2 per 1000 (or 1.52%) for whites, 15.2 per 1000 (or 1.52%) for blacks, 9.8 per 1000 (or 0.98%) for blacks, 9.8 per 1000 (or 0.98%) Easier to visualize than.0152 for whites and.0098 for blacks. Do you agree? Easier to visualize than.0152 for whites and.0098 for blacks. Do you agree?

Powers of 10 Review Suppose a disease rate of per person (per capita). Suppose a disease rate of per person (per capita). To convert into something more comprehensible, move the decimal point to the right 4 places, to To convert into something more comprehensible, move the decimal point to the right 4 places, to places = 10,000 (4 zeroes), 4 places = 10,000 (4 zeroes), so this becomes 5.67 per 10,000. or go one step further to 56.7 per 100,000. so this becomes 5.67 per 10,000. or go one step further to 56.7 per 100,000.

Rates of change (100) Rate 2 – Rate 1 / Rate 1 (100) Rate 2 – Rate 1 / Rate 1 then convert into the proper units (per 100, 1000, etc.) then convert into the proper units (per 100, 1000, etc.) Ex: a town’s population increases from 20,000 to 30,000 between 1990 and 2005 (note: rate of change can be positive or negative) (100) time2f - time1f = (100) 30,000-20,000 = 50% time 1f 20,000 time 1f 20,000 Increase of 50% Increase of 50%

“Organizing the Data” Review of: Frequency Distributions & Histograms

Frequency Distributions List or plot data List or plot data Nominal Data -- in any order Nominal Data -- in any order Ordinal & Interval Data – Usually highest number at top of table to lowest number at bottom of the table Ordinal & Interval Data – Usually highest number at top of table to lowest number at bottom of the table

Statistics Class Height Data Plotted from shortest to tallest

Intervals – Grouping Data range of values in the data set range of values in the data set numbers of class intervals desired numbers of class intervals desired size of class interval size of class interval upper limit of a class interval upper limit of a class interval lower limit of a class interval lower limit of a class interval

Statistics Class Height Data Grouped in 2 inch intervals

4” intervals

6” intervals

Cumulative Cumulative Frequencies: number of cases at or below a given score. Cumulative Frequencies: number of cases at or below a given score. Cumulative Percentages: percent of cases at or below a given score. Cumulative Percentages: percent of cases at or below a given score. Also = “percentile rank” Also = “percentile rank”

Class Limits Upper class limit = the highest possible score which would “round down” to be included in that class. Upper class limit = the highest possible score which would “round down” to be included in that class. Lower class limit = the lowest possible score which would “round up” to be included in that class. Lower class limit = the lowest possible score which would “round up” to be included in that class.

Midpoints of Intervals Lowest possible score for that interval Lowest possible score for that interval plus highest possible score value plus highest possible score value Divided by 2 Divided by 2

Midpoints The interval of 58-61” actually has limits from 57.5 to 61.5, so = 119 The interval of 58-61” actually has limits from 57.5 to 61.5, so = /2 = 59.5 is the midpoint. 119/2 = 59.5 is the midpoint. Yes, we’d usually get the same answer by saying ( ) / 2 however, for irregular classes, it is better if we get used to the lowest value being 57.5 and the highest being 61.5.

Cumulative Frequency To expand our frequency table, add columns for cumulative frequency, percent, and cumulative percent. To expand our frequency table, add columns for cumulative frequency, percent, and cumulative percent. Arrange your scores from low at the bottom to high at the top. Then, the Cumulative Frequency is simply the frequency of scores at or below the value in question. Arrange your scores from low at the bottom to high at the top. Then, the Cumulative Frequency is simply the frequency of scores at or below the value in question.

Percentile Rank = the cumulative percentage = the cumulative percentage The % at or below that score The % at or below that score So for a height of 5’4”, or 64”, what is the percentile rank in our height data? So for a height of 5’4”, or 64”, what is the percentile rank in our height data? The following chart shows frequency, cum. freq., percentage, & cumulative %. The following chart shows frequency, cum. freq., percentage, & cumulative %.

2" intervals f cf cf % cum% cum% %100.00% %96.88% %84.38% %71.87% %59.37% %40.62% %18.75% %9.40%

Percentile Rank 64-65” has a cumulative percent of 59.37%, so 59.37% of class is in this category or shorter than this category ” has a cumulative percent of 59.37%, so 59.37% of class is in this category or shorter than this category “ has a cumulative percent of 40.62%, so 40.62% of class is in this category or shorter than this category “ has a cumulative percent of 40.62%, so 40.62% of class is in this category or shorter than this category So, percentile rank = cumulative percent when looking at the raw data -- but it is more complex for grouped data, so be wary. So, percentile rank = cumulative percent when looking at the raw data -- but it is more complex for grouped data, so be wary.

Cross-tabulations

Cross-Tabulation: Cross-tabulation review: Cross-tabulation review: a table which presents the distribution of one variable (frequency and/or %) across the categories of one or more additional variables. a table which presents the distribution of one variable (frequency and/or %) across the categories of one or more additional variables.

Common Cross-Tab Example

Cross-Tab: Table 2.15 If asking questions about the differences between males & females in seat belt use, use column percents. If asking questions about the differences between males & females in seat belt use, use column percents. If asking questions about different uses of seat belts by the population as a whole, use the row percents. If asking questions about different uses of seat belts by the population as a whole, use the row percents. Hint: If totals are not given -- put them in before you start to evaluate. Hint: If totals are not given -- put them in before you start to evaluate.

Cross-Tab: Table 2.15

Data Format on SPSS Note that when you are working with raw data sets on the computer, you will put each case in a row, rather than making a cross-tabulation table. We will do this when we work with SPSS. Note that when you are working with raw data sets on the computer, you will put each case in a row, rather than making a cross-tabulation table. We will do this when we work with SPSS.