Download presentation
Presentation is loading. Please wait.
Published byJohn Brown Modified over 9 years ago
1
Describing Data: Graphical Methods ● So far we have been concerned with moving from asking a research question to collecting good quality empirical data ● From now on we'll be concerned with how to make sense of our data. ● Statistical methods can be divided into two main categories: 1) Descriptive and 2) Inferential ● Within 1), graphical or numerical ● Today we focus on graphical description of data (and briefly introduce the software.) ● Note: sections this week meet in Solis 105, a computer lab.
2
Our raw data can be a mess to look at!
3
Raw Data How do we get it before it gets us?
4
Distribution ● Tells what values a variable takes and how often it takes these values ● We want to know, at the minimum, some key characteristics of a distribution, such as central tendency, spread, and general shape ● A distribution can be expressed by a table (e.g., frequency table), graph (e.g. histogram), or function (e.g. density function)
5
Graphing Distributions, etc. ● Categorical Variables (nominal or ordinal) – Places an individual into one of several groups or categories ● Display distribution with a bar graph or a pie chart ● Quantitative (Numerical) – Takes numerical values for which arithmetic operations such as adding and averaging make sense ● Display distribution with a histogram or a stem plot (for small dataset) ● Examine trend over time with a line graph ● Explore relationships with a scatter plot (later)
6
Frequency Table
7
Pie Chart Showing Composition
8
Bar Plot (Can be used on non-compositional data as well)
9
Histogram (Vertical axis could also be % or relative frequency)
10
Another Histogram
11
Yet another Histogram (Small data set—could do stemplot too)
12
% of residents aged 65 and older
13
What to Look For in a Graph of Distribution
14
What to Look For in Any Graph
15
Outliers ● Extreme values, far from the rest of the data. ● May occur due to error in recording, or measuring ● Observational unit may be fundamentally different. ● Outliers need be taken care of before conducting further statistical analysis e.g. If someone in the class is a retired person returning to college, his age would be an outlier (but his weight probably not)
16
Line Graph
17
Beware of Distracting/Misleading Pictograms
19
And Watch the Scales!
20
Making Good Graphs ● Use titles/captions, labels (and legends if necessary) to help deliver what the graph intends to show. Make sure the meaning of variables and their measurement units are clear. Be careful with the scales used. ● Make the data stand out. Avoid distracting grids, artwork, etc. ● Pay attention to what the eyes perceive. Avoid pictograms and tacky effects.
21
Introduction to Stata ● Like any statistical package, has utilities for: – Data input/output – Results saving/printing – Data processing (variable transformation, sampling, sorting/ranking, etc.) – Statistical analysis (summary stats, hypothesis tests, regression models, etc.) – Graphics (bar plots, pie charts, histograms. Stem plots, line plots, scatter plots, etc.) ● Illustrate using sample data: gss2002.dta – use http://dss.ucsd.edu/~lazeng/ps30/gss2002.dtahttp://dss.ucsd.edu/~lazeng/ps30/gss2002.dta – List, desc, sum, codebook, histogram, tab.. ● Sections this week (meet in Solis 105) will explore more.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.