Download presentation
Presentation is loading. Please wait.
Published byHolly Tyler Modified over 9 years ago
1
HUDM4122 Probability and Statistical Inference January 26, 2015
2
ASSISTments Did everyone get an account for the ASSISTments system? Did anyone have difficulties setting up an account? First homework is due in a week
3
Today Ch. 1 in Mendenhall, Beaver, & Beaver Variables and Variable Types Graphing Data Basic Exploratory Data Analysis
4
Variables What is a variable?
5
Variables What is a variable? “A variable is a characteristic that changes or varies over time and/or for different individuals or objects under consideration.” – MBB p. 8
6
Which of these are examples of variables? GPA Shoe size Age Number of correct answers in ASSISTments Number of times gamed the system in ASSISTments Favorite vegetable Favorite type of pie Pi
7
What is a measurement?
8
A measurement is the result of measuring a variable on a single experimental unit – A person, if you are studying people – A class, if you are studying classes – A pizza, if you are studying pizzas
9
A measurement Person furthest towards my left in the front row, what is your name?
10
Now I have a measurement
11
A measurement Person furthest towards my right in the second row, what is your name?
12
Now I have data A set of measurements
13
Now I have data A set of measurements Note that in stats class or education journals, the word “data” is plural
14
Now I have data A set of measurements Note that in stats class or education journals, the word “data” is plural I only know one exception
15
Now I have data A set of measurements Note that in stats class or education journals, the word “data” is plural I only know one exception
16
Everyone repeat after me
17
“My data are in this Excel file.”
18
Everyone repeat after me “My data are in this Excel file.” “Your data aren’t evidence for that conclusion.”
19
Everyone repeat after me “My data are in this Excel file.” “Your data aren’t evidence for that conclusion.” “His data were hard to collect.”
20
However…
21
I do not recommend insisting that data is plural in bars, on first dates, or at Thanksgiving dinner
22
Any questions or concerns?
23
Univariate Data A single variable is collected Height 5’11” 5’10” 5’6”
24
Univariate Data Two variables are collected (for the same data point) HeightDrum-Playing Skill 5’11”1 2 5’10”4 5’6”8
25
Multivariate Data 3+ variables are collected NameHeightDrum-Playing Skill John Lennon5’11”1 Paul McCartney5’11”2 George Harrison5’10”4 Ringo Starr5’6”8
26
Any questions or concerns?
27
Types of Variables
28
Quantitative/Numerical Data Data that can be expressed as numbers
29
What are some examples Of numerical data?
30
Ordinal Data Refers to data where there is a known order, but either – The data clearly isn’t numbers – The space between values is not guaranteed to be equal
31
Examples of Ordinal Data Months of the year: January, February, March, April, … Agreement level: Strongly Agree, Agree, Neutral, Disagree, Strongly Disagree Quality of university: Highly selective, selective, somewhat selective, non-selective
32
Other examples of ordinal data?
33
Nominal data Values have no order or spacing Name State of Residence – New Jersey is not greater or less than New York
34
Nominal data Values have no order or spacing Name State of Residence – New Jersey is not greater or less than New York – Although my brother might disagree
35
Other Examples of Nominal Data?
36
Another name Nominal data is often also called categorical data
37
Another name Nominal data is often also called categorical data Technically ordinal data is also categorical, but no one ever uses the term that way
38
Any questions or concerns?
39
Exploratory Data Analysis “Analyzing data sets to summarize their main characteristics” “Seeing what the data can tell us beyond the formal modeling or hypothesis testing task”
40
Goal Generate hypotheses Understand your data better
41
Often (but not always) done with graphs
42
Which of these is your favorite type of graph? Pie chart Bar graph Frequency histogram Line graph Scatterplot Stem-and-leaf plot Box plot Other
43
Pie Chart Take a set of categories that add to 100% Show the proportion each category has
44
Pie Chart: Example
45
Interpret This Graph Please
46
Never Ever Do This: Completely Visually Misleading Fair use; critique
47
Let’s make a pie chart Using the “your favorite graph” data
48
Any questions?
49
Alternative: Bar Graphs
50
Interpret this graph please
51
What are the advantages/disadvantages relative to pie chart?
52
By the way: X and Y axes X axis Y axis
53
Strengths of bar graphs Categories don’t have to add to 100% Easier to see small differences between categories You can compare variables too
54
Two-group bar graph
55
Let’s make a bar graph Using the “your favorite graph” data
56
Any questions?
57
Some suggest always using bar graphs instead of pie charts
58
“The only thing worse than a pie chart is several of them.” – Edward Tufte “Save the pies for dessert.” – Stephen Few
59
But they’re wrong
60
Pie charts are good for representing part- whole relationships in really easy to see ways Pie charts are good at representing overall proportions
61
Nice example (Gabrielle, 2013)
62
Any questions?
63
Frequency Histogram A type of bar graph – But usually when people say “bar graph”, they do not mean “frequency histogram” – Also: by convention, no space between bars X axis shows values or ranges of a quantitative variable Y axis shows how many data points have that value or range for the quantitative variable
64
Example from the book Visits to Starbucks
65
Another Example
66
Was this an easy exam or a hard exam?
67
Would you rather be in the blue class or the orange class?
68
By the way: outliers OUTLIER
69
If there’s time, let’s make a frequency histogram Everybody: What’s your height in feet-inches? (Example: I’m 5’9”)
70
Any questions?
71
Line Graph Shows trends from left-to-right The trend is usually over time But it doesn’t have to be…
72
Example Line Graph http://www.wilderdom.com/personality/L4-1IntelligenceNatureVsNurture.html http://www.wilderdom.com/personality/L4-1IntelligenceNatureVsNurture.html Used under Creative Commons License
73
Example Line Graph (VanLehn, 2011) (This graph shows perceptions, not data on effectiveness.)
74
Any questions?
75
Not going to discuss today Stem-and-leaf plot Very, very rare to see in actual use Quite poor for any sizable data set If you want to learn about them, see the book
76
Future Classes Scatterplot Box plot
77
Upcoming Classes 1/28 Describing Data with Numerical Measures – Ch. 2 2/2 Describing Bivariate Data (Asgn. 1 due) – Ch. 3 2/4 Introduction to Probability – Ch. 4
78
Questions? Comments?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.