Download presentation
Presentation is loading. Please wait.
1
Introduction to Statistics
2
Problems in Statistics A company took the blood pressure of 1000 people of various ages to see if blood pressure increases with age. The weather forecast predictions were compared with the actual weather to see how accurate weather predictions are. A pollster interviews a certain number of voters to predict who will win an upcoming election. A city planning employee records the number of cars that pass through an intersection every hour to determine if a light should be placed there.
3
What is Statistics? Statistics is the science of collecting, simplifying, and describing data, as well as making inferences (drawing conlusions) based on the analysis of data. Data values or observations are the raw materials of statistics. They are numbers in context e.g. the number of those polled ages 30-49 with blood pressure 91 or the number of cars passing through the intersection at 3:00 pm
4
1.1 Displaying Distributions with Graphs
5
Viewing Data For all intents and (intensive?) purposes, data is meaningless if it cannot be interpreted. We present several ways to “see” the data. Depending on the data, some ways of displaying the data are more beneficial than others.
6
Consider the following “data” No context, no units- the data is meaningless. 128018123509 6029343596 15502072601 32416421817
7
This gives a context to the data, but it might not give any kind of insight. Industry description in Montgomery County, PA http://factfinder.census.gov/servlet/GQRTable?_bm=y&-geo_id=05000US42091&- ds_name=EC0200A1&-_lang=en Number of establishments Manufacturing1280 Wholesale trade1812 Retail Trade3509 Information602 Real estate & rental & leasing934 Professional, scientific, & technical services3596 Administrative & support & waste management & remediation service 1550 Educational services207 Health care & social assistance2601 Arts, entertainment, & recreation324 Accommodation & food services1642 Other services (except public administration)1817
8
Things to look for ShapeCenterSpreadOutliers Symmetric, skewed to the right or left Not all of these will be applicable to all graphical displays.
9
Bar Graph
10
Pie Chart
11
Stem and Leaf Plots With bar graph and pie chart, we were interested in both the value and the identity of the object which gave that value. This information may sometimes be either superfluous or confidential. Consider the midterm grades of a class I taught years ago. 81, 89, 82, 82, 79, 85, 76, 54, 75, 75, 78, 71, 83, 88, 52, 86, 89, 89, 84, 79, 80, 85.
12
StemLeaf 5678956789 2 4 1 5 5 6 8 9 9 0 1 2 2 3 4 5 5 6 8 9 9 9 Stem and Leaf This data skews to the right and clusters in the 70-89 range. Should 52 and 54 both be considered outliers?
13
Histogram Unlike a bar graph which displays categorical data, a histogram displays numerical data. We may consider GPA distribution of 20 students with GPAs 3.1, 2.7, 3.2, 2.9, 2.8, 3.1, 3.3, 2.8, 2.9, 3.2, 2.5, 3.9, 3.8, 2.4, 2.7, 2.8, 3.9, 2.6, 3.1, and 3.1
14
Histogram
15
Time Plots A time plot plots an observation against the time it was measured. A pattern that repeats itself at regular intervals is a seasonal variation. We can graph the working hours per week over the years in the United States (www.gapminder.org)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.