Frequency Tables and Single variable Graphics

Slides:



Advertisements
Similar presentations
Números.
Advertisements

AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
Review of Descriptive Graphs and Measures Here is a quick review of what we have covered so far. Pie Charts Bar Charts Pareto Tables Dotplots Stem-and-leaf.
Worksheets.
STATISTICS Linear Statistical Models
1 When you see… Find the zeros You think…. 2 To find the zeros...
Describing Data: Frequency Distributions and Graphic Presentation
4-4 Variability Objective: Learn to find measures of variability.
/4/2010 Box and Whisker Plots Objective: Learn how to read and draw box and whisker plots Starter: Order these numbers.
12.3 – Analyzing Data.
Frequency Tables, Stem-and-Leaf Plots, and Line Plots 7-1
CS1512 Foundations of Computing Science 2 Lecture 20 Probability and statistics (2) © J R W Hunter,
The 5S numbers game..
Box and Whiskers with Outliers. Outlier…… An extremely high or an extremely low value in the data set when compared with the rest of the values. The IQR.
St. Edward’s University
The basics for simulations
Very simple to create with each dot representing a data value. Best for non continuous data but can be made for and quantitative data 2004 US Womens Soccer.
Introduction to the Practice of Statistics
Frequency Distributions Quantitative Methods in HPELS 440:210.
Aim: How do we organize and interpret statistical data?
Frequency Tables and Stem-and-Leaf Plots 1-3
Chapter 2: Frequency Distributions
Introduction Our daily lives often involve a great deal of data, or numbers in context. It is important to understand how data is found, what it means,
2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.
Before Between After.
2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.
Copyright © 2011 Pearson Education, Inc. Statistical Reasoning.
Static Equilibrium; Elasticity and Fracture
Resistência dos Materiais, 5ª ed.
Practice Descriptive statistics Tables Graphs. Birthweights of 60 infants are given below:
Chart Deception Main Source: How to Lie with Charts, by Gerald E. Jones Dr. Michael R. Hyman, NMSU.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
IB Math Studies – Topic 6 Statistics.
It’s an outliar!.  Similar to a bar graph but uses data that is measured.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Descriptive statistics (Part I)
CHAPTER 1: Picturing Distributions with Graphs
1. Statistics 2. Frequency Table 3. Graphical Representations  Bar Chart, Pie Chart, and Histogram 4. Median and Quartiles 5. Box Plots 6. Interquartile.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Frequency Distributions and Graphs
Numerical Descriptive Measures
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Descriptive Statistics
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
STATISTICS I COURSE INSTRUCTOR: TEHSEEN IMRAAN. CHAPTER 4 DESCRIBING DATA.
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Percentiles and Box – and – Whisker Plots Measures of central tendency show us the spread of data. Mean and standard deviation are useful with every day.
Chapter 2 Describing Data.
Describing and Displaying Quantitative data. Summarizing continuous data Displaying continuous data Within-subject variability Presentation.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Descriptive Statistics. Frequency Distributions and Their Graphs What you should learn: How to construct a frequency distribution including midpoints,
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
1 ES Chapter 1 & 2 ~ Descriptive Analysis & Presentation of Single-Variable Data Mean: inches Median: inches Range: 42 inches Variance:
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Descriptive Statistics
Exploratory Data Analysis
Statistical Reasoning
Frequency Distributions and Graphs
Honors Statistics Review Chapters 4 - 5
Presentation transcript:

Frequency Tables and Single variable Graphics

FREQUENCY DISTRIBUTION Listing a large set of data does not present much of a picture to the reader. Sometimes we want to condense the data into a more manageble form. This can be accomplished with the aid of a frequency distribution.

To demonstrate the concept of a frequency distribution, let’s use the following set of data: The frequency for x=1 is 3 3 2 2 3 2 4 4 1 2 2 4 3 2 0 2 2 1 3 3 1 A frequency distribution is used to represent this set of data by listing the x values with their frequencies. For example, the value 1 occurs in the sample three times;

frequency distribution The frequency f is the number of times the value x occurs in the sample. x f 1 3 2 8 5 4 3 2 2 3 2 4 4 1 2 2 4 3 2 0 2 2 1 3 3 1 Ungrouped frequency distribution We say ungrouped because each value of x in the distribution stands alone.

CONSTRUCTION OF A FREQUENCY TABLE Classes: When a large set of data has many different x values instead of a few repeated values, as in the previous example, we can group the data into a set of classes and construct a frequency table. Number of classes: It can be take a value between 8 and 15. Lower and upper class limits: Lower class limit is the smallest piece of data that could go into each class. The upper class limits are the largest values fitting into each class.

Class interval is the difference between a lower class limit and the next lower class limit. Relative frequency is a propotional measure of the frequency of an occurence. Class mark (class mid-point) is the numerical value that is exactly in the middle of each class. Class boundaries (true class limits) are numbers that do not occur in the sample data but are halfway between the upper limit of one class and the lower limit of the next class.

(there are some exceptions) The two basic guidelines that should be followed in constructing a grouped frequency distribution are: Each class should be of the same width. (there are some exceptions) Classes should be set up so that they do not overlap and so that each piece of data belongs to exactly one class

n=155 (There are 155 observations) 157 148 165 116 119 165 ........140 198 208 161 312 200 n=155 (There are 155 observations) PROCEDURE OF CLASSIFICATION Rank the data. Identify lowest (L) and highest (H) scores and find the range (range=H-L) Select the number of classes and find class width L=116, H=315 Range=312-116=196 #of classes=8 Class Int.=196/8=24,525

CONSTRUCTION OF THE FREQUENCY DISTRIBUTION TABLE Relative Frequency =100*(8/155)=5.2 115.5 315.5

mi fi L A C=25

Measures of central tendency Mean or Median Mode is 203

Measures of position What are the 25th, 75th percentiles and the median?

25-13.5 x-165.5 x=? 33.5-13.5 25=x1 190.5-165.5

25% of observations lie below 179.88. 165.5 13.5 X 25 190.5 33.5 P25=Q1=179.88 25% of observations lie below 179.88.

Standart deviation or Coefficient of variation

Graphic Presentation of Data We will learn how to present single-variable data by using graphical technique. There are several graphic ways to describe data. The method used is determined by the type of data and the idea to be presented.

BAR GRAPH AND PIE GRAPH Bar graph and pie (circle) graph are often used to summarize attribute data. Data are represented by frequency or proportion. In graphical presentation, proportion is more meaningful than frequency. In a bar graph; x axis represents the attribute, while y axis (bar’s height) represents proportion or frequency of each attribute. In a pie graph, each piece represents proportion of attribute.

Example Marital status of woman are given below: Marital status Freq. % Single 65 46.8 Married 32 23.0 Divorced 27 19.4 Widowed 10 7.2 Separate 5 3.6 Total 139 100.0

Bar chart of marital status of woman 50 40 30 Percent 20 10 Single Married Divorced Widowed Separate Marital Status Bar chart of marital status of woman

Pie chart of marital status of woman 3,6% 7,2% 19,4% 23,0% 46,8% Separate Widowed Divorced Married Single Pie chart of marital status of woman

STEM AND LEAF PLOT This plot provides a convinient means of tallying the observations and can be used as a direct display of data or as a preliminary step in constructing a frequency table. The stem is leading digit(s) of the data, while the leaf is the trailing digit(s). For example, the numerical value 458 might be split into stem (45) and leaf (8).

Let’s construct a stem-and-leaf display of following set of 20 test scores: 82 74 88 66 58 74 78 84 96 76 62 68 72 92 86 76 52 76 82 78 At a quick glance we see that there are scores in 50s, 60s, 70s, 80s and 90s. Let’s use the first digit of score as the stem and second digit as the leaf.

We will construct the display in a vertical position We will construct the display in a vertical position. Draw a vertical line and to the left of it locate the stems in order. Next we place each leaf on its stem. This is accomplished by placing the trailing digit on the right side of the vertical line opposite to its corresponding leading digit. 5 6 7 8 9 2 8 2 6 8 2 4 4 6 6 6 8 8 2 2 4 6 8 2 6

All scores with the same tens digit are placed on the same branch All scores with the same tens digit are placed on the same branch. This may not always be desired. Suppose we construct the display; this time instead of grouping ten possible values on each stem, let’s group the values so that only five possible values could fall in each stem. (50-54) 5 (55-59) 5 (60-64) 6 (65-69) 6 (70-74) 7 (75-79) 7 (80-84) 8 (85-89) 8 (90-94) 9 (95-99) 9 2 8 6 8 4 4 2 8 6 6 6 8 2 4 2 8 6 6

HISTOGRAM Histogram is a type of bar graph representing the frequency distribution of quantitative data. A histogram is made up of the following components: A title, which identifies the sample of concern. A vertical scale, which identifies the frequencies (relative frequencies) in the various classes. A horizantal scale, which identifies the variable x (class mid-points or true class limits or lower class limits).

Birthweights of 60 infants are given below:

bwt 4 8 12 Count 1800,5 2226,5 2652,5 3078,5 3504,5 3930,5 4356,5 4756,5

bwt 0% 5% 10% 15% 20% Percent 1800,5 2226,5 2652,5 3078,5 3504,5 3930,5 4356,5 4756,5

20% 15% Percent 10% 5% 0% bwt 12 Count 8 4 bwt 1800,5 2226,5 2652,5 4 8 12 1800,5 2226,5 2652,5 3078,5 3504,5 3930,5 4356,5 4756,5 Percent bwt 0% 5% 10% 15% 20% 1800,5 2226,5 2652,5 3078,5 3504,5 3930,5 4356,5 4756,5 Count

Symmetric Distribution Right-skewed Distribution Left-skewed Distribution

BOX PLOT (BOX AND WHISKER PLOT) The median and first and third quartiles of the distribution are used in constructing box plots. The location of the midpoint or median of the distribution is indicated with a horizontal line in the box. Straight lines or whiskers extend 1.5 times the interquartile range above and below the 75th and 25th percentiles when there are outliers or extreme observations. If they do not exist, lines represent minimum and maximum values. Cases with values between 1.5 and 3 box lengths from the upper or lower edge of the box are called outliers. Cases with values more than 3 box lengths from the upper or lower edge of the box are called extreme points.

6000 5000 4000 3000 2000 1000

Since there are no outliers BWT Since there are no outliers Maximum 4990 75 3677,5 Percentiles 50 (Median) 3110,0 25 2553,5 Minimum 1588 Range 3402

Mode Median Mean Left Skewed Right Skewed Simetric

SCATTER PLOT WITH ONE VARIABLE Scatter plot displays the value of each observation by a small circle, on an invisible line which is parallel to the y-axis displaying original measurement. BWT 6000 5000 4000 3000 2000 1000

LINE GRAPH In line graph, individual data points are connected by a line. Line plots provide a simple way to visually present a sequence of many values.

The distribution of measles cases among seansons in an area are as follows: SEASONS Winter Fall Summer Spring Frequency 120 100 80 60 40 20 Spring 75 Summer 25 Fall 50 Winter 100

ERROR BARS Error bars help you visualize distributions and dispersion by indicating the variability of the measure being displayed. The mean of a scale variable is plotted for a set of categories, and the length of the bar on either side of the mean value indicates standard deviations. Error bars can extend in one direction or both directions from the mean. Error bars are sometimes displayed in the same chart with other chart elements such as bars or lines.

BWT Mean  1 SD BWT 4000 3000 2000 BWT Mean  2 SD BWT 5000 4000 3000 2000 1000