Prof. Eric A. Suess Chapter 3

Slides:



Advertisements
Similar presentations
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Advertisements

Descriptive statistics (Part I)
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
AP STATISTICS Section 1.1: Displaying Distributions.
Programming in R Describing Univariate and Multivariate data.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 3.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Methods for Describing Sets of Data
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
Chapter 1 The Role of Statistics. Three Reasons to Study Statistics 1.Being an informed “Information Consumer” Extract information from charts and graphs.
Chapter 2: Methods for Describing Sets of Data
Univariate Data Chapters 1-6. UNIVARIATE DATA Categorical Data Percentages Frequency Distribution, Contingency Table, Relative Frequency Bar Charts (Always.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Chapter 2 Describing Data.
Lecture 3 Describing Data Using Numerical Measures.
Unit 4 Statistical Analysis Data Representations.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
Math 145 September 11, Recap  Individuals – are the objects described by a set of data. Individuals may be people, but they may also be animals.
The field of statistics deals with the collection,
Descriptive Statistics – Graphic Guidelines
What is Statistics?. Statistics 4 Working with data 4 Collecting, analyzing, drawing conclusions.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Unit 1 - Graphs and Distributions. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
Descriptive Statistics – Graphic Guidelines Pie charts – qualitative variables, nominal data, eg. ‘religion’ Bar charts – qualitative or quantitative variables,
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Descriptive Statistics
COMPLETE BUSINESS STATISTICS
UNIT ONE REVIEW Exploring Data.
Types of variables Discrete VS Continuous Discrete Continuous
Exploratory Data Analysis
Methods for Describing Sets of Data
Chapter 1.1 Displaying Distributions with graphs.
Math 201: Chapter 2 Sections 3,4,5,6,7,9.
Chapter 3 Describing Data Using Numerical Measures
MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor
Chapter 2: Methods for Describing Data Sets
Chapter 4 Review December 19, 2011.
Looking at data Visualization tools.
Unit 4 Statistical Analysis Data Representations
Chapter 1 & 3.
Descriptive Statistics
CHAPTER 5 Basic Statistics
Statistical Reasoning
Laugh, and the world laughs with you. Weep and you weep alone
Distributions and Graphical Representations
Unit 1 - Graphs and Distributions
Basics of Statistics.
Chapter 3 Describing Data Using Numerical Measures
Displaying Distributions with Graphs
Descriptive Statistics
An Introduction to Statistics
Topic 5: Exploring Quantitative data
Frequency Distributions
Displaying Quantitative Data
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
Means & Medians.
Basic Practice of Statistics - 3rd Edition
Welcome!.
Basic Practice of Statistics - 3rd Edition
Honors Statistics Review Chapters 4 - 5
CHAPTER 1 Exploring Data
Probability and Statistics
Section 1.1: Displaying Distributions
Advanced Algebra Unit 1 Vocabulary
Math 341 January 24, 2007.
Presentation transcript:

Prof. Eric A. Suess Chapter 3 Statistics 3502/6304 Prof. Eric A. Suess Chapter 3

Data Description – One Variable We see descriptions of data all the time. Look at your phone. Go to the doctor. Drive your car. The wireless signal one your phone describes how much of a connection you have to a cell tower. Do you test positive for a disease? How fast am I going? Am I going too fast? Too slow?

Data Description – One Variable Today we will discuss the description of data collected on one variable. We will discuss graphical and numerical methods, such as, pie chart and bar graphs and time plots, and, such as, means, medians, modes and standard deviation. We will discuss the use of Excel and Minitab to make graphs and to computer descriptive Statistics.

Descriptive Statistics and Inferential Statistics The field of Statistics is broken into two main areas. One if Descriptive Statistics and the other is Inferential Statistics. In Descriptive Statistics we work to describe the data and to communicate the big picture and patters in the data. Inferential Statistics uses probability to model the data and to help reach conclusion about the presence of underlying patterns. We start with Descriptive Statistics, sometimes called Exploratory Data Analysis.

Graphical Methods Categories Data is often simplified into ordered or unordered groups or categories. Examples: Gender (Female, Male) Income (Low, Medium, High) Industry (Agriculture, Construction, etc.) see Table 3.4 page 63

Graphical Methods Pie Charts Exercise 3.1 Use MS Excel

Graphical Methods Pie Charts are used with data that is summarized into categories. Each slice of the pie represents the portion or percentage of the pie from each category. Relative Frequency or percentages are usually used.

Graphical Methods Bar Graphs Exercise 3.1 MS Excel

Data Data: Data are values recorded for variables from individuals. There are different types of data. The two main types of data are: Qualitative – which means categorical Quantitative – which means numerical Examples: Hair Color, Height Different graphs are used for different types of data.

Types of Graphs Pie Charts and Bar Graphs are used for Qualitative Data. MS Excel is used to produce these graphs. Histograms are used for Quantitative Data. Minitab is used to produce these graphs. Stem-and-Leaf plots are used for Quantitative Data. By hand or Minitab is used to produce these graphs.

Describing the shape of Histograms Histograms are used to display the distribution of the values of a quantitative variable. The language: Unimodal Bimodal Uniform Symmetric Skewed to the right/+ Skewed to the left/- Page 71

Making a Stem-and-Leaf plot Take the values and split them into stems on the left and leaves on the right. List the stems in order, not skipping and numbers in the list, from smallest to largest. List the leaves in order to the right.

Example 10, 22, 31, 45, 47, 49, 50, 37, 70 Minitab Express Minitab Stem-and-leaf of values N = 9 Leaf Unit = 1.0 1 1 0 2 2 2 4 3 17 (3) 4 579 2 5 0 1 6 1 7 0

Time Series Time Series plots are made for quantitative data recorded in time. Plots of stock market data is a good example. See yahoo finance.

General Guidelines for Successful Graphics See page 82 for the authors guidelines. The main guideline that is important to consider is the first one. What message are you trying to send to the viewer?

Numerical Methods – Center and Spread Measures of central tendency measure the center of the data. Measures of spread or variation measure the variability of the data. What is a parameter? A population measure. What is a statistics? A sample measure. When we compute these measure we are computing statistics. These days often referred to as Analytics.

Numerical Methods – Center Mean, Median, Mode Mode – most common value Median – 50th percentile, middle Mean – average

Numerical Methods – Center To find the median, order the data, find the middle value, if an even number of values, average the two middle values. To calculate the mean, add all the values together and divide by the number of values. The sample size is 𝑛 𝑦 = 𝑦 1 + 𝑦 2 +…+ 𝑦 𝑛 𝑛 = 𝑦 𝑛

Numerical Methods – Outliers What is an outlier? Values that are a long way away from the rest. Sometime they result from errors in the recording of the data. Other times they are part of the data. Example: Income

Numerical Methods – Spread What is less variable? What is more variable? Figure 3.16 on page 91

Numerical Methods – Spread Range = Maximum value – Minimum value The p-th percentile, value with p% of the values below.

Numerical Methods – Spread Inner Quartile Range = 75th percentile – 25 percentile Deviation – how far a value is from the mean 𝑦 𝑖 − 𝑦 Variance – sample variance 𝑠 2 = 𝑦 𝑖 − 𝑦 2 𝑛−1 Standard Deviation – sample standard deviation 𝑠= 𝑠 2 Use MS Excel or Minitab to compute these values for a data set.

Example Figure 3.21 page 96 68, 63, 67, 61, 66

Numerical Methods – Spread Empirical rule – 68-95-99 rule, page 93 Given a set of 𝑛 values possessing a mound-shaped histogram, then 𝑦 ±𝑠 contains approximately 68% of the observations 𝑦 ±2𝑠 contains approximately 95% of the observations 𝑦 ±3𝑠 contains approximately 99.7% of the observations See Figure 3.22 page 100

Numerical Methods – Spread Box Plots plot the 5 number summary Minimum, 25th percentile, Median, 75th percentile, Maximum Use Minitab to produce Box Plots.

Next Time Next Time we will discuss how to describe data for two or more variables. Contingency Tables Stacked Bar Graphs Cluster Bar Graphs Scatterplots, the Scatterplot matrix