Statistics 270 - Lecture 2. Last class began Chapter 1 (Section 1.1) Introduced main types of data: Quantitative and Qualitative (or Categorical) Discussed.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Descriptive Measures MARE 250 Dr. Jason Turner.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
Statistics 100 Lecture Set 6. Re-cap Last day, looked at a variety of plots For categorical variables, most useful plots were bar charts and pie charts.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
1 Chapter 1: Sampling and Descriptive Statistics.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
QBM117 Business Statistics
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 12: Describing Distributions with Numbers We create graphs to give us a picture of the data. We also need numbers to summarize the center and spread.
REPRESENTATION OF DATA.
Chapter 1: Exploring Data AP Stats, Questionnaire “Please take a few minutes to answer the following questions. I am collecting data for my.
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Chapter 2 Describing Data.
14.1 Data Sets: Data Sets: Data set: collection of data values.Data set: collection of data values. Frequency: The number of times a data entry occurs.Frequency:
Skewness & Kurtosis: Reference
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Categorical vs. Quantitative…
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
To be given to you next time: Short Project, What do students drive? AP Problems.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
1 Chapter 2: Exploring Data with Graphs and Numerical Summaries Section 2.1: What Are the Types of Data?
Statistics Lecture 3. Last class: types of quantitative variable, histograms, measures of center, percentiles and measures of spread…well, we shall.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Plan for Today: Chapter 11: Displaying Distributions with Graphs Chapter 12: Describing Distributions with Numbers.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Descriptive Statistics
Lecture #3 Tuesday, August 30, 2016 Textbook: Sections 2.4 through 2.6
Exploratory Data Analysis
Describing Distributions Numerically
Chapter 1: Exploring Data
Laugh, and the world laughs with you. Weep and you weep alone
DAY 3 Sections 1.2 and 1.3.
Topic 5: Exploring Quantitative data
Describing Distributions of Data
Warmup Draw a stemplot Describe the distribution (SOCS)
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Statistics Lecture 2

Last class began Chapter 1 (Section 1.1) Introduced main types of data: Quantitative and Qualitative (or Categorical) Discussed ways to describe qualitative data with plots: Bar Chart & Pie Chart Which is more flexible - Pie Chart or Bar Chart? Why? Is there a difference in shape of a Bar Chart if show counts or percentages?

Important Stuff Statistics and Actuarial Science Stats Lab (Statistics Workshop) What is Stats Lab for?: One-on-one help is available during its operation hours. Where is it? The Stats Lab is located in K9516 (inside k9510). How does the Stats Lab Work? Statistics Lab Schedule The Statistics Workshop opens for regular use from the second week of classes. The hours will depend on the amount of T.A. time available and will be posted at the end of the first week of classes. The Workshop will be open only when there is a T.A. on duty. Typically, Mon-Fri: 9:30-16:30

Important Stuff Course web page can be found: Download lecture notes day before class

Important Stuff Assignments: 5-7 of them Will be due Friday, before 4:30 in boxes outside lab The boxes are labelled by course and also by the first letter of your last name Note: 1.Late assignments will not be accepted 2.Assignments placed in the wrong box (e.g., stat 201) will not be accepted 3.Class list…watch out

Important Stuff Office hours: M-W 3:00-4:00…or appointment… Mid-Term: Friday, February 17 ….Day before reading break.

Some suggested problems: Chapter 1: 1, 5, 13 or 14, 19, 26, 29, 33 Today: Finish with Chapter 1.2, 1.3 and start 1.4 Will do all of Chapter 1…except stem-leaf & time plots Please read Chapter 1…nice introduction

Plots for Quantitative Variables Can summarize quantitative data using plots Most common plots - histogram and box-plots Will introduce box-plots later Must first distinguish between types of quantitative data

Types of Quantitative Data Discrete Variable: A variable is discrete if the set of possible values is finite or else is countably infinite Continuous Variable: A variable is continuous if its possible values consist of interval(s) on the real line

Example (discrete data) In a study of productivity, a large nuber of authors were classified according to the number of articles they published during a particular period of time.

Example (continuous data) Experiment was conducted to investigate the muzzle velocity of a anti-personnel weapon (King, 1992) Sample of size 16 was taken and the muzzle velocity (MPH) recorded

Histogram – Discrete Data Uses rectangles to show number (or percentage) of values in intervals Y-axis usually displays counts or percentages X-Axis usually shows intervals Rectangles are all the same width

Constructing a Histogram – Discrete Data Determine frequency of each value (e.g., number of articles) Mark the possible values for the distribution on an axis (usually the x-axis) Draw a rectangle over each value with height is the frequency (or relative frequency)

Example (discrete data)

Histogram – continuous data Uses rectangles to show number (or percentage) of values in intervals Y-axis usually displays counts or percentages X-Axis usually shows intervals Rectangles are all the same width

Constructing a Histogram – continuous data Find minimum and maximum values of the data Divide range of data into non-overlapping intervals of equal length Count number of observations in each interval Height of rectangle is number (or percentage) of observations falling in the interval How many categories?

Example Experiment was conducted to investigate the muzzle velocity of a anti-personnel weapon (King, 1992) Sample of size 16 was taken and the muzzle velocity (MPH) recorded

What are the minimum and maximum values? How do we divide up the range of data? What happens if have too many intervals? Too Few intervals? Suppose have intervals from and In which interval is the data point 250 included?

Histogram of Muzzle Velocity

What Does a Histogram Demonstrate?

Numerical Summaries Graphic procedures visually describe data Numerical summaries can quickly capture main features Will consider data coming from large populations

Measures of Center Have sample of size n from some population, An important feature of a sample is its central value. Most common measures of center - Mean & Median

Sample Mean The sample mean is the average of a set of measurements The sample mean:

Sample Median Have a set of n measurements, Median is point in the data that divides the data in half Viewed as the mid-point of the data To compute the median: Sort the data from smallest to largest If n is odd, the median is the ordered value If n is even, the median is the average of the and the ordered value

Muzzle Velocity Example

Sample Mean vs. Sample Median Sometimes sample median is better measure of center Sample median less sensitive to unusually large or small values called For symmetric distributions the relative location of the sample mean and median is For skewed distributions the relative locations are

Other Measures of interest Maximum Minimum

Percentile - The 100p th percentile is point where at least 100p% of data are at or above and 100(1-p)% are at or below.

To compute 100p th percentile of data, Order data from largest to smallest Compute np If np is not an integer, round up to nearest integer, k. The k th ordered value is the desired percentile. If np is an integer, k, the desired percentile is the average of the k th and (k+1) th ordered values.

Important Percentiles First Quartile Second Quartile Third Quartile

Example

Measures of Spread Location of center does not tell whole story Usually report measure of center and also spread Spread may be different for two distributions, even if center is the same

Example: Lifetime of 2 types of car (100 cars of each brand)... which would you buy?

Range=Max-Min Interquartile Range (IQR) = Q 3 -Q 1 IQR typically better than range because IQR can be used to detect outliers: