Experimental Statistics I.  We use data to answer research questions  What evidence does data provide?  How do I make sense of these numbers without.

Slides:



Advertisements
Similar presentations
Analyzing Data (C2-5 BVD) C2-4: Categorical and Quantitative Data.
Advertisements

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
1.1 Displaying and Describing Categorical & Quantitative Data.
Descriptive statistics (Part I)
MA 331 Intermediate Statistics Fall 2006 Webpage: aching/ /index331.html.
CHAPTER 1: Picturing Distributions with Graphs
AP Statistics Day One Syllabus AP Content Outline Estimating Populations and Subpopulations.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Objectives (BPS chapter 1)
AP STATISTICS Section 1.1: Displaying Distributions.
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Chapter 1 – Exploring Data YMS Displaying Distributions with Graphs xii-7.
Introduction to the Practice of Statistics Fifth Edition Chapter 1: Looking at Data—Distributions Copyright © 2005 by W. H. Freeman and Company Modifications.
MA 222 Statistics Lectures Lecture 1. Data, Data, Data, all around us ! We use data to answer research questions We use data to answer research questions.
AP Statistics Introduction & Chapter 1.1 Variables, Distributions & Graphs Goals: What will we know and be able to do as a result of today’s Lesson?
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
StatisticsStatistics Graphic distributions. What is Statistics? Statistics is a collection of methods for planning experiments, obtaining data, and then.
1 Chapter 3 Looking at Data: Distributions Introduction 3.1 Displaying Distributions with Graphs Chapter Three Looking At Data: Distributions.
EXPLORING DATA LESSON 1 – 1 Day 2 Displaying Distributions with Graphs Displaying quantitative variables.
Ch. 1 Looking at Data – Distributions Displaying Distributions with Graphs Section 1.1 IPS © 2006 W.H. Freeman and Company.
Lesson 1 – 1a from Displaying Distribution with Graphs.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
BPS - 5th Ed. Chapter 11 Picturing Distributions with Graphs.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Essential Statistics Chapter 11 Picturing Distributions with Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
Math 145 September 11, Recap  Individuals – are the objects described by a set of data. Individuals may be people, but they may also be animals.
CHAPTER 1 Picturing Distributions with Graphs BPS - 5TH ED. CHAPTER 1 1.
Statistics is... a collection of techniques for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting,
Bell Ringer You will need a new bell ringer sheet – write your answers in the Monday box. 3. Airport administrators take a sample of airline baggage and.
Chapter 0: Why Study Statistics? Chapter 1: An Introduction to Statistics and Statistical Inference 1
1 Take a challenge with time; never let time idles away aimlessly.
Statistics - is the science of collecting, organizing, and interpreting numerical facts we call data. Individuals – objects described by a set of data.
Picturing Distributions with Graphs BPS - 5th Ed. 1 Chapter 1.
Class Two Before Class Two Chapter 8: 34, 36, 38, 44, 46 Chapter 9: 28, 48 Chapter 10: 32, 36 Read Chapters 1 & 2 For Class Three: Chapter 1: 24, 30, 32,
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
UNIT ONE REVIEW Exploring Data.
The rise of statistics Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain understanding from.
Chapter 1.1 Displaying Distributions with graphs.
Chapter 1: Exploring Data
Looking at data Visualization tools.
Statistical Reasoning
Laugh, and the world laughs with you. Weep and you weep alone
CHAPTER 1: Picturing Distributions with Graphs
Displaying Distributions with Graphs
CHAPTER 1: Picturing Distributions with Graphs
Frequency Distributions
Descriptive Statistics
Descriptive Statistics
Daniela Stan, PhD School of CTI, DePaul University
Good Morning AP Stat! Day #2
Advanced Placement Statistics Ch 1.2: Describing Distributions
Descriptive Statistics
Descriptive Statistics
Basic Practice of Statistics - 3rd Edition
Welcome!.
Basic Practice of Statistics - 3rd Edition
Methods of Acquiring Information
CHAPTER 1: Picturing Distributions with Graphs
Welcome to AP Statistics
Descriptive Statistics
Chapter 1: Exploring Data
Descriptive Statistics
Math 145 January 24, 2007.
Math 341 January 24, 2007.
Statistics is... a collection of techniques for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting,
Presentation transcript:

Experimental Statistics I

 We use data to answer research questions  What evidence does data provide?  How do I make sense of these numbers without some meaningful summary? SubjectSBPHRBGAgeWeightTreatment

 Study to assess the effect of exercise on cholesterol levels. One group exercises and other does not. Is cholesterol reduced in exercise group? ◦ people have naturally different levels ◦ respond differently to same amount of exercise (e.g. genetics) ◦ may vary in adherence to exercise regimen ◦ diet may have an effect ◦ exercise may affect other factors (e.g. appetite, energy, schedule)

 Recognize the randomness: the variability in data.  …“ the science of understanding data and making decisions in face of variability ” Three steps to the process of statistics:  Design the study  Analyze the collected Data  Discover what data is telling you …

Displaying Distributions with Graphs

 Individuals – objects described by a set of data ◦ people, animals, things ◦ also called Cases ◦ called Subjects if they are human  Variable – characteristic of an individual, takes different values for different subjects.  The three questions to ask : ◦ Why: Purpose of study? ◦ Who: Members of the sample, how many? ◦ What: What did we measure (the variables) and in what units?

7 Key Characteristics of a Data Set Every data set is accompanied by important background information. In a statistical study, always ask the following questions:  Who? What cases do the data describe? How many cases does a data set have?  What? How many variables does the data set have? How are these variables defined? What are the units of measurement for each variable?  Why? What purpose do the data have? Do the data contain the information needed to answer the questions of interest?

8 Categorical and Quantitative Variables  A categorical variable places each case into one of several groups, or categories.  A quantitative variable takes numerical values for which arithmetic operations such as adding and averaging make sense.  The distribution of a variable tells us the values that a variable takes and how often it takes each value.

Distribution of a Variable 10 To examine a single variable, we graphically display its distribution.  The distribution of a variable tells us what values it takes and how often it takes these values.  Distributions can be displayed using a variety of graphical tools. The proper choice of graph depends on the nature of the variable.  The distribution of a variable tells us what values it takes and how often it takes these values.  Distributions can be displayed using a variety of graphical tools. The proper choice of graph depends on the nature of the variable. Categorical variable Pie chart Bar graph Categorical variable Pie chart Bar graph Quantitative variable Histogram Stemplot Quantitative variable Histogram Stemplot

Categorical Variables 11 The distribution of a categorical variable lists the categories and gives the count or percent of individuals who fall into each category.  Pie charts show the distribution of a categorical variable as a “pie” whose slices are sized by the counts or percents for the categories  Have to know the whole pie  Bar graphs represent categories as bars whose heights show the category counts or percents  more flexible

Bar Graph

Pie Chart

Quantitative Variables 14 The distribution of a quantitative variable tells us what values the variable takes on and how often it takes those values.  Histograms show the distribution of a quantitative variable by using bars. The height of a bar represents the number of individuals whose values fall within the corresponding class.  Stemplots separate each observation into a stem and a leaf that are then plotted to display the distribution while maintaining the original values of the variable.  Time plots plot each observation against the time at which it was measured.

15 To construct a stemplot:  Separate each observation into a stem (first part of the number) and a leaf (the remaining part of the number).  Write the stems in a vertical column; draw a vertical line to the right of the stems.  Write each leaf in the row to the right of its stem; order leaves if desired. Stemplots

16 Stemplots

17 If there are very few stems (when the data cover only a very small range of values), then we may want to create more stems by splitting the original stems. Example: If all of the data values are between 150 and 179, then we may choose to use the following stems: Leaves 0–4 would go on each upper stem (first “15”), and leaves 5–9 would go on each lower stem (second “15”). Stemplots

Numbers of home runs that Hank Aaron hit in each of his 23 years in the Major Leagues:

 Step 1: Identify all the stems ◦  Step 2: Write the stems in increasing order (usually from top to bottom)

 Step 3: Draw a line next to the stem and write the leaves against the stem

 Step 4: Rewrite the stemplot rearranging the leaves in ascending order (this can be done simultaneously with step 3):

 Compare the numbers of Hank Aaron to Barry Bonds:

 Describe the pattern ◦ Shape  How many modes (peaks)?  Symmetric or skewed in one direction? ◦ Center – midpoints?  Mean/average; median ◦ Spread  range between the smallest and the largest values, standard deviation, 5-number summary, quartiles  Look for outliers – individual values that do not match the overall pattern.

Histograms

ClassCountPercent 75 – < – < – < – < – < – < – < – <1551

 Shape: Somewhat symmetric, unimodal  Center: about 110 or 115  Spread : values between 80 and 150  Remember! ◦ Histograms only meaningful for quantitative data

Quantitative Example  Breaking strength of connections for electronic components:  Need to discuss variation ◦ How to group these items with so many different values?

Dealing with outliers

 Check for recording errors  Violation of experimental conditions  Discard it only if there is a valid practical or statistical reason, not blindly!

Time plots

 We care about two important parts ◦ Trend – persistent, long-term rise or fall ◦ Seasonal variation – a pattern that repeats itself at known regular intervals of time.  Mississippi data: ◦ Increasing trend ◦ Large seasonal variations –there is usually a large spike every few years

 Categorical and Quantitative variables  Graphical tools for categorical variables ◦ Bar Chart ◦ Pie Chart  Graphical tools for quantitative variables ◦ Stem and leaf plot ◦ Histogram ◦ Maybe timeplot if appropriate  Distributions ◦ Describe: Shape, center, spread ◦ Watch for patterns and/or deviations from patterns.