Chapter – Statistics Organizing Data

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Statistical Issues in Research Planning and Evaluation
QUANTITATIVE DATA ANALYSIS
Slides by JOHN LOUCKS St. Edward’s University.
CHAPTER 6 Statistical Analysis of Experimental Data
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 3 Describing Data Using Numerical Measures.
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Discrete and Continuous Random Variables Continuous random variable: A variable whose values are not restricted – The Normal Distribution Discrete.
12.3 – Measures of Dispersion
Chapter 11: Random Sampling and Sampling Distributions
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
CONFIDENTIAL 1 Grade 8 Algebra1 Data Distributions.
Describing distributions with numbers
A Look at Means, Variances, Standard Deviations, and z-Scores
Descriptive Statistics
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Chapter 10 Introduction to Statistics Frequency Distributions; Measures Of Central Tendency Population, variable, sample Example 1 (p. 572 – 573)
1.3 Psychology Statistics AP Psychology Mr. Loomis.
A Closer Look at the NEW High School Statistics Standards Focus on A.9 and AII.11 K-12 Mathematics Institutes Fall 2010 A Closer Look at the NEW High School.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
© The McGraw-Hill Companies, Inc., Chapter 3 Data Description.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
7 Chapter 10 Tables Graphs Probability….. In this Chapter you will… (Goals)
Statistics Chapter 9. Day 1 Unusual Episode MS133 Final Exam Scores
Correct Answer: A AP1.14. Boxplots of two data sets are shown.
Interpreting Performance Data
Describing distributions with numbers
Review Ways to “see” data –Simple frequency distribution –Group frequency distribution –Histogram –Stem-and-Leaf Display –Describing distributions –Box-Plot.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Discrete Probability Distributions Define the terms probability distribution and random variable. 2. Distinguish between discrete and continuous.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter The Normal Probability Distribution 7.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
© 2010 Pearson Education, Inc. All rights reserved Data Analysis/Statistics: An Introduction Chapter 10.
Chapter Eight: Using Statistics to Answer Questions.
What are the effects of outliers on statistical data?
What’s with all those numbers?.  What are Statistics?
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Warm Up Simplify each expression
PCB 3043L - General Ecology Data Analysis.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
LIS 570 Summarising and presenting data - Univariate analysis.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Copyright © 2015, 2011, 2008 Pearson Education, Inc. Chapter 6, Unit C, Slide 1 Putting Statistics to Work 6.
Statistics and Data Analysis
Holt McDougal Algebra Measures of Central Tendency and Variation Work through the notes Then complete the class work next to this document on the.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
MR. MARK ANTHONY GARCIA, M.S. MATHEMATICS DEPARTMENT DE LA SALLE UNIVERSITY.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Normal Probability Distributions 1 Larson/Farber 4th ed.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
13-5 The Normal Distribution
AP Biology Statistics From BSCS: Interaction of experiments and ideas, 2nd Edition. Prentice Hall, 1970 and Statistics for the Utterly Confused by Lloyd.
Statistics Unit Test Review
The Normal Probability Distribution
Section Ii: statistics
Math Review #3 Jeopardy Random Samples and Populations
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Chapter 16 16.1 – Statistics Organizing Data 16.2 – Measures of Central Tendency 16.3 – Measures of Variation

Stem and Leaf Stem-and-Leaf Plots 54 77 ASTRONAUTS Display the data shown in a stem-and-leaf plot. Step 1 Find the least and the greatest number. 54 77 Stem and Leaf

Stem-and-Leaf Plots ASTRONAUTS Display the data shown in a stem-and-leaf plot. 54 77 Step 2 Draw a vertical line and write the stems from 5 to 7 to the left of the line.

Stem-and-Leaf Plots ASTRONAUTS Display the data shown in a stem-and-leaf plot. Step 3 Write the leaves to the right of the line, with the corresponding stem. Stem and Leaf Diagram

Ranked Stem and Leaf Plot Stem-and-Leaf Plots ASTRONAUTS Display the data shown in a stem-and-leaf plot. Step 4 Rearrange the leaves so they are ordered from least to greatest. Ranked Stem and Leaf Plot

Frequency Distribution Slides Relative Frequency 0.67 0.23 0.06 0 0.03 Relative Frequency 0.14 0.33 0.25 0.17 0.11 Frequency Distribution Slides

Relative Frequency Measures of Variation The ratio of the absolute frequency to the total number of data points in a frequency distribution Also known as Experimental Probability. As the number of data points in any experiment increase the Experimental Probability approaches the theoretical probability. Measures of Variation Used to describe the distribution of the data: Range: Difference between the high and the low data points Variance: Standard Deviation:

Relative Frequency Measures of Variation The ratio of the absolute frequency to the total number of data points in a frequency distribution Also known as Experimental Probability. As the number of data points in any experiment increase the Experimental Probability approaches the theoretical probability. Measures of Variation Used to measure how spread out the data is. Range: Difference between the high and the low data points Mean, Median, Mode: Measures of central tendency Variance and Standard Deviation: Measures how much the data values differ from the mean. Mean Deviation

Mean Sample Space Complement The average of all data points. To find the mean of a set of data, add all the data and divide by the number of data points. Sample Space The sum of all possible outcomes for the event The sum of all probabilities assigned to outcomes in a sample space must be 1. Complement For an event A the event Not A is the complement of A

Mean Median Complement The average of all data points. To find the mean of a set of data, add all the data and divide by the number of data points. Median The value of the data point that is exactly in the middle of the data. To find the median, put the data in order from least to greatest and find the term exactly in the middle. If there is no one term exactly in the middle, average the two that are in the middle. Complement For an event A the event Not A is the complement of A

Mean The average of all data points. To find the mean of a set of data, add all the data and divide by the number of data points. Median The value of the data point that is exactly in the middle of the data. To find the median, put the data in order from least to greatest and find the term exactly in the middle. If there is no one term exactly in the middle, average the two that are in the middle. Mode The value of the data point that occurs most often. Possible to have more than one mode

Box-and-Whisker Plots

Step 1 Find the least and greatest number Step 1 Find the least and greatest number. Then draw a number line that covers the range of the data.

Lower Quartile : Lower hinge Step 2 Find the median, the extremes, and the upper and lower quartiles. Mark these points above the number line. Lower Extreme Lower Quartile : Lower hinge Median Upper Quartile : Upper hinge Upper Extreme

Step 3 Draw a box and the whiskers.

Computing Measures of Variation Range: Difference between the high and the low data points Mean Deviation Variance: Standard Deviation:

Find the mean, median, mode, variance, mean deviation, and standard deviation of the following data

Find the mean, median, mode, variance, mean deviation, and standard deviation of the following data

HW #16.1-3 Pg 692-693 3, 9, 10-13 Pg 698-699 3, 6, 9-16 Pg 702-703 5, 7-10

16.4 – The Normal Distribution Chapter 16 16.4 – The Normal Distribution

The heights of 16-year-old girls are not distributed uniformly, or evenly. Many more girls are average height than are very short or very tall. The values are distributed so that they are frequent near the mean, and become more rare and infrequent the farther they are from the mean. The most common distribution with this characteristic is a

The spread of each curve is defined by its standard deviation. Normal curves are symmetric with respect to the vertical line at the mean. The spread of each curve is defined by its standard deviation. Areas under this curve represent probabilities from normal distributions.

When data are distributed in a bell-shaped or normal curve about 68% of the data lie within one standard deviation on either side of the mean, and about 95% lie within two standard deviations of the mean.

Consider the bell-shaped distribution of IQ scores for students in a school. The mean is 100 and the standard deviation is 15. What percent of students in the school would we expect to have IQs between 85 and 115? What percent of the students can we expect to have IQs between 70 and 115?

What percent of the students in the school can we expect to have IQs above 115? What percent of the students in the school can we expect to have IQs in the range from 85 to 145?

The given times of 33 minutes and 57 minutes represent one standard deviation on either side of the mean, So, 68% of the shoppers will spend between 33 an 57 minutes in the supermarket.

According to a survey by the National Center for Health Statistics, the heights of adult men in the United States are normally distributed with a mean of 69 inches and a standard deviation of 2.75 inches. If you randomly choose 1 adult man, what is the probability that all he is 71.75 inches tall or taller?

Z-Score A z-score for a value is the number of standard deviations the value is from the mean. The sign of the z-score is its direction from the mean. For example, if a value has a z-score of -2, it is two standard deviations below the mean.

What is the z-score for 46 in a normal distribution whose mean is 44 and whose standard deviation is 2? For this distribution, mean = 44 and Standard Deviation = 2. Thus, 46 is one standard deviation above the mean.

A value is selected randomly from a normal distribution A value is selected randomly from a normal distribution. What is the probability that its z-score is less than -1.46? The probability that a value has a z-score less than -1.46 is equal to the area of the shaded region under the curve. This value is given in Table 7. Table 7 on page 850 gives the probability that a value in the distribution has a z-score that is less than a given value

A student received a score of 56 on a normally distributed standardized test. The test had a mean of 50 and a standard deviation of 5.What is the probability that a randomly selected student achieved a higher score? We need to find the probability that a value greater than 56 is selected from a normal distribution with a mean of 50 and a standard deviation of 5. Look up 1.2 in Table 7. P(Score < 56) = 0.8849 P(Score > 56) = 1-0.8849 = 0.1151

How often would we expect to find an 1Q greater than 142 in a sample of students whose mean 1Q was 110 where the standard deviation was 16?

A company manufactures cover plates for boxes with lengths of 4 inches A company manufactures cover plates for boxes with lengths of 4 inches. Due to variation in the process, the lengths of the plates are normally distributed about a mean of 4 inches with a standard deviation of 0.01 inch. A plate is considered a "reject" if its length is less than 3.98 inches or greater than 4.02 inches.What percent of the production are considered "rejects"?

HW #16.4 Pg 708 1-31 Odd, 33-43

16.5 – Collecting Data Randomness and Bias Chapter 16 16.5 – Collecting Data Randomness and Bias Objective: Evaluate and select sampling methods. Objective: Describe how to take a stratified random sample.

Objective: Evaluate and select sampling methods. A scientist is studying the weight gain or loss of mice that are given a certain treatment. When choosing mice for the experiment, the scientist reaches into a cage with 30 mice and selects the 5 largest mice in the cage. Is the sample random?

Objective: Evaluate and select sampling methods. Describe how a random sample of 10 individuals might be chosen from a high school graduating class of 202 to receive a gift certificate.

Objective: Evaluate and select sampling methods. Although the processes involved in the development of a random sample guarantee that requirements of equal probability and independence are satisfied, they do not guarantee that the sample drawn will be REPRESENTATIVE

Objective: Evaluate and select sampling methods. A town newsletter is doing an article on high school students. A questionnaire is sent to a random sample of school-aged students. Are the data representative? No. A sample of students from kindergarten through grade 12 would not give results representative of high school students. Even when the sample is restricted to high school students, the data may not be representative. Data might have been collected largely from members of the high school chorus. In this case, the data would most likely be biased. That is, it is likely that the data would be overly influenced by factors that are related to musical interests.

Objective: Evaluate and select sampling methods.

Objective: Describe how to take a stratified random sample. To draw a representative sample, we may need to divide the population into distinct subgroups, called strata. Then we can use stratified random sampling to assure that the sample has the same characteristics as the population. Each member of the population must be placed in one and only one stratum. A random sample is drawn so that the sample has the same distribution among the strata as the population.

Objective: Describe how to take a stratified random sample. A college has 1260 freshmen, 1176 sophomores, 840 juniors, and 924 seniors. Describe how to take a stratified random sample of 200 students. We would randomly sample 60 freshmen, 56 sophomores, 40 juniors, and 44 seniors.

HW #16.5 Pg 713-714 1-19

Chapter 16 16.6 – Testing Hypothesis

Statisticians are often asked to test whether a given set of observational data represents what one would expect to observe by chance or whether it differs greatly from what one might expect. To determine what constitutes a significant difference, statisticians establish a level of error they are willing to tolerate.

If we throw a coin 30 times and the coin shows tails 23 times, we may decide either that The results occurred by chance. The coin is fair and randomly showed 23 tails, although the likelihood that a coin lands tails 23 times is remote. The results did not occur by chance. The coin is unevenly weighted or was tossed so that tails had a higher probability of being thrown. Before stating that the results were biased or influenced by other circumstances, we want to be sure there is a 5% or less probability that the results occurred by chance. Then we can state that the coin is fair at the 5% level of significance.

Find the probability of throwing 23 tails in 30 tosses Since the probability is less than 5%, we may state that at the 5% level of significance, the results did not occur by chance.

One of the most common ways to test whether a given set of data differs from what one would expect by chance is to use the chi-square (2) test. This test is used to compare observed data with expected data. If there is a large difference between observed and expected data, we get a large value for  2. If there is no difference,  2 = O.

If we toss a coin 30 times and get 23 heads calculate 2 :

What does that tell us about the whether or not the event occurred by chance or not? The chi-square test is typically used to accept or reject a hypothesis about a set of data and to generalize about a population. null hypothesis: There is no statistical difference between the expected and the observed data. Thus, the observed results occurred by chance. The larger the value of  2,the higher the probability the null hypothesis is false.

How large must X2 be for us to reject the null hypothesis? For a specific level of significance, we reject the null hypothesis if the calculated value of chi-square exceeds the table value for the number of possible outcomes.

Suppose we rolled a number cube 72 times and found that we had 13 ones, 18 threes, and 12 sixes. Determine whether these results occurred by chance. Use a 5% level of significance. The null hypothesis is that the results occurred by chance.

There are 4 possible outcomes There are 4 possible outcomes. The chi-square value for a 5% significance level for 4 possible outcomes is 7.81. Since 4.44 does not exceed this value, we can state that our results occurred by chance. Thus, we can accept the null hypothesis.

HW #16.6 Pg 718-720 1-13

Test Review

Stem and Leaf Plots Frequency plots Box and Whisker plots Relative Frequency Box and Whisker plots Normal Distributions Mean, Median, Mode, Standard Deviation, Mean Deviation, Variance, Z-score Hypothesis testing Chi-Square, Significance Level, Null hypothesis Random Sample Representative, Biases, Stratified random sample Challenge Problems

Find the variance and standard deviation of each set of data. {5, 8, 2, 9, 4} {16, 22, 18, 31, 25, 22}

The useful life of a radial tire is normally distributed with a mean of 30,000 miles and a standard deviation of 5000 miles. The company makes 10,000 tires a month. About how many tires will last between 25,000 and 35,000 miles? About how many tires will last more than 40,000 miles? About how many tires will last less than 25,000 miles? What is the probability that if you buy a radial tire at random, it will last between 20,000 and 35,000 miles?

The vending machine in the school cafeteria usually dispenses about 6 ounces of soft drink. Lately, it is not working properly, and the variability of how much of the soft drink it dispenses has been getting greater. The amounts are normally distributed with a standard deviation of 0.2 ounce. What percent of the time will you get more than 6 ounces of soft drink? What percent of the time will you get less than 6 ounces of soft drink? What percent of the time will you get between 5.6 and 6.4 ounces of soft drink? If you purchased a soda, what is the probability that it dispenses less than 6.5 oz of soda

If the mean GPA at Troy is 3. 4 with a standard deviation of 0 If the mean GPA at Troy is 3.4 with a standard deviation of 0.38, what is the GPA for a student in the top 5%?

Mr. Burnum gave an exam to his 30 Algebra 2 students at the end of the first semester. The scores were normally distributed with a mean score of 78 and a standard deviation of 6. What percent of the students would you expect to receive a grade of at least 70%.

Cucumbers grown on a certain farm have weights with a standard deviation of 2 ounces. What is the mean weight if 85% of the cucumbers weigh less than 16 ounces?

A coke machine is set up so that it will dispense soda into a can A coke machine is set up so that it will dispense soda into a can. If the actual amount of soda dispensed is normally distributed such that only 10% of all cans have less than 11.75 ounces and only 25% of all cans have more than 12.75 ounces, what are the mean and standard deviation of the soda dispensing machine?

HW #R-16 Pg 722-724 1-17