Week 2 LT Assignment DUE: 10/17/2011 MTH233

Slides:



Advertisements
Similar presentations
Histograms Bins are the bars Counts are the heights Relative Frequency Histograms have percents on vertical axis.
Advertisements

Describing Quantitative Variables
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set.
HOW TALL ARE APRENDE 8 TH GRADERS? Algebra Statistics Project By, My Favorite Math Teacher In my project I took the heights of 25 Aprende 8 th graders.
Review Unit 4A What is the mean of the following numbers? {10, 15, 14, 8, 10, 11, 12, 13, 15, 10} Answer: Mean= = 118,
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Text Exercise 1.22 (a) (b) (c) (d) (This part is asking you whether the data looks reasonably bell-shaped.) The distribution looks somewhat close to bell-shaped.
Programming in R Describing Univariate and Multivariate data.
Jeopardy Q $100 Q $200 Q $300 Q $400 Q $500 Q $100 Q $200 Q $300 Q $400 Q $500 Final Jeopardy.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Warm-Up If the variance of a set of data is 12.4, what is the standard deviation? If the standard deviation of a set of data is 5.7, what is the variance?
Data Distributions.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Review Measures of central tendency
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Descriptive Statistics becoming familiar with the data.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
14.1 Data Sets: Data Sets: Data set: collection of data values.Data set: collection of data values. Frequency: The number of times a data entry occurs.Frequency:
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
DATA MANAGEMENT MBF3C Lesson #5: Measures of Spread.
MDM4U Chapter 3 Review Normal Distribution Mr. Lieff.
REVIEW OF UNIT 1 1) The table displays the number of videos rented. Number of Videos Rented Number of Families a. How many families.
Measurement Variables Describing Distributions © 2014 Project Lead The Way, Inc. Computer Science and Software Engineering.
1 Chapter 2 Bivariate Data A set of data that contains information on two variables. Multivariate A set of data that contains information on more than.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Is their a correlation between GPA and number of hours worked? By: Excellent Student #1 Excellent Student #2 Excellent Student #3.
LIS 570 Summarising and presenting data - Univariate analysis.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Module 8 Test Review. Find the following from the set of data: 6, 23, 8, 14, 21, 7, 16, 8  Five Number Summary: Answer: Min 6, Lower Quartile 7.5, Median.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
StatisticsStatistics Unit 5. Example 2 We reviewed the three Measures of Central Tendency: Mean, Median, and Mode. We also looked at one Measure of Dispersion.
STATISTICS Chapter 2 and and 2.2: Review of Basic Statistics Topics covered today:  Mean, Median, Mode  5 number summary and box plot  Interquartile.
Minds on! Two students are being considered for a bursary. Sal’s marks are Val’s marks are Which student would you award the bursary.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
MATH-138 Elementary Statistics
Welcome to Week 04 Tues MAT135 Statistics
Math 21 Midterm Review Part 1: Chapters 1-4.
Engineering Probability and Statistics - SE-205 -Chap 6
Chapter 5 : Describing Distributions Numerically I
Boxplots.
Chapter 3 Describing Data Using Numerical Measures
Chapter 2b.
Percentiles and Box-and- Whisker Plots
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Boxplots.
Warmup Draw a stemplot Describe the distribution (SOCS)
Measuring Variation – The Five-Number Summary
Boxplots.
Data Analysis and Statistical Software I Quarter: Spring 2003
Warmup - Just put on notes page
1.3 Describing Quantitative Data with Numbers
Boxplots.
Describing Distributions
10-5 The normal distribution
Boxplots.
Click the mouse button or press the Space Bar to display the answers.
Histograms and Measures of Center vs. Spread
Boxplots.
Advanced Algebra Unit 1 Vocabulary
Describing Data Coordinate Algebra.
Boxplots.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Boxplots.
Presentation transcript:

Week 2 LT Assignment DUE: 10/17/2011 MTH233 Team B: Katie Johnson, Laura Laskey, Rosana Cedeno, & Taci Toves

Create a frequency table with these data. (Date sheet) The ages (years) for a random sample of 35 participants in the 2007 Youngstown Peace Race 12 14 15 17 19 20 23 24 26 27 33 37 39 40 41 44 45 47 51 52 56 59 1146 Total age of Participants Median Katie Johnson

Create a frequency table with these data. Bin Frequency 14 2 23 11 32 3 41 7 50 59 5 Frequency Table Age of Participants Number of Participants 6 – 14 2 15 – 23 11 24 – 32 3 33 – 41 7 42 – 50 51- 59 5 MODE is 11 participants in age group 2 ,15 to 23 years olds.  We also have to age groups that have the same number of participants, 7 in age group 4 & 5. Katie Johnson

Construct a histogram of these data, and describe its shape. The histogram suggests there are no presents of outliers. I believe that the shape is bimodal. It has two peaks with age group 2 and age groups 4 & 5. According to our class lecture (MATH 233-Week 2 Lecture 2011) quartiles (Q1 & Q3) can be used to determine the Five Number Summary and Turkey’s Rule uses the Interquartile Range (IQR to determine outliers. Therefore, using Example 5 from our lecture for Q1 & Q3. Q1 = (35)(0.25) Q1 = 8.25 rounding up to next whole number is 9 Q1-9 Q3 = (35)(0.75) Q3 = 26.25 rounding up to 27 Q3 = 27 Using Example 6 from our lecture for finding the Five Number Summary for the ages of the sampled participants are as follows: Min: 14 Q1: 17 Median: 37 (taken from Excel sheet using the median function.) Q3: 44 Max: 59 Then using the Interquartile Range (IQR) and Tukey’s Rule we can determine the outliers. IQR = Q3-Q1 IQR = 44-17 IQR = 27 Therefore Tukey’s Rule is calculated as follows: IQR x 1.5 = T 27 x 1.5 = T 40.5 = T Q1 – T = low outliers 17- 40.5 = low outliers -33.5 = low outliers Q3 + T = high outliers 44 + 40.5 = high outliers 84.5 = high outliers Katie Johnson

Find the mean, median, and standard deviation of the data values. 52 44 20 51 56 14 39 37 17 15 40 23 26 24 45 47 19 12 27 59 41 33 STDEV MEAN MEDIAN 14.42494 32.74286 14.4 32.7 The mean(32.7) and median(37) compared to the mode(11) indicate that the graph should be skewed to the right or Positively Skewed. Taci Toves

Does the empirical rule hold for these data? The Empirical Rule does hold true for the given data. As discussed in Week 2’s Lecture (p. 7), the empirical rule shows that the data follows a normal distribution by relating the mean and standard deviation. See the attached document (Excel spreadsheet- mean and standard deviation) in which I calculated the mean of the data to be 32.74 and the standard deviation 14.42. The Empirical Rule states that 68% of the data will fall within one standard deviation of the mean, 95% of the data will fall within two standard deviations of the mean, and 99.7% of the data will fall within three standard deviations of the mean (Week 2 Lecture, pp. 7. 8). For the given data, one standard deviation from the mean would be 32.74 +/- 14.42. The data needs to fall between 18.32 and 47.16. Two standard deviations from the mean would be 32.74 +/- 28.84 (which is 2 times the standard deviation of 14.42). The data needs to fall between 3.9 and 61.58. Three standard deviations from the mean would be 32.74 +/- 43.26 (which is 3 times the standard deviation of 14.42). The data needs to fall between -10.56 and 76. My calculations show that 21 of the 35 ages fall within one standard deviation from the mean, which calculates to 60%. (68% was predicted by the Empirical Rule) 35 of the 35 ages in the data fall within two standard deviations from the mean, which calculates to 100%. (95% was predicted by the Empirical Rule) 35 of the 35 ages in the data fall within three standard deviations from the mean, which calculates to 100%. (99.7% was predicted by the Empirical Rule) Therefore, the observed data results are consistent with a normal distribution. See the attached table (Next Slide) to compare the observed and predicted percentages for the data.   Laura Laskey

Does the empirical rule hold for these data? (Cont’d-Empirical Table) Interval Percent predicted by empirical rule Percent observed 13.32 to 47.16 68% 60% 3.9 to 61.58 95% 100% -10.56 to 76 99.70% Empirical Rule- Predicted and Observed percentages Laura Laskey

Do the computations for the previous question support your claim about the presence of outliers? Explain. 52 44 20 51 56 14 39 37 17 15 40 23 26 24 45 47 19 12 27 59 41 33 32.74286 14.42494 These computations support the claim that there are no outliers in the data, because 100% of the data falls within two standard deviations of the mean. If outliers were present, some of the data would fall outside of three standard deviations of the mean. That is not the case with the given data, so there are no outliers. Laura Laskey

Write a short summary of your findings about the distribution of ages. Rosana Cedeno