Kinds of data 10 red 15 blue 5 green 160cm 172cm 181cm 4 bedroomed 3 bedroomed 2 bedroomed size 12 size 14 size 16 size 18 fred lissy max jack callum zoe.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

S1: Chapter 4 Representation of Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Histograms, Frequency Polygons and Ogives
Cumulative Frequency and Box Plots. Learning Objectives  To be able to draw a cumulative frequency curve and use it to estimate the median and interquartile.
Spread of Data Whilst we have been looking at averages: mean, medians and modes these tell us little without knowing how the data spreads out around these.
Very Basic Statistics.
Starter 1.Find the median of Find the median of Calculate the range of Calculate the mode.
Starter List 5 types of continuous data List 3 types of discrete data Find the median of the following numbers: 2, 3, 6, 5, 7, 7, 3, 1, 2, 5, 4 Why is.
1 CUMULATIVE FREQUENCY AND OGIVES. 2 AS (a) Collect, organise and interpret univariate numerical data in order to determine measures of dispersion,
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
2.1: Frequency Distributions and Their Graphs. Is a table that shows classes or intervals of data entries with a count of the number of entries in each.
CHAPTER 39 Cumulative Frequency. Cumulative Frequency Tables The cumulative frequency is the running total of the frequency up to the end of each class.
Chapter 13 – Univariate data
Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different.
REPRESENTATION OF DATA.
Dr. Serhat Eren DESCRIPTIVE STATISTICS FOR GROUPED DATA If there were 30 observations of weekly sales then you had all 30 numbers available to you.
 FREQUENCY DISTRIBUTION TABLES  FREQUENCY DISTRIBUTION GRAPHS.
Chapter 1: Exploring Data AP Stats, Questionnaire “Please take a few minutes to answer the following questions. I am collecting data for my.
Graphical Summary of Data Distribution Statistical View Point Histograms Skewness Kurtosis Other Descriptive Summary Measures Source:
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
STEM AND LEAF DIAGRAMS Don’t forget to order Include a key.
Statistical Inference for Managers Lecture-2 By Imran Khan.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
14.1 Data Sets: Data Sets: Data set: collection of data values.Data set: collection of data values. Frequency: The number of times a data entry occurs.Frequency:
Warm Up Find the mean, median, mode, range, and outliers of the following data. 11, 7, 2, 7, 6, 12, 9, 10, 8, 6, 4, 8, 8, 7, 4, 7, 8, 8, 6, 5, 9 How does.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
General Maths Unit One Univariate Data. Chapter One – Univariate Data Assigned textbook questions to be up to date by 2 nd Feb Exercise 1A Exercise 1B.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Introduction to statistics I Sophia King Rm. P24 HWB
Chapter 2 Descriptive Statistics 2.1 Frequency Distributions and Their Graphs.
Stat 101Dr SaMeH1 Statistics (Stat 101) Associate Professor of Environmental Eng. Civil Engineering Department Engineering College Almajma’ah University.
4.2 Displays of Quantitative Data. Stem and Leaf Plot A stem-and-leaf plot shows data arranged by place value. You can use a stem-and-leaf plot when you.
Statistics and Probability IGCSE Cambridge. Learning Outcomes Construct and read bar charts, histograms, scatter diagrams and cumulative frequency diagrams,
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
STATISTICS Chapter 2 and and 2.2: Review of Basic Statistics Topics covered today:  Mean, Median, Mode  5 number summary and box plot  Interquartile.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Unit 3 Guided Notes. Box and Whiskers 5 Number Summary Provides a numerical Summary of a set of data The first quartile (Q 1 ) is the median of the data.
MM2D1: Using sample data, students will make informal inferences about population means and standard deviations b. Understand and calculate the means and.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Statistics Descriptive Statistics. Statistics Introduction Descriptive Statistics Collections, organizations, summary and presentation of data Inferential.
S1: Chapter 4 Representation of Data Dr J Frost Last modified: 20 th September 2015.
Descriptive Statistics
Exploratory Data Analysis
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Measures of Position – Quartiles and Percentiles
Mathematics GCSE Revision Key points to remember
Cumulative Frequency Diagrams
You will need your calculator today (and every day from now on)
7. Displaying and interpreting single data sets
Descriptive Statistics SL
Frequency Distributions and Their Graphs
Histograms, Frequency Polygons and Ogives
Representation and Summary of Data - Location.
DS2 – Displaying and Interpreting Single Data Sets
Topic 5: Exploring Quantitative data
S1: Chapter 4 Representation of Data
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
Representation of Data
You will need your calculator today (and every day from now on)
Displaying and Summarizing Quantitative Data
Honors Statistics Chapter 4 Part 3
Summary (Week 1) Categorical vs. Quantitative Variables
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Review of 6th grade material to help with new Statistics unit
Presentation transcript:

Kinds of data 10 red 15 blue 5 green 160cm 172cm 181cm 4 bedroomed 3 bedroomed 2 bedroomed size 12 size 14 size 16 size 18 fred lissy max jack callum zoe luke stephen baby 5lb3oz 6lb10oz 7lb12oz 11lb1oz Qualitative Continuous Quantitative Discrete

Page 10 Exercise 2A Q1 - 5 and 7

Averages There are three types of average and they all begin with M.....most popular value or class.....middle value if all values are placed in order.....the sum of all the values shared by how many values there are

Mean How would you say the mean average differed from the median average? In which circumstances may you use the mean rather than the median and vice versa? Rather than describing how to find the mean in words we need to learn some notation. The sum of all the x values should be written as: The sum of the values in the fx column should be written as:

Page 15 Exercise 2B Q1, 2, 4, 6 and 7

In what kind of questions will we need to add the values in the fx column rather than just add together all the x values? Try finding the mean of the year 7 girl heights and compare it to the year 7 boy heights. Would you use the mean or the median to summarise this data and why? If you used the raw data for the above calculation why? If you used the grouped data from the frequency polygon or histogram work why am I going to tell you you've made a mistake?

We use the "x Bar" notation to represent a sample mean. If I was using all the data possible it would be called a population mean and we use the "mu" notation Mean Formulae for a list of data for a frequency table of data for grouped data it is necessary to find the midpoint of each class first and use this as a value for x and then use the same equation above.

Page 18 Exercise 2C Q1, 4 and 5

Stem and Leaf Put the data below into a suitable stem and leaf diagram. 127, 135, 147, 147, 149, 139, 145, 155, 149, 155, 151, 159, 139, 141, 155, 160, 138, 144, 155, , 143, 147, 157, 152, 150, 161, 133, 146, 155 The data represents heights of a first year class in a boys school. How else can you summarise or represent this data?

Below is a list of the heights of 30 year 7 girls. Add these to the other side of your stem and leaf diagram and make some comparitive statements based on suitable summary data you find. 127, 145, 147, 147, 149, 149, 145, 165, 139, 157, 152, 169, 129, 121, 158, 160, 148, 141, 155, , 143, 157, 156, 152, 150, 161, 133, 146, 155

Page 55 Exercise 4A Q2 and 5

If you have grouped your data with equal class widths, you have little to worry about. However if the class widths are uneven you will need to plot them against frequency density rather than just frequency. Frequency Density = Class Frequency Class Width Sometimes Relative Frequency Density is plotted on the y axis. This can be calculated as: Rel Freq Dens = Class Frequency Total Frequency

It is best that Histograms are plotted against frequency density or relative frequency density. They should also only be drawn with continuous data. Discrete or qualitative data can be plotted in Bar Charts but their bars should not really touch as they aren't connected Histograms Histograms are similar to bar charts apart from the consideration of areas. In a bar chart, all of the bars are the same width and the only thing that matters is the height of the bar. In a histogram, the area is the important thing.

Page 64 Exercise 4E Q1, 4 and 5 We may also need to find the average from these grouped continuous data sets Page 22 Exercise 2D Q1 - 5

Summarising Data What types of data summary have you come up with so far? And how do they differ? Give examples of when one type would be better than another.

Quartiles You will recall finding the median from a list of data involves adding one to the number of values before halving it to find out which value (placed in order) you should use. For example in a list of 7 numbers the median value is the (7 + 1) / 2 th value the 4th value 3, 5, 5, 6, 8, 10, 10 If you consider the quartiles you can see that it is the 2nd and 6th values. These can be found by dividing the number of values by 4 and as long as this gives a whole number find the average of it's value and the value above it. If this yields a decimal value rather than a whole number then always round UP to the value above it. In a list of 14 numbers the lower quartile would be taken as the 14 / 4 th value or 3.5th value so you'd take the 4th value 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14

Page 34 Exercise 3A Q1 and 2

Interquartile Ranges and Boxplots We use the Quartiles and minimum and maximum values to draw boxplots. These are great for comparing the spread of data between two or more data sets Q1Q1 Q3Q3 Q2Q2 Q0Q0 Q4Q4 Where Q 0 = min value Q 1 = lower quartile Q 2 = median Q 3 = upper quartile Q 4 = max value Draw box plots to compare the year 7 height data you put into stem and leaf diagrams earlier

Read page 57 Page 58 Exercise 4B Q1 - 2 Page 59 Exercise 4C Q1 - 2 Page 61 Exercise 4D Q1 - 2

Cumulative Frequency We have seen how to find the quartiles from a list of data or stem and leaf diagrams. We have also seen that data is often stored in frequency distributions. If these are grouped it becomes difficult to find these quartiles. Why? We used to overcome this by drawing cumulative frequency curves. 60 students got below 50 marks 30 students got below 40 marks by joining up these two points we pretend the 30 students with between 40 to 50 marks are spread evenly throughout the band.

Interpolation We could actually find this value much quicker by using some simple mathematics known as interpolating. Think how you could find the mark of the 40th student from this year 10 class using just the data rather than reading from the graph. Now try estimating the mark of the 55th student. Remember estimating doesn't mean guessing; it involves exact calculations but it is unlikely to be the true mark as the students are unlikely to be spaced evenly througout the class. we cannot know the exact mark without the raw data - we are not given this with grouped data - hence we estimate

Quantile = b + (Qn - f) x w f m Where f m is the frequency of the class the quantile falls in f is the cumulative frequency up to the class the quantile falls in w is the class width of the class the quantile falls in Q is the quantile you are finding expressed as a fraction n is the number of data b is the LOWER bound of the class the quantile fits in We can divide the data into as many equal parts as we like. Quartiles divide in four Deciles in ten and Percentiles into 100 The formula below is known as interpolating and estimates a quantile by assuming the data collected in each class is spread evenly. It should be very similar to the formula yo created earlier to find quartiles without drawing a cumulative frequency curve. Quantiles

Page 34 Exercise 3A Q3 and 5 Page 37 Exercise 3B Q1, 3 and 5