1. Mean - easy to calculate but is affected by extreme values - to calculate use: Sum of all values Total number of values e.g. Calculate the mean of.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

PROBLEM This is where you decide what you would like more information on. PLAN You need to know what you will measure and how you will do it. What data.
IB Math Studies – Topic 6 Statistics.
Chapter 11 Data and Graphs (part 1). Day….. 1.Misleading GraphsMisleading Graphs 2.Frequency Tables and Tally ChartsFrequency Tables and Tally Charts.
“Teach A Level Maths” Statistics 1
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Descriptive statistics (Part I)
Starter 1.Find the median of Find the median of Calculate the range of Calculate the mode.
Introduction to Data Analysis. Defining Terms  Population- the entire group of people or objects that you want information about  Sample- specific part.
Frequency Distributions and Graphs
Stem and leaf diagrams Sometimes called ‘stem and leaves’ too.
CHAPTER 39 Cumulative Frequency. Cumulative Frequency Tables The cumulative frequency is the running total of the frequency up to the end of each class.
Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different.
Bell Ringer Get out your notebook and prepare to take notes on Chapter 9 Convert your height to inches and be prepared to write this value on the whiteboard.
Definitions Data: A collection of information in context.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
REPRESENTATION OF DATA.
STATISTICAL GRAPHS.
 FREQUENCY DISTRIBUTION TABLES  FREQUENCY DISTRIBUTION GRAPHS.
10. Presenting and analysing data * Averages for discrete data * Stem and leaf diagrams * Using appropriate averages * The quartiles * Measures of spread.
AS Statistics Data Representation.
INTRODUCTORY STATISTICS Chapter 2 DESCRIPTIVE STATISTICS PowerPoint Image Slideshow.
Data a collection of facts, such as values or measurements.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Statistics Chapter 9. Day 1 Unusual Episode MS133 Final Exam Scores
Objectives Describe the central tendency of a data set.
Chapter 2 Describing Data.
Data a collection of facts, such as values or measurements.
1. Mean - easy to calculate but is affected by extreme values - to calculate use: Sum of all values Total number of values e.g. Calculate the mean of.
Grade 8 Math Project Kate D. & Dannielle C.. Information needed to create the graph: The extremes The median Lower quartile Upper quartile Any outliers.
Graphing Data Box and whiskers plot Bar Graph Double Bar Graph Histograms Line Plots Circle Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
Demonstrate knowledge of measures and displays used to compare data sets.
Surveillance and Population-based Prevention Department for Prevention of Noncommunicable Diseases Displaying data and interpreting results.
Time Series - A collection of measurements recorded at specific intervals of time. 1. Short term features Noise: Spike/Outlier: Minor variation about.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Histograms, Box and Whisker Plots, and Dot Plots
Statistics Year 9. Note 1: Statistical Displays.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Statistics Descriptive Statistics. Statistics Introduction Descriptive Statistics Collections, organizations, summary and presentation of data Inferential.
Data organization and Presentation. Data Organization Making it easy for comparison and analysis of data Arranging data in an orderly sequence or into.
Statistics 1: Statistical Measures
Mathematics GCSE Revision Key points to remember
Chapter 2: Methods for Describing Data Sets
Statistics Unit Test Review
4. Interpreting sets of data
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
7. Displaying and interpreting single data sets
Graphing Data Box and whiskers plot Bar Graph Double Bar Graph
MATH 2311 Section 1.5.
Description of Data (Summary and Variability measures)
DS2 – Displaying and Interpreting Single Data Sets
Topic 5: Exploring Quantitative data
CHAPTER 1 Exploring Data
Unit 3: Statistics Final Exam Review.
Representing Quantitative Data
Unit 2: Statistics Final Exam Review.
Tuesday, February 18th What is the range of the upper 75%?
Representation of Data
10.5 Organizing & Displaying Date
Common Core Math I Unit 2: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Displaying and Summarizing Quantitative Data
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
CHAPTER 1 Exploring Data
Constructing and Interpreting Visual Displays of Data
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
Presentation transcript:

1. Mean - easy to calculate but is affected by extreme values - to calculate use: Sum of all values Total number of values e.g. Calculate the mean of 6, 11, 3, 14, Mean = = 42 5 Push equals on calculator BEFORE dividing = 8.4 e.g. Calculate the mean of 6, 11, 3, 14, 8, 100 Mean = = = 23.7 (1 d.p.)

2. Median - middle number when all are PLACED IN ORDER (two ways) - harder to calculate but is not affected by extreme values a) for an odd number of values, median is the middle value e.g. Find the median of 39, 44, 38, 37, 42, 40, 42, 39, 32 32, 37, 38, 39, 39, 40, 42, 42, 44 To find placement of median use: n n = amount of data = 10 = OR Cross of data, one at a time from each end until you reach the middle value. b) for an even number of values, median is average of the two middle values e.g. Find the median of 69, 71, 68, 85, 73, 73, 64, 75 64, 68, 69, 71, 73, 73, 75, 85 n + 1 = = Median = = 144 = OR Median = 39

3. Mode - only useful to find most popular item - is the most common value (can be none, one or more) e.g. Find the mode of 188, 93, 4, 93, 15, 0, Mode = 15 and 93 Range - can show how spread out the data is - is the difference between the largest and smallest values e.g. Find the range of 4, 2, 6, 9, 8 highest valuelowest value Range = 9 – 2 Note: Its a good idea to write in brackets the values that make up the range. (2 – 9)= 7

- Useful when dealing with large amounts of discrete data e.g. Here are the number of fundraising tickets sold by 25 members of a Hockey team. Place data on a frequency table. 3, 5, 0, 1, 0, 2, 5, 2, 4, 0, 1, 2, 3, 5, 7, 2, 3, 3, 1, 4, 3, 3, 2, 0, 1 No. of tickets sold (x) TallyFrequency (f)x.f Total IIII I II III I x 4 = 0 1 x 4 = 4 2 x 5 = 10 3 x 6 = 18 4 x 2 = 8 5 x 3 = x 1 = 7 62 To find the mean, we need the sum of the ticket numbers multiplied by their frequencies, and divide this by the total frequency. Mean = sum of x.f. total frequency = = 2.48 tickets Check total frequency matches question!

No. of tickets sold (x) TallyFrequency (f)x.f Total IIII I II III I x 4 = 0 1 x 4 = 4 2 x 5 = 10 3 x 6 = 18 4 x 2 = 8 5 x 3 = x 1 = 7 62 To find the median, determine its position by using the previous formula. n + 1 = = Now, by adding down the frequency column, locate position of median Therefore: Median = 2 tickets To find the mode, look for the highest frequency Therefore: Mode = 3 tickets

1. Discrete Data – usually found by counting, usually whole numbers e.g.Number of cars passing the school 2. Continuous Data – usually found by measuring e.g.Weights and heights of students 1. Bar Graph – shows discrete data – must have GAPS between bars Number of dinnersFrequency e.g. Beside are the number of times 28 students went out for dinner last month. Place data on a bar graph.

Don’t forget a title Or axis labels Note gaps between bars

2. Dot Plots – are like a bar graph – each dot represents one item e.g. Plot these 15 golf scores on a dot plot 70, 72, 68, 74, 74, 78, 77, 70, 72, 72, 76, 72, 76, 75, 78 Range plot between lowest and highest values

3. Pictograms – uses symbols to represent fixed numbers – key shows the value of the symbol e.g. Using an appropriate symbol, draw a pictogram displaying the number of hours per week spent completing homework for the following subjects. Hours of Study in a Week Science English Maths KEY 1 hour

4. Pie Graphs – show comparisons – slices are called sectors – uses percentages and angles (protractor and compass) e.g.Students of a class arrived to school in the following manner. Show on a Pie Graph Walked = 6 Cycled = 5 Car = 4 Bus = 9 To calculate angle of sectors use: Amount of sector x 360 Total Data Walked = 6 x 360 = °135°60°90° Student Mode of Transport Cycled WalkedCar Bus Note: Instead of labels, a key could also be used.

5. Strip Graph – shows the proportion of each part to the whole – should have a scale – linked to pie graphs e.g.Using Pie Graph example, Strip Graph drawn could use a scale of1 cm = 2 students – are measures of spread which with the median splits the data into quarters – method used is similar as to when finding median When the data is in order: – the lower quartile (LQ) has – the upper quartile (UQ) has 25% or ¼ 75% or ¾ of the data below it. – the Interquartile Range (IQR) =UQ – LQ

e.g. Find the LQ, UQ and the interquartile range of the following data 6, 6, 6, 7, 8, 9, 10, 10, 11, 14, 16, 16, 17, 19, 20, 20, 24, 24, 25, 29 Note: always find the median first = 21 = = 11 = ORLQ =8 + 9 = 17 = UQ = = 40 = OR e.g. Find the LQ, UQ and the interquartile range of the following data 5, 6, 8, 10, 11, 11, 12, 15, 18, 22, 23, 28, 30 Remember, always find the median first = 14 = or cross off data As the median is an actual piece of data, it is ignored when finding the LQ and UQ = 7 = LQ = = 18 = UQ = = 45 = IQR = UQ - LQIQR = 20 – 8.5 = 11.5 IQR =22.5 – 9 = 13.5

– records and organises data – most significant figures form the stem and the final digits the leaves – can be in back to back form in order to compare two sets of data e.g. Place the following heights (in m) onto a back to back stem and leaf plot BOYS = 1. 59, 1.69, 1.47, 1.43, 1.82, 1.70, 1.73, 1.35, 1.76, 1.68, 1.62, 1.84, 1.45, 1.50, 1.54, 1.73, 1.84, 1.71, 1.66 GIRLS = 1. 44, 1.46, 1.63, 1.29, 1.48, 1.57, 1.51, 1.42, 1.34, 1.45, 1.57, 1.59, 1.42 Unordered Graph of Heights Ordered Graph of Heights Boys Girls Boys Girls Look at the highest and lowest data values to decide the range of the stem Place the final digits of the data on the graph on the correct side,7,9,3,2,0,3 5,6,8,2 4 5,0 4,3 1 6,4 4,6, 3 9 8, 7,1, 2, 4 5, 7,9 2 4, 4, 2 6, 3, 3, 1, 0 9, 8, 6, 2 9, 4, 0 7, 5, , 7, 7, 9 2, 2, 4, 5, 6, 8 4 9

Graph of Heights Boys Girls , 4, 2 6, 3, 3, 1, 0 9, 8, 6, 2 9, 4, 0 7, 5, , 7, 7, 9 2, 2, 4, 5, 6, e.g. From the ordered plot state the minimum, maximum, LQ, median, UQ, IQR and range statistics for each side BOYSGIRLS Minimum: Maximum: LQ: Median: UQ: IQR: Range: For each statistic, make sure to write down the whole number, not just the ‘leaf’! 1.29 m 1.63 m 1.63 – 1.29 = 0.34 m Median = = 7 2 When finding median, LQ and UQ, make sure you count/cross in the right direction! LQ/UQ = = m 1.46 m 1.57 m 1.57 – 1.42 = 0.15 m Remember: If you find it hard to calculate stats off graph, write out data in a line first! 1.35 m 1.84 m 1.50 m 1.68 m 1.73 m 1.73 – 1.50 = 0.23 m 1.84 – 1.35 = 0.49 m

– shows the minimum, maximum, LQ, median and UQ – ideal for comparing two sets of data e.g. Using the height data from the Stem and Leaf diagrams, draw two box and whisker plots (Boys and Girls) Note: Use the minimum and maximum values to determine length of scale Males Females Question: What is the comparison between the boy and girl heights? ANSWER? MinimumLQMedianUQMaximum EVIDENCE? Box and Whisker Plot of Boys and Girls Heights

– used when dealing with a large amount of continuous data and groups are needed e.g. Listed below are the heights (in cm) of 25 students. Represent the data on a frequency table 167, 173, 171, 149, 162, 174, 185, 165, 160, 170, 173, 161, 158, 172, 168, 168, 178, 170, 180, 166, 183, 150, 164, 161, 164 IntervalTallyFreq. (f)Midpoint (x)x.f 140 – – – – – 189 TOTAL To calculate the mean a midpoint is needed and the formula used is: e.g. Calculate the mean from the above data and state the modal interval I II IIII IIII I IIII III III ( ) / x Mean = = cmModal Interval =160 – 169 cm Note: Make sure you have enough groups but don’t make them too small!

– display grouped data – frequency is along vertical axis, group intervals are along horizontal axis – there are NO gaps between bars e.g. Graph the grouped frequency table data about heights onto a histogram Note: The groups from the table form the intervals along the horizontal axis and the highest frequency determines the height of the vertical axis.

– Side by side histograms can also be used to compare data Question: What is the comparison between the female and male heights? ANSWER? EVIDENCE?

– looks for a relationship between two measured variables – points are plotted like co-ordinates e.g. Below are the heights and weights of Year 7 boys. Place on a scatter plot. Height (cm) Weight (kg) Use the data to determine scale to use on both axes If points form a line (or close to) we can say there is a relationship between the two variables. Line of best fit Outliers can generally be ignored What is the relationship between the boys height and weight? ANSWER? EVIDENCE?

– a collection of measurements recorded at specific intervals where the quantity changes with time. Features of Time Series a) Order is important with all measurements retained to examine trends b) Long term trends where measurements definitely tend to increase or decrease c) Seasonal trends resulting in up and down patterns e.g. Draw a time series graph for the following data: SeasonQuarterly sales Sept.9040 Dec.8650 Mar June9250 Sept.9033 Dec.8578 Mar June9407 Sept.9209 Dec.8740 Mar June9504 Sept.9246 Dec.8929 Mar Join up each of the points What are the short and long term trends? ANSWER?EVIDENCE?

Good graphs should have: - an accurate heading (watch emotive headings) - scales in even steps - scales from zero unless a break is shown - values easy to read - bar graphs have the same width bars and similar shading Sample:When part of the group is surveyed Census:Whole population is surveyed Population:The entire group of members under consideration Survey:Collection of information from some or all members of a population Sampling Frame:A list covering the target population A Good Sampling Frame:- should have each unit listed only once - has each unit distinguishable from others - is up to date

When planning an investigation: - think carefully about what you are trying to find (question) - what data is needed - how will you obtain the data - is the method practical and convenient - how will you record the information - how will you present the data

A sample should: 1) Be large enough to be representative of whole population 2) Have people/items in it that are representative of the population It is best to choose samples that are large and random but size may be affected by time, money, personnel, equipment etc. Simple random sampling: 1- obtain a population list 2- number each member 3- use random table or random number on calculator Systematic sampling: 1- obtain a population list 2- randomly select a starting point on the list 3- select every nth member until desired sample size is reached Note: every nth member is found by:Population/group size Size of sample needed

- Biased sample - Wrong measurements - Poorly worded, misleading questions - Mistakes in calculations and/or display For when comparing two sets of data. 1. If the two sets of data are NOT related (have no affect on each other) Use the wordsCOMPAREORCOMPARISON e.g. What is theCOMPARISONbetween… How does …COMPAREto … THEN: (also if justifying statements) - Get as many statistics as possible (averages, quartiles, max and min, range etc) - Draw aSTEM and LEAF GRAPHand a BOX and WHISKER PLOT (maybeSIDE BY SIDE HISTOGRAMS) - Answer your question in one sentence - Back up your answer with at least 2-3 statements using the data from your statistics/graphs (at least one each on average and spread) Remember to use “generally/on average”

2. If the two sets of data ARE related (do have an affect on each other) Use the wordsRELATEORRELATIONSHIP e.g. What is theRELATIONSHIPbetween… How does …RELATEto … THEN: - Get as many statistics as possible - Draw aSCATTERPLOT - Answer your question in one sentence - Back up your answer with at least 2-3 statements 3. If it is a single set of data taken over time we look for short and long term trends. - Write your question in the following manner: What are theSHORTandLONG TERMtrends in …. THEN: - Get as many statistics as possible - Draw aTIME SERIES GRAPH - Answer your question and back it up with justifications

1. In terms of Data Collection Typical Limitations Improvements - Sample too small - Not random or representative - Outliers distort data - Get a representative sample - Taken over too short a time period - Take data over a longer time period - Obtain a bigger sample - Ignore extreme outliers 2. In terms of Your Process Typical Limitations Improvements - Not enough statistics calculated - Calculate more statistics - Not enough graphs used, data could be compared better - Use other graphs (i.e. comparative histograms) - Scales on graphs too large- Change scales on graph (smaller) - Way graphs are drawn- Alter the way the graphs may be drawn