Discovering Mathematics Week 5 BOOK A - Unit 4: Statistical Summaries 1
Unit 4: Statistical Summaries This unit includes the following sections: Questions Dealing with data Summarising data: Location Summarising data: Spread Measuring with accuracy and precision Statistical Pictures (Unit 11) Discussing TMA 2
1. Questions This unit deals with statistical thinking. It starts with questions that can be addressed by statistical methods. Statistical information is all around you and arises as an attempt to answer questions of various kinds. (e.g. Should people stop smoking?) Statistical thinking is presented as a helpful way of seeing the world quantitatively. For examples: table of numbers, graphs & charts. 3
Economics – Forecasting – Demographics Sports – Individual & Team Performance Engineering – Construction – Materials Business – Consumer Preferences – Financial Trends Section 0: What is Statistics? Application Areas 4
Types of statistical questions Most of the questions of investigations can be categorized into one of the following types : -Summarising: how can the information be reduced. -Comparing: is there a difference? -Seeking a relationship: what sort of relations is there. 5
Types of statistical questions(II) Summarizing questions: Summarizing information is by reducing the many figures to just one representative number or using charts or plots. Examples of investigations: -How many people die from road accidents each day. -How old are the students studying MU123 Comparing questions: Comparing investigation is to decide whether or not there is a difference between two things or comparing between two averages. Examples are: -Do more people, on average, die from road accidents on weekdays or at weekends? -Are students studying MU123 older or younger than students on LB160? 6
Types of statistical questions(III) Relationship Questions: Investigations could be a relationship between quite separate things. What the relationship is. For example: -As the numbers of road deaths in different countries linked to their respective speed limits? -What is the connection between the numbers of hours that students work on a level 3 course in mathematics and their final grade. Discuss in the class Activities I and 2 (page 179) and distinguish between the three types of statistical investigations. 7
The statistical investigation cycle There are four stages in most statistical investigations: P: Posing a question. This is the start of the investigations. C: Collecting data, choosing samples or design a survey. A: Analyze the data, calculating averages and plotting graphs I: Interpreting the results, takes the action back and see whether the data analysis help to answer the posed question. (check Activities 3 and 4 for complete example) 8
Activity 9
Dealing with Data Data that you collect yourself are called primary data. Data that already exist and can be used or adapted are called Secondary data. It is extremely important when presenting any dataset, whether primary or secondary, is to provide an accurate reference to the data source. Take a quick look at the dataset at page 184. Explain the rows and columns. Solve Activity 6. Discrete data are data that can take one of a particular set of values. E.g: Number of days in a week you study, or number of websites you visit per day. Binary data: are the data coded by two numerical values. 1,0 Solve activity 9 10
Summarising data: location Statisticians call the location of a dataset to represent a single number that represents an ‘average’, typical or ‘central’ value. It is always good idea to scan the data, looking closely at the numbers to see if there is a missing data, inspect the data. Measuring Location: The most important summary of a dataset is a measure of its location. The three most common measures of location are: Mean, the mode and the median. Let us look at the following examples: 11
Summarising data: location (II) 12
Summarising data: location (II) 13
Summarising data: Spread A second basic property of data is how widely spread the values are. Range: TMA marks for six students are: 42, 58, 60, 68, 78, in MIN and 92 is MAX, this show the data is wide spread. Another TMA marks are: 60, 65, 72, 74, 75, 80 this ranges from 60 to 80, this is much narrower. Range is the difference between the two values Maximum and Minimum after arranging the dataset in increasing order. Range = Max – Min = = 20 14
Summarising data: Spread Quartiles: Lower quartile (Q1) and upper quartile (Q3) The Median Q2 is (72+74)/2 = 73 Lower quartile = 65 Upper quartile = 75 Interquartile range = 75 – 65 = 10 15
Standard Deviation Standard deviation (SD) is the best known measure of spread. To find the SD, follow the following steps: Assume that we have the following data: 1,2,4,6,7 Solve Activity 26 16
Summary Location and spread are the two key forms of data summary, that two measures of location are the mean and median, and that three measures of spread are the range, the interquartile range and the standard deviation. 17
Exercise: consider the dataset below, which consists of the weights in kilograms of 32 women at the start of their pregnancies A class is one of the categories The class frequency is the number of observations in a particular class. Unit 11. Statistical pictures Section 2: Histograms, bar charts, pie charts 2.1 Histogram (Continuous data) 18
Exercise: The bar chart below shows the frequencies with which the six scores 1, 2, 3, 4, 5, 6 cropped up as a result of 30 rolls of a die. Section 2: Histograms, bar charts, pie charts 2.2 Bar charts (Discrete data) 19
Exercise: MajorCount Accounting130 Economics20 Management50 Total200 Section 2: Histograms, bar charts, pie charts 2.2 Bar charts (Categorical data) 20
Econ. 10% Mgmt. 25% Acct. 65% 36° Section 2: Histograms, bar charts, pie charts 2.4 Pie Charts (Categorical data) 1.Shows breakdown of total quantity into categories 2.Useful for showing relative differences 3.Angle size =(360°)(percent)
Revision : The four stages of a statistical investigation cycle PCAI Stage 1 Pose a question Stage 2 Collect relevant data Stage 3 Analyse the data Stage 4 Interpret the results 22
Revision: Recall Activity: Here are nine of the common types of task that tend to arise in the C, A and I stages of a statistical investigation. Try to match each task to one of these three stages 1.Calculate an average 2.Calculate a percentage 3.Choose a set of values, or sample 4.Make a decision based on an observed, numerical difference 5.Design a questionnaire 6.Draw a conclusion 7.Draw a helpful graph 8.Key the data into a spreadsheet 9.Make a prediction about the real world 23
Discussing the TMA 24
Thank you