Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
IB Math Studies – Topic 6 Statistics.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
1 Economics 240A Power One. 2 Outline w Course Organization w Course Overview w Resources for Studying.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Descriptive statistics (Part I)
Chapter 2 Frequency Distributions and Graphs 1 © McGraw-Hill, Bluman, 5 th ed, Chapter 2.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
GCSE Data Handling Coursework 1 Examining the Data examine carefully the data you are given it’s important to get a feel for the raw data before you use.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
REPRESENTATION OF DATA.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
May 06th, Chapter - 7 INFORMATION PRESENTATION 7.1 Statistical analysis 7.2 Presentation of data 7.3 Averages 7.4 Index numbers 7.5 Dispersion from.
Descriptive Statistics
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
CHAPTER 1 Basic Statistics Statistics in Engineering
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
Percentiles and Box – and – Whisker Plots Measures of central tendency show us the spread of data. Mean and standard deviation are useful with every day.
Chapter 2 Describing Data.
Data Analysis Qualitative Data Data that when collected is descriptive in nature: Eye colour, Hair colour Quantitative Data Data that when collected is.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Describing Data Using Numerical Measures. Topics.
Basic Statistics  Statistics in Engineering  Collecting Engineering Data  Data Summary and Presentation  Probability Distributions - Discrete Probability.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
BUSINESS STATISTICS I Descriptive Statistics & Data Collection.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Exam Review Day 6 Chapters 2 and 3 Statistics of One Variable and Statistics of Two Variable.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
CHAPTER 1 Basic Statistics Statistics in Engineering
Sampling ‘Scientific sampling’ is random sampling Simple random samples Systematic random samples Stratified random samples Random cluster samples What?
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Introduction to statistics I Sophia King Rm. P24 HWB
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Statistics and Data Analysis
CHAPTER 1 Basic Statistics Statistics in Engineering
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
StatisticsStatistics Did you hear about the statistician who put her head in the oven and her feet in the refrigerator? She said, "On average, I feel just.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Ms. Drake 7th grade Math Measures of Central Tendency Lesson 2 Mean, Median, Mode and Range.
Descriptive Statistics – Graphic Guidelines Pie charts – qualitative variables, nominal data, eg. ‘religion’ Bar charts – qualitative or quantitative variables,
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Graphs. Types of Graphs  Bar Graphs  Pie Charts  Dotplots  Stem and Leaf Plots  Histograms  Box Plots  Scatter Plots  Normal Curves.
Descriptive Statistics
Exploratory Data Analysis
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Statistics 1: Statistical Measures
ISE 261 PROBABILISTIC SYSTEMS
BUSINESS MATHEMATICS & STATISTICS.
MATH 2311 Section 1.5.
Descriptive Statistics
DS1 – Statistics and Society, Data Collection and Sampling
Descriptive Statistics
An Introduction to Statistics
Basic Statistical Terms
10.5 Organizing & Displaying Date
Statistics: The Interpretation of Data
Presentation transcript:

Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different sampling techniques including random and stratified sampling and be able to generate random numbers with a calculator or computer to obtain a sample  Be able to design a questionnaire (taking bias into account)  Understand the need for grouping data and the importance of class limits and class boundaries when doing so

DH - Collecting Data Data Handling Sample: A sample is a subset of the population. 11A would be a subset of the following populations → year 11, senior pupils, pupils of St Mary’s Population: The total number of individuals or objects being analyzed; this quantity is user defined. E.g. pupils in a school, people in a town, people in a postal code. Discrete: A discrete variable is often associated with a count, they can only take certain values – usually whole numbers. E.g. number of children in a family, number of cars in a street, number of people in a class.

DH - Collecting Data Data Handling Continuous: A continuous variable is often associated with a measurement, they can take any value in given range. E.g. height, weight, time. Variable: See discrete & continuous above.

DH - Collecting Data Data Handling Random Sampling: In simple random sampling every member of the population is a given number. If the population has 100 member, they will each be given a number between 000 and 999 (inclusive) then 3 digit random numbers are used to select the sample (ignore repeats) Stratified Sample: Often data is collected in sections (strata). Eg. Number of pupils in a school. In selecting such a sample data is taken as a proportion of the total population. Here we should sample twice as many people in year 10 than in year 8. YearNo. of Pupils Total700

DH - Collecting Data Data Handling Stratified Sample: To obtain as sample of 70 pupils out of the 700, we construct the following table Year No. of Pupils Proportion of totalNo. of pupils to be sampled / 700 = 1 / / 700 = 1 / 7 × 70 = / 700 = 1 / / 700 = 1 / 14 × 70 = / 700 = 2 / / 700 = 2 / 7 × 70 = / 700 = 2 / / 700 = 2 / 7 × 70 = / 700 = 3 / / 700 = 3 / 14 × 70 =

DH - Collecting Data Questionnaires 1. Sample should represent population 2. Sample must be of a reasonable size to represent population (at least 30) sample mean = population mean 3. Questions should: i) be as short as possible ii) use tick boxes iii) avoid bias iv) avoid leading questions

Additional Notes

Data Handling Collecting Data Understand terms: sample, population, discrete, continuous and variable Understand the need for different sampling techniques including random and stratified sampling and be able to generate random numbers with a calculator or computer to obtain a sample Be able to design a questionnaire (taking bias into account) Understand the need for grouping data and the importance of class limits and class boundaries Learning Outcomes: At the end of the topic I will be able to Can Revise Do Further        

Data Handling Analysing Data Learning Outcomes  Understand that in order to gain a mental picture of a collection of data it is necessary to obtain a measure of average and range  Be able to determine the mean, median and mode for a set of raw scores and an ungrouped frequency table  Be able to obtain the median and interquartile range for grouped data from a cumulative frequency graph  Understand the advantages and disadvantages of each average and measure of spread

DH - Analysing Data Measures of Central Tendency Mean Sum of all measures divided by total number of measures. Mode Most popular / most frequent occurrence.  everyone included × affected by extremes × not everyone included  not affected by extremes Median Arrange data in ascending order; the median is the middle measure. Position = ½ (n + 1) × not everyone included  not affected by extremes

DH - Analysing Data Measures of Central Tendency Examples Calculate the Mean, Median and Mode for: a)3, 4, 5, 6, 6, b) 2.4, 2.4, 2.5, 2.6 * Normal distribution is where the mean, median and mode are close eg example b)

DH - Analysing Data Frequency Distribution The number of children in 30 families surveyed are surveyed. The results are given below. Calculate a)The mean number of children per family b)The median (No. of children) x (No of families) f

DH - Analysing Data Grouped Frequency Distribution Often data is grouped so that patterns and the shape of the distribution can be seen. Group sizes can be the same, although there are no applicable rules. Find the mean of: MarkFrequency ( f )Midpoint ( x ) fx 30 – – – – 699 ∑f = 51

DH - Analysing Data Cumulative Frequency Curves Find the median of the following grouped frequency distribution. LengthFrequency Cumulative Frequency Upper Limit 21 – – – – – 404

DH - Analysing Data Cumulative Frequency Curves Cumulative frequency Upper Limit Q3 Q2 Q1 Median = Measure of central location Interquartile range = Measure of spreadQ 1 = 25th percentile = Q 3 – Q 1 Q 3 = 75th percentile Q 1 = ¼ (n + 1) Q 2 = ½ (n +1) Q 3 = ¾ (n +1) = 8.25 th → 26 = 16.5 th → 30 = th → 33 Interquartile Range = Q 3 – Q 1 = 33 – 26 = 7

DH - Analysing Data Additional Notes

Data Handling Analysing Data Learning Outcomes: At the end of the topic I will be able to Can Revise Do Further          Understand that in order to gain a mental picture of a collection of data it is necessary to obtain a measure of average and range  Be able to determine the mean, median and mode for a set of raw scores and an ungrouped frequency table  Be able to obtain the median and interquartile range for grouped data from a cumulative frequency graph  Understand the advantages and disadvantages of each average and measure of spread

Data Handling Presenting Data Learning Outcomes  Revise drawing of pie charts, line graphs and bar charts  Be able to present data using a stem and leaf diagram, determine mean, Median and quartiles  Be able to draw a boxplot for a set of values and compare more than one box and whisker plots with reference to their average, spread, skewness  Be able to draw a histogram to represent groups with unequal widths  Know which diagram to use to represent data, the advantages and disadvantages of each type.  Be aware of the shape of a normal distribution and understand the concept of skewness

DH - Presenting Data Box & Whisker Plots A box & Whisker plot illustrates: a) The range of data b) The median of data c) The quartiles and interquartile range of data d) Any indication of skew within the data Scale Q1 Q2 Q3

DH - Presenting Data Scatter Diagrams y x × × × × × × × × × y x × × × × × × × ×× Positive Correlation x ▲ y ▲ Negative Correlation x ▲ y▼ * The closer the points, the stronger the correlation y x × × × × × × × × × No Correlation x & y are independent × × × ×

DH - Presenting Data Histograms 32 packages were brought to the local post office. The masses of the packages were recorded as follows Mass (g)0 < m ≤ 3030 < m ≤ 4040 < m ≤ 5050 < m ≤ 90 No of packages With unequal class widths we draw a histogram. There are 2 important differences between a bar chart and a histogram 1.In a bar chart the height of the bar represents the frequency. 2.In a histogram the ‘ x ’ axis is a continuous scale.

DH - Presenting Data Histograms GroupFrequencyClass Width Frequency Density 0 < m ≤ < m ≤ < m ≤ < m ≤ When the classes are of unequal width we calculate and plot frequency density Frequency Density = Frequency Class Width

DH - Presenting Data Stem & Leaf Diagram When data are grouped to draw a histogram or a cumulative frequency distribution, individual results are lost. The advantage of grouping is that patterns (distribution) can be seen. In a stem and leaf diagram individual results are retained and the spread / distribution of the data can be seen. Draw a stem and leaf diagram for the data: 10, 11, 12, 15, 23, 26, 29, 32, 33, 34, 35,36, 42, 43, 44, 56, 57 StemLeaf

DH - Presenting Data Additional Notes

Data Handling Presenting Data Can Revise Do Further          Revise drawing of pie charts, line graphs and bar charts  Be able to present data using a stem and leaf diagram, determine mean, Median and quartiles  Be able to draw a boxplot for a set of values and compare more than one box and whisker plots with reference to their average, spread, skewness  Be able to draw a histogram to represent groups with unequal widths  Know which diagram to use to represent data, the advantages and disadvantages of each type.  Be aware of the shape of a normal distribution and understand the concept of skewness  