Using a statistics package to analyse survey data (Module 2, Session 8)

Objectives of this session. You should be able to: use a statistics package to produce tables and graphs of frequencies and proportions; produce tables of summary statistics; explain why weights are sometimes needed in analysing survey data; and produce weighted tables of counts and other statistics.

How to describe data well (review). Look for oddities in the data and be prepared to adapt the summaries that you calculate. Study the data as tables and graphs. Use frequencies and percentages to summarize categorical variables. Use averages and measures of variability to summarize numeric variables. Identify any structure in the data and use it in producing your summaries.
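The review points above can be sketched in a few lines of Python. This is only an illustration, not the course's Instat workflow: the records, the variable names `variety` and `yield_t_ha`, and the helper `frequency_table` are all hypothetical. It shows a frequency table with percentages for a categorical variable, and a mean and standard deviation for a numeric one.

```python
from collections import Counter
from statistics import mean, stdev

# Hypothetical survey records (illustrative values only, not the course data).
records = [
    {"variety": "new", "yield_t_ha": 4.0},
    {"variety": "old", "yield_t_ha": 2.0},
    {"variety": "old", "yield_t_ha": 3.0},
    {"variety": "traditional", "yield_t_ha": 3.0},
]


def frequency_table(values):
    """Return {category: (count, percentage of all observations)}."""
    counts = Counter(values)
    total = sum(counts.values())
    return {category: (n, 100.0 * n / total) for category, n in counts.items()}


# Categorical variable: frequencies and percentages.
freq = frequency_table(r["variety"] for r in records)

# Numeric variable: an average and a measure of variability.
yields = [r["yield_t_ha"] for r in records]
summary = {"n": len(yields), "mean": mean(yields), "sd": stdev(yields)}

for category, (n, pct) in freq.items():
    print(f"{category:12s} {n:3d} {pct:6.1f}%")
print(summary)
```

A statistics package does the same job through menus or commands; the point of the sketch is only that the two types of variable call for different summaries.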

Look at the data. The two types of variable (categorical and numeric) are summarized in different ways.

Analysis to meet objectives: simple objectives, and not-so-simple objectives.

Meeting simple objectives. Summaries made with Instat (see Practical 1).

Answering more complicated objectives, and explaining some of the variability, as shown in Module 1. These summaries were also made with Instat.

Practical 1. Reviews the construction of tables using a statistics package, particularly to look at percentages, because percentages have to be understood clearly to analyse multiple response data. This practical also gives more practice in the use of a statistics package.

Common complications when analysing survey data. Common complications include: missing values in survey data; weights, which are sometimes needed because some observations represent more of the population than others; and multiple response questions, which have to be processed. These are all easier to handle with an appropriate statistics package. Here, as an example, we introduce the need for weights.

Introducing weights. Suppose a sample of 2 farmers:

Farmer   Yield
A        1 t/ha
B        2 t/ha

What is the mean? Obviously it is (1 + 2)/2 = 1.5 t/ha! But…

Introducing weights (continued). Suppose the same sample of 2 farmers:

Farmer   Area     Yield    Production
A        5 ha     1 t/ha   5 tons
B        0.5 ha   2 t/ha   1 ton

Now what is the mean? It could still be (1 + 2)/2 = 1.5 t/ha. Or it could be (5 + 1)/5.5 ≈ 1.1 t/ha.

But which is right? They are both right, but they answer different questions. Take food security: are you interested in the farmer, or the production, or both? If the farmer is the unit of interest, then there are 2 farmers and the mean is 1.5 t/ha. If the area is the unit of interest, then there are 5.5 ha, and Farmer A is 10 times as important as Farmer B, so a weighted mean is produced.

The weighted mean. So if the area is of interest, then with

Farmer   Area     Yield
A        5 ha     1 t/ha
B        0.5 ha   2 t/ha

weight each yield by the area it represents:

mean = (1*5 + 2*0.5)/5.5 ≈ 1.1 t/ha

Here the areas are the weights. Weights are used when different observations represent different proportions of the population.
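The two-farmer calculation can be checked with a minimal Python sketch. The function name `weighted_mean` is an illustrative choice, not something from the course materials; the yields and areas are the ones on the slide.

```python
def weighted_mean(values, weights):
    """Weighted mean: sum(value * weight) / sum(weight)."""
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    return sum(v * w for v, w in zip(values, weights)) / total_weight


yields = [1.0, 2.0]  # t/ha for farmers A and B
areas = [5.0, 0.5]   # ha for farmers A and B, used as the weights

# Farmer as the unit of interest: each farmer counts equally.
unweighted = sum(yields) / len(yields)

# Area as the unit of interest: each hectare counts equally.
weighted = weighted_mean(yields, areas)

print(f"unweighted mean: {unweighted:.2f} t/ha")
print(f"weighted mean:   {weighted:.2f} t/ha")
```

The weighted mean is the total production (6 tons) divided by the total area (5.5 ha), which is why it answers the question about area rather than about farmers.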

Weights in the Tanzania agriculture survey. Each weight is the number of people in the population represented by that observation. It was roughly a 1% sample, so the weights are about 100. The technical guide explains the calculations.
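The link between a 1% sample and weights of about 100 can be sketched as follows. This assumes the simplest possible case, where every unit has the same chance of selection, so the weight is the population size divided by the sample size. The population and sample sizes below are hypothetical, and the survey's real weights follow the calculations in its technical guide, not this shortcut.

```python
def equal_probability_weight(population_size, sample_size):
    """Weight per sampled unit when every unit is equally likely to be chosen."""
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    return population_size / sample_size


# Hypothetical figures: a 1% sample of 50,000 units.
population_size = 50_000
sample_size = 500

weight = equal_probability_weight(population_size, sample_size)
sampling_fraction = sample_size / population_size

print(f"sampling fraction: {sampling_fraction:.0%}")
print(f"weight per observation: {weight:.0f}")
```

Summing the weights over the sample recovers the population size, which is why weighted tables give estimated population counts rather than sample counts.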

Practical 2. Weights using a statistics package. First the rice survey, weighting by the size of field. Then the Tanzania agriculture survey: investigate ownership of radios by type of farming household.

Possession of radio by type of farming: unweighted analysis. This uses the observed numbers and percentages in the sample. Look at livestock, but note that the numbers are small.

Possession of radio by type of farming: weighted analysis. This gives the estimated numbers and percentages in the region of Tanzania. Look at livestock now: what do you conclude?

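Why such a large change with weighting? Examine the weights for these 2 groups: the 10 observations in one group have an average weight of 60, and the 42 observations in the other have an average weight of 20. So the estimated % with radio = 100*(42*20)/(10*60 + 42*20) = 100*840/1440 ≈ 58%, using these rounded average weights.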
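The effect of the weights can be reproduced with a short Python sketch. The counts (42 and 10) and the average weights (20 and 60) come from the slide's formula. The group labels `with_radio` and `without_radio` are an assumption inferred from that formula, and `weighted_percentage` is an illustrative helper, not a function from any statistics package.

```python
def weighted_percentage(groups, target):
    """Estimated population percentage in `target`.

    `groups` maps a group name to (sample count, average weight).
    """
    estimated = {name: n * w for name, (n, w) in groups.items()}
    return 100.0 * estimated[target] / sum(estimated.values())


# Assumed labels; counts and rounded average weights as quoted on the slide.
groups = {
    "with_radio": (42, 20),
    "without_radio": (10, 60),
}

sample_total = sum(n for n, _ in groups.values())
unweighted = 100.0 * groups["with_radio"][0] / sample_total
weighted = weighted_percentage(groups, "with_radio")

print(f"unweighted % with radio: {unweighted:.1f}")
print(f"weighted % with radio:   {weighted:.1f}")
```

With the rounded average weights the arithmetic gives a little over 58%, against roughly 81% unweighted. The smaller group carries three times the weight per observation, so it pulls the estimate down sharply.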

And always take care with small numbers. The sample is large overall, but it is still a small sample of livestock-only farmers.

Can you now: use a statistics package to produce tables and graphs of frequencies and proportions; produce tables of summary statistics; explain why weights are sometimes needed in analysing survey data; and produce weighted tables of counts and other statistics?