Methods of HRDU (PDU) population estimates Bruno Sopko, Ph.D. 11 th June 2014.

Slides:



Advertisements
Similar presentations
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Advertisements

Genetic Statistics Lectures (5) Multiple testing correction and population structure correction.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
How are we doing? Sort the types of error into sampling and non-sampling errors, then match the situations to the types of error.
Imperfect Testing Unit 7 Lesson 1 and 2. Lesson One Part One Vocabulary Sample Representative sample Population Percentage Pie chart.
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
1 A heart fills with loving kindness is a likeable person indeed.
Documentation and survey quality. Introduction.
Statistical Methods Descriptive Statistics Inferential Statistics Collecting and describing data. Making decisions based on sample data.
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Sampling Theory Determining the distribution of Sample statistics.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
Sampling: Theory and Methods
Investment Analysis and Portfolio management Lecture: 24 Course Code: MBF702.
Using car4ams, the Bayesian AMS data-analysis code V. Palonen, P. Tikkanen, and J. Keinonen Department of Physics, Division of Materials Physics.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Naive Bayes Classifier
Use of web scraping and text mining techniques in the Istat survey on “Information and Communication Technology in enterprises” Giulio Barcaroli(*), Alessandra.
SAMPLING.
Appendix 2.1:RDS and TLS An Overview of Probability-Based Sampling Methods For Key Populations.
Chapter 6: Random Errors in Chemical Analysis CHE 321: Quantitative Chemical Analysis Dr. Jerome Williams, Ph.D. Saint Leo University.
Data Collection and Sampling
Sampling. Sampling Can’t talk to everybody Select some members of population of interest If sample is “representative” can generalize findings.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 7 Inferences Based on a Single Sample Estimation with Confidence Intervals.
Chapter 7 The Logic Of Sampling. Observation and Sampling Polls and other forms of social research rest on observations. The task of researchers is.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Uncertainty Management in Rule-based Expert Systems
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
7: Sampling Theory and Methods. 7-2 Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. Hair/Wolfinbarger/Ortinau/Bush, Essentials.
Chapter 10 Sampling: Theories, Designs and Plans.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
Sampling Theory and Some Important Sampling Distributions.
Investigating the Potential of Using Non-Probability Samples Debbie Cooper, ONS.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
 1 Species Richness 5.19 UF Community-level Studies Many community-level studies collect occupancy-type data (species lists). Imperfect detection.
Quantitative research Meeting 7. Research method is the most concrete and specific part of the proposal.
Monitoring and Estimating Species Richness Paul F. Doherty, Jr. Fishery and Wildlife Biology Department Colorado State University Fort Collins, CO.
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
As a data user, it is imperative that you understand how the data has been generated and processed…
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
Chapter 8: Estimating with Confidence
Logic of Sampling Cornel Hart February 2007.
Logic of Sampling (Babbie, E. & Mouton, J The Practice of Social Research. Cape Town:Oxford). C Hart February 2007.
Chapter 7 Inferences Based on a Single Sample
Effect Sizes (continued)
Chapter 8: Estimating with Confidence
محيط پژوهش محيط پژوهش كه قلمرو مكاني نيز ناميده مي شود عبارت است از مكاني كه نمونه هاي آماري مورد مطالعه از آنجا گرفته مي شود .
Selecting Research Participant
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
An Online CNN Poll.
Reasoning in Psychology Using Statistics
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
2/5/ Estimating a Population Mean.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Presentation transcript:

Methods of HRDU (PDU) population estimates Bruno Sopko, Ph.D. 11 th June 2014

Content  Capture-recapture method  Multiplier method (overview)  RDS method

Two population study

Two population – non coded clients

Three and more samples  R – statistical program  R add-ons (Rcapture)  Log-linear estimation (Poisson, Chao, AIC)  Data preparation

Robust design

R-project  – –  Required packages: – Rcapture – foreign

Rcapture installation

Data preparation Subst 2010Needle 2010Subst 2011Needle 2011Subst 2012Frequency [1,] [2,] [3,] [4,] [5,] [6,] [7,] [8,] [9,] [10,] [11,] [12,] [13,] [14,] [15,] [16,] [17,] [18,] [19,] [20,] [21,] [22,] [23,] [24,] [25,] [26,] [27,] [28,] [29,] [30,] [31,] Total % use all services These data were wrongly converted

Results Closed population model for every period: M0 Model fit: deviance df AIC fitted model Capture probabilities: estimate stderr period period Survival probabilities: estimate stderr period 1 -> Abundances: estimate stderr period period Number of new arrivals: estimate stderr period 1 -> Total number of units who ever inhabited the survey area: estimate stderr all periods Total number of captured units: 2686

Multiplier method  Network size reliability  Statistical weight  Standard error of weighted mean – SPSS x “rest of the world”

Response driven sampling Method Respondent-driven sampling (RDS), combines “snowball sampling" (getting individuals to refer those they know, these individuals in turn refer those they know and so on) with a mathematical model that weights the sample to compensate for the fact that the sample was collected in a non- random way.

RDS – essential information  Personal Network Size (Degree) - Number of people the respondent knows within the target population.  Respondent's Serial Number - Serial number of the coupon the respondent was recruited with.  Respondent's Recruiting Serial Numbers - Serial numbers from the coupons the respondent is given to recruit others.

RDS - program  Analyst_Install Analyst_Install

RDS Analyst

RDS computing – data format Identifikacioni brojMrezaKupon1Kupon2Kupon3PolGodina rodjenjaGodine m m ž m m m ž m m m m m m m197835

Video

Results – recruitment tree

RDS – convergence (theory)

RDS - result MeanMedianMode90%3%98% Prior Posterior Summary of Population Size Estimation

RDS – convergence results

Thank you for your attention

Corrected CRM – preliminary bias observation  The codes supplied were from Syringe exchange program and from Substitution treatment program  These programs are mutually exclusive – therefore negatively biased  The data were collected over three years, therefore data from the same program have been positively biased  The data from Syringe exchange program for year 2012 were incomplete

Corrected CRM - method  Due to the previously described bias problems, the classical robust design method (robustd.0 or robustd.t functions in R) could not be deployed.  The capture-recapture method with bias correction over all years have been used (closedp.mX function in R), the total number of estimated PDU has been broken into years values by the corresponding code counts.

Corrected CRM - data Number of captured units: 6782 Frequency statistics: fi i = i = i = i = 4 51 i = 5 32 i = 6 15 fi: number of units captured i times

Corrected CRM - results YearPopulationSTD ErrorArrivedLeftStayedSTD Error Stayed all years STD Error