About a survey method on sensitive matters in human life Jong-Min Kim Discipline of Statistics Division of Science & Mathematics University of Minnesota,

Slides:



Advertisements
Similar presentations
Chapter 7 Hypothesis Testing
Advertisements

Testing for Marginal Independence Between Two Categorical Variables with Multiple Responses Robert Jeutong.
Introduction Simple Random Sampling Stratified Random Sampling
Standardized Scales.
Nonparametric estimation of non- response distribution in the Israeli Social Survey Yury Gubman Dmitri Romanov JSM 2009 Washington DC 4/8/2009.
Sample size estimation
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Statistics: Purpose, Approach, Method. The Basic Approach The basic principle behind the use of statistical tests of significance can be stated as: Compare.
Introduction to Categorical Data Analysis
Analysis of frequency counts with Chi square
Data Analysis Statistics. Inferential statistics.
PSYC512: Research Methods PSYC512: Research Methods Lecture 10 Brian P. Dyre University of Idaho.
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
Privacy Preserving Market Basket Data Analysis Ling Guo, Songtao Guo, Xintao Wu University of North Carolina at Charlotte.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
Inference about a Mean Part II
Topic 3: Regression.
Stat Today: Multiple comparisons, diagnostic checking, an example After these notes, we will have looked at (skip figures 1.2 and 1.3, last.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Inferences About Process Quality
Data Analysis Statistics. Inferential statistics.
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 1: Introduction to Statistics
QNT 531 Advanced Problems in Statistics and Research Methods
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Irkutsk State Medical University Department of Faculty Therapy Correlations Khamaeva A. A. Irkutsk, 2009.
Understanding Basic Statistics Chapters Covered in Term 1 1.Getting Started 2.Organizing Data 3.Averages and Variation 4.Correlation and Regression 5.Elementary.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
© 2005 Pearson Education Canada Inc. Chapter 2 Sociological Investigation.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Yes….. I expect you to read it! Maybe more than once!
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
BPS - 5th Ed. Chapter 191 Inference about a Population Proportion.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 13: Categorical Data Analysis Statistics.
Creswell, Educational Research: Planning, Conducting, and Evaluating Quantitative and Qualitative Research, 4e © 2012, 2008, 2005, 2002 Pearson Education,
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
§2.The hypothesis testing of one normal population.
Two-Sample Proportions Inference. Sampling Distributions for the difference in proportions When tossing pennies, the probability of the coin landing.
MATH Section 3.1.
Chapter 13 Understanding research results: statistical inference.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.
Hypothesis Testing and Statistical Significance
Lesson 3 Measurement and Scaling. Case: “What is performance?” brandesign.co.za.
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
Goodness-of-Fit and Contingency Tables Chapter 11.
Quantitative Methods in the Behavioral Sciences PSY 302
One-Sample Inference for Proportions
Essential Statistics Inference about a Population Proportion
What are their purposes? What kinds?
Essential Statistics Inference about a Population Proportion
Presentation transcript:

About a survey method on sensitive matters in human life Jong-Min Kim Discipline of Statistics Division of Science & Mathematics University of Minnesota, Morris

Introduction  Why do we need Randomized Response (RR) technique ? Direct question queries often fail to get reliable data on such sensitive matters of human life. Warner (1965) invented RR technique which is designed to encourage cooperation and truthful replies to socially undesirable questions.

 Qualitative RR technique (Dichotomous nature). Warner’s RR technique (1965, JASA). Do you belong to a sensitive group? Don’t you belong to a sensitive group? An unrelated question RR technique (1969, JASA). Do you belong to a sensitive group? Do you belong to a non-sensitive group? Introduction (cont’d)

 How does RR technique work ? Campbell and Joiner (1973) - Apply an unrelated question RR technique to estimate the proportion of regular “pot” smokers. - Use the table of random numbers(00-99). Introduction (cont’d) 41 % of students use pot at least once a week (from RR Technique.) 38 % of students were regular pot smokers. (the students privately verify the technique.) P(“yes” answer) = P(Sensitive question is chosen)*P(Yes | Sensitive question) + P(Nonsensitive question is chosen)*P(Yes | Nonsensitive question).

Introduction (cont’d)  Quantitative RR technique Unrelated question RR technique (1971, JASA). - Modify a qualitative unrelated question RR technique to get a quantitative data. 1. How many abortions have you had during your lifetime? 2. If a woman has to work full-time to make a living, how many children do you think she should have? - The survey can be designed more efficiently when the mean and variance of the innocuous question are known in advance.

Introduction (cont’d)  Quantitative RR technique A new model for RR (Eriksson, 1973). - Modify a quantitative unrelated question RR technique. - Use a deck of two kinds of cards. 1. Give a true answer ! 2. Say the figure that is on your card. A new discrete RR model (Liu and Chow, 1976). - Use the Hopkins’ Randomizing Device which consists of red balls, and white balls with a discrete number. - Use the predetermined combination of balls instead of asking innocuous question.

Introduction (cont’d) Hopkins’ Randomizing Device.

Introduction (cont’d) Tracy and Fox (1981) - Conduct a field-validation of a quantitative RR technique. - Compare self-report of past criminal behavior (Arrests) Direct question vs. RR technique. - Use the Hopkins’ Device presented by Liu and Chow(1976).  Validation of RR technique Mean Reported Arrests Mean Official Arrests Two or more Arrests Sample size (n) RR technique Direct question

Research Topics  The First Research Topic: A Stratified Warner’s Randomized Response Technique.  The Second Research Topic: A Mixed Randomized Response Technique.  The Third Research Topic: A Multinomial Distribution Approach to A Quantitative Randomized Response Technique.

The Third Research  The Third Research Topic: A Multinomial Distribution Approach to A Quantitative Randomized Response Technique. Why must the problem be addressed? By using the Hopkins’ randomizing device, we can derive a multinomial distribution for sensitive categories. It means that we can easily estimate the proportion of each category. Thus the attempt is useful to treat ordinal data in quantitative manner by assigning values to the categories. For example, if we apply the model to a criminology then we can assign a value like this; one arrest (1), two arrests (2), more than two arrests (3).

Focus on the Third Research  Estimate the proportions of sensitive categories using a multinomial distribution.  Derive the true estimates of the sensitive categories through the random transformation of a lying model.  Apply quantitative RR to a multiple contrast method by Goodman (1964).  Present a Pearson product-moment Correlation between two sensitive variables.

Contents of the Third Research, and as follow: ~ MULT.. Our RR technique utilizes the Hopkins’ device to derive a multinomial distribution for sensitive categories (A 1,A 2,…A k+1 ), non-sensitive categories (B 1,B 2,…B k+1 ) and total categories (T 1,T 2,…T k+1 ). Then we can define a multinomial distribution of

Contents of the Third Research The estimate of a proportion of sensitive category h is where and. The estimate of variance is. The estimate of covariance is where.

Contents of the Third Research What is the lying model of Mukhopadhay (1980) ? Assume that there is a sensitive category for such that has no social stigma and is more social stigma as i increases. Let R ij denote the probability that a person of category A i announces himself or herself as one of category A j. Intuitively, we can stipulate the following:.

Contents of the Third Research Then Rewriting the above one like this:.. If R is nonsingular,. The estimator ofis given by.

Contents of the Third Research Observed outcomes (Estimated Expected Outcomes) Total Female Group155(57)10(10.8)10(7.2)75 Female Group275(76)21(14.4)4(9.6)100 Female Group360(57)5(10.8)10(7.2)75 Total Table 1. Observed and Estimated expected outcomes for a three-sample.

Contents of the Third Research We want to test the null hypothesis. The test statistics is The set of 95% confidence intervals about two different proportions is given by

Contents of the Third Research Not Significant Significant Not Significant Significant

Contents of the Third Research How to estimate the correlation between two sensitive variables ? For two different sensitive questions, we are going to use two Hopkins’ devices which have different ratios of red balls and green balls with a designated number. Bourke (1981) presented a multivariate RR design for category proportions. By using the multivariate RR design, we can derive the true proportion that a respondent belongs in the category i for the first question and belongs in the category j for the second one.

Contents of the Third Research The product moment correlation between two sensitive variables and is where and.,, and

Contents of the Third Research Observed Outcomes A(c 1 )=1A(c 2 )=2A(c 3 )=3Total A(r 1 )= A(r 2 )= A(r 3 )= Table 2. The number of respondents who belong to two different sensitive categories.

Contents of the Third Research The proportion that a respondent of category i and category j announces a respondent as one of category k and one of category l is The proportioncan be expressed as the Matrix form:..

Contents of the Third Research From Table 2, each can be obtained as follows: By rewriting in terms of and, Supposeand...

Contents of the Third Research Sinceis nonsingular and is known, each true proportion is..

Contents of the Third Research Suppose that we get and from the randomized randomized response technique. Then the correlation between two sensitive variables is. The relationship between two sensitive variables strongly correlated each other..

Discussion on three researches New qualitative randomized response technique - A stratified Warner’s RR technique. - A mixed randomized response technique. - A stratified mixed RR technique. New quantitative randomized response technique. - A discrete RR technique using a multinomial distribution. The purpose of this research is to cover various randomized response techniques and investigate the properties of RR technique.

Thank you.