CNNIC Symposium 2003 1 Conceptual and Operational Issues in the Measurement of Internet Use * Jonathan Zhu City University of Hong Kong

Slides:



Advertisements
Similar presentations
The Impact of LFS & APS Reweighting Marilyn Thomas Labour Force Survey Output Manager, Office for National Statistics.
Advertisements

Sample size estimation
Internet Development in Chinese Mainland Wang Enhai
ERCOT Analysis of 2005 Residential Annual Validation Using the Customer Survey Results ERCOT Load Profiling Presented to PWG - October 26, 2005.
Calculating The Estimates And Weights For The Vacancy Rate Prepared for the 2010 SDC-CIC Joint Steering Committee Meetings February 24, 2010 Edward C.
Prepared by Wirthlin Worldwide A Survey of Handwashing Behavior Prepared for: American Society for Microbiology September 2000 #6649.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Why sample? Diversity in populations Practicality and cost.
Questions re Internet Use How many/what percent use it How often, how much For what purposes From where (home, office, public-access computer) Differences.
Ch. 3.1 – Measurements and Their Uncertainty
BPS - 3rd Ed. Chapter 131 Confidence intervals: the basics.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Squeezing more out of existing data sources: Small Area Estimation of Welfare Indicators Berk Özler The World Bank Development Research Group, Poverty.
Determining the Size of
Joint Canada/U.S. Health Survey Catherine Simile, National Center for Health Statistics Patrice Mathieu, Statistics Canada Ed Rama, Statistics Canada NCHS.
Validity and Reliability Dr. Voranuch Wangsuphachart Dept. of Social & Environmental Medicine Faculty of Tropical Medicine Mahodil University 420/6 Rajvithi.
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Customer : contractor : December, 2012 Sociologic Research on Awareness of Industrial Property Protection Possibilities.
Unit 6: Standardization and Methods to Control Confounding.
Jan Internet Broadband Usage Adoption in Taiwan: An Overview Report Jan Dr. Te-Hsin Liang Associate Professor Dep’t of Statistics Information.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Chapter 1: Introduction to Statistics
January 2004 Internet Use and Broadband Adoption in Taiwan: An Overview Report January 2004 Dr. Te-Hsin Liang Associate Professor Dep’t of Statistics Information.
World Internet Project: Researching the use of the Internet in New Zealand NetHui 2011.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
1/26/09 1 Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Survey on Broadband and Wireless Usage in Taiwan Taiwan Network Information Center JULY 2003.
Methodology for producing the revised back series of population estimates for Julie Jefferies Population and Demography Division Office for.
1 Things That May Affect Estimates from the American Community Survey.
2008 Missoula Long-Range Transportation Plan Survey May 20, 2008.
American Community Survey (ACS) 1 Oregon State Data Center Meeting Portland State University April 14,
2.6 Confidence Intervals and Margins of Error. What you often see in reports about studies… These results are accurate to within +/- 3.7%, 19 times out.
RECONCILING THE VALUE ESTIMATES Chapter 15. Consistency Evaluate the approaches Evaluate the data Range of values Reconciliation Rounding the answer Significant.
Growing Challenges to State Telephone Surveys of Health Insurance Coverage: Minnesota as a Case Study Supported by a grant from the Minnesota Department.
American Community Survey “It Don’t Come Easy”, Ringo Starr Jane Traynham Maryland State Data Center March 15, 2011.
Things that May Affect the Estimates from the American Community Survey Updated February 2013.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Sub-regional Workshop on Census Data Evaluation, Phnom Penh, Cambodia, November 2011 Evaluation of Age and Sex Distribution United Nations Statistics.
Week 41 Estimation – Posterior mean An alternative estimate to the posterior mode is the posterior mean. It is given by E(θ | s), whenever it exists. This.
1 ACS Statistical Issues and Challenges: One-, Three-, and Five-year Period Estimates Alfredo Navarro U.S. Census Bureau Association of Professional Data.
1 Sampling Theory Dr. T. T. Kachwala. 2 Population Characteristics There are two ways in which reliable data or information for Population Characteristics.
Effects of Sampling and Screening Strategies in an RDD Survey Anthony M. Roman, Elizabeth Eggleston, Charles F. Turner, Susan M. Rogers, Rebecca Crow,
Korea Network Information Center Internet Usage Statistics in Korea Cho, Chan Hyeong (Korea Network Information Center)
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Math 3680 Lecture #15 Confidence Intervals. Review: Suppose that E(X) =  and SD(X) = . Recall the following two facts about the average of n observations.
Design of Clinical Research Studies ASAP Session by: Robert McCarter, ScD Dir. Biostatistics and Informatics, CNMC
Population vs. Sample. Population: a set which includes all measurements of interest to the researcher (The collection of all responses, measurements,
7.1 What is a Sampling Distribution? Objectives SWBAT: DISTINGUISH between a parameter and a statistic. USE the sampling distribution of a statistic to.
Restaurant Smoking Policies and Reported Exposure to ETS The case of Massachusetts Tandiwe Njobe National Conference on Tobacco or Health November 2002.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Chapter 7: The Distribution of Sample Means
Briefing on Census Data Evaluation in China Xiao Ning Guo Hui Hanoi, Viet Nam 2-6 December 2013.
“Neighborhood Social Planning and Development” NEBSOC WORK PACKAGES (DATA COLLECTION STRATEGY) & 3.2 (DEEPENING AND IDENTIFICATION OF THE SOCIAL.
E-Reading Rises as Device Ownership Jumps BY: KATHERINE ZICKUHR AND LEE RAINIE.
As a data user, it is imperative that you understand how the data has been generated and processed…
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Chapter 11: Test for Comparing Group Means: Part I.
Indices of Deprivation Measuring change between ID2004 and ID2007 Kate Wilkinson University of Oxford.
Presentation of Results Mr. Joel LaLone Joel LaLone Consulting Watertown, New York April 11, 2014.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section B 1.
Membership in Online Research Panels presentation of results from a BRI Omnibus telephone survey April 29, 2010 asdf.
The DUE Process of the Internet in Hong Kong and Mainland China: A Longitudinal and Comparative Study, * Jonathan J. H. Zhu City University of.
ICT Market Follow up in Morocco Market Observatory/ANRT MOROCCO
香港大學民意研究計劃 The University of Hong Kong Public Opinion Programme
Sample Sizes for IE Power Calculations.
Presentation transcript:

CNNIC Symposium Conceptual and Operational Issues in the Measurement of Internet Use * Jonathan Zhu City University of Hong Kong * Funded by the UGC of HKSAR

CNNIC Symposium Background: the Diffusion of the Internet in Hong Kong, Beijing and Guangzhou Source: J. H. Zhu (2003)

CNNIC Symposium Internet Penetration Rate in East Asia

CNNIC Symposium Wired Internet Use vs. Wireless Internet Use

CNNIC Symposium Diffusion of Cable TV, the Internet, and Mobile Phone in Hong Kong

CNNIC Symposium Internet vs. Mobile Phone in Beijing and Guangzhou

CNNIC Symposium Issues in Measurement of Internet Use and Users The size of “Internet users” in a society is a function of: Definition of study population (SP) Method of sample weighting (SW) Requirement of minimal usage (MU) The amount of “online time” by Internet users is a function of: Definition of study population (SP) Method of sampling weighting (SW) Method of data collection (DC) Treatment of extreme values (EV)

CNNIC Symposium Criteria for Evaluation of Measurement Validity: how accurate or correct is the measure as compared with the “truth”? Reliability: how precise or stable is the measure over time and/or across space? Practicality: how efficient or economic is the measure in data collection and analysis?

CNNIC Symposium Data Hong Kong Survey 2002: telephone interviews of 1,800 residents at 6 and above in Dec by Jonathan Zhu and his team AC Nielsen/Netratings : online tracking of 1,500 Internet users from 811 households in Hong Kong in Oct and Jan

CNNIC Symposium Definitions of Study Population WIP-Hong Kong: CNNIC: 6+ Another popular definition: 18+ HK Census 2002: 6-17: 16.4% 18-74: 80.0% 75+: 3.6%

CNNIC Symposium Impact of Population Definitions on Internet User Size Data: Hong Kong 2002

CNNIC Symposium Requirements of Minimal Usage Minimal Usage Required? Last Usage Specified? YesNo Yes?? No CNNIC (1 hour/week) WIP

CNNIC Symposium Impact of Minimal Requirements on Internet User Size Data: Hong Kong 2002

CNNIC Symposium Age Distribution of the Sample before and after Weighting Data: Hong Kong 2002

CNNIC Symposium Impact of Weighting Methods on Internet User Size Data: Hong Kong 2002

CNNIC Symposium Summary: Internet Users by Population, Usage Requirement & Weighting Method Data: Hong Kong 2002

CNNIC Symposium A Mathematical Model of “True” Internet Users (TIU) TIU = 55.3 – 1.4SP SP MU – 5.4SW (Adjusted R 2 = 99.6%, Standard Error = 0.3%) Where TIU is the “Unadjusted” Internet Users (%) for HK in 2002, which should be 1.4% less for a study population of 18-74, or 3.7% less for a study population of 18+, or 4.5% less if those use the Internet less than 1 hour per week are excluded, or 5.4% less if the sample is weighted based on population census.

CNNIC Symposium Impact of Population Definitions on Online Time (at Home) Data: Hong Kong 2002

CNNIC Symposium Impact of Weighting Methods on Online Time (at Home) Data: Hong Kong 2002

CNNIC Symposium Impact of Extreme Values on Online Time (at Home) Data: Hong Kong 2002

CNNIC Symposium Impact of Data Collection (DC) Methods on Online Time Data: HKS 2002 & Netratings

CNNIC Symposium Summary: Online Time by SP, SW, DC, and EV Data: Hong Kong 2002

CNNIC Symposium A Mathematical Model of “True” Online Time (TOT) TOT = SP – 22SW – 49EV - 249DC (Adjusted R 2 = 93.5%, Standard Error = 34.3) Where TOT is the “Unadjusted” Online Time (min.) for HK users in 2002, which should be 16 min. more for a study population of 18-74, 22 min. less if the user sample is weighted, 49 min. less if extreme values are removed, or 249 min. less if data are collected through online tracking method.

CNNIC Symposium Caution: Different Definitions of “Online” Activities Telephone interview data include: Online time at both home (68%) and elsewhere (32%); Non-HTTP based activities such as using POP3 (=136 min./week) and other protocols; Online tracking data include: Online time only at home; Only HTTP=based activities protocols). It is estimated that tracking data may measure only 51% of the total online time..

CNNIC Symposium Estimated Distribution of Online Time by Location and Protocol of Usage Usage Location Online Activities HTTP basedNon-HTTPTotal Home Online Tracking (51%) 17%68% Elsewhere24%8%32% Total75%25%100%

CNNIC Symposium Conclusion: How Many Internet Users Are There? The size of “Internet Users” is significantly affected by the definition of study population (SP), the requirement of minimal usage (MU) and the method of sample weighting (SW). SP (e.g., general population vs. adults) may produce a difference of 1-4% and MU (e.g., no requirement vs. 1 hour per week) up to 5%. While there is no “correct” definition of SP or MU, it is important to report the definition and adopt, whenever possible, multiple definitions. SW (weighted vs. unweighted) may contribute another 5% difference. Since Internet use is highly correlated with age and sex, it seems both necessary and effective to weight the sample to ensure the accuracy of the measurement.

CNNIC Symposium Conclusion: How Much Time Do They Spend Online? The amount of online time is marginally affected by SP (p = 0.3) and SW (p = 0.2) probably due to the fact the base of analysis is already restricted to users. Online time is significantly affected by the treatment of extreme values (EV), which may inflate online time by up to 10%. It is thus necessary to control for it (i.e., removing EVs). Online time is most significantly affected by the method of data collection (DC, e.g., interviews vs. online tracking), which may result in a difference of 2-folder. Although online tracking is generally more accurate, it is far more expensive and impractical in many societies. It is thus important to keep in mind the magnitude of inflation in self-reported data.

CNNIC Symposium Ultimate Criteria for Evaluation Validity: how accurate or correct is the measure as compared with the “truth”? Reliability: how precise or stable is the measure over time and/or across space? Practicality: how efficient or economic is the measure in data collection and analysis?

CNNIC Symposium Consistency in Measurement of Internet Users over Time and across Space * * Based onWIP definition.

CNNIC Symposium Stability in Measurement of Sex Ratio among Internet Users in Hong Kong

CNNIC Symposium Stability in Measurement of Online Locations in Hong Kong

CNNIC Symposium Consistency in Difference between Methods across Age Cohorts Age Telephone Interview Online Tracking Interview /Tracking Total

CNNIC Symposium Final Verdicts Measurement of Internet users and online time based on interviews data is largely reliable over time and across space. The interview-based measurement is generally more practical than online tracking method. The interview-based measurement is generally weaker in validity, as compared to online tracking method. However, it could be adjusted if the departure from the “truth” is known (e.g., based on comparison with online tracking data.