Session 8 Data Processing

Slides:



Advertisements
Similar presentations
National Institute for Statistics and Geography (INEGI) is, from 2008, an autonomous institute in Technical and Managing matters. According to Mexican.
Advertisements

Business Case for Industriali- sation in Statistics Estonia: Small Example of a Large Trend MSIS 2013 Allan Randlepp Tuulikki Sillajõe.
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
Producing migration data using household surveys Experience of the Republic of Moldova UNECE Work Session on Migration Statistics, Geneva, October.
Electronic EDI e-EDI. The EDI has been in use since 1999 using a paper-based system and computerized spreadsheets to collect and manage EDI data. Over.
Electronic reporting in Poland 27th Voorburg Group Meeting Warsaw, Poland October 1st to October 5th, 2012 Central Statistical Office of Poland.
2011 Housing and Population Census in the Czech Republic UNECE – Eurostat Meeting on Population and Housing Censuses Geneva, May 2012.
Register-Based Census 2011 in Slovenia – Some Quality Aspects Danilo Dolenc Statistical Office of the Republic of Slovenia UNECE-Eurostat Expert Group.
(Re)designing administrative data – towards register based census. Estonian experience Kristi Lehto Statistics Estonia Methodology and analysis department.
Collecting grid based data Census 2000 and 2011 Diana Makarenko-Piirsalu Statistics Estonia.
Core variables in Estonian social surveys Merle Paats Statistics Estonia.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Asunción,
The traditional census: challenges, problems and solutions THE 2014 MOLDOVAN CENSUS Lucia SPOIALĂ Director General NBS Moldova.
18/03/2010 Population and Housing Censuses in South East Europe 1 Summary results of the first meeting of the Technical Coordination Group (TCG) for the.
13-Jul-07 State of the art of the ISCO-08 implementation.
Enumeration via internet – Estonian experience Diana Beltadze, Ene-Margit Tiit Statistics Estonia UN Office in Geneva Meeting on Population and Housing.
How official statistics is produced Alan Vask
Component D: Activity D.3: Surveys Department EU Twinning Project.
Jordan Population and Housing Census 2015 Prepared by: Ahmad Mowafi
Quality Assurance in Population and Housing Censuses
Quality assurance in population and housing census SUDAN’s EXPERIANCE in QUALITY assurance of Censuses By salah El din. A . Magid OUR EXPERIANCE IN 5.
Data Processes on 2011 Population And Housing Census Of Turkey
Population Census 2009 in the Republic of Kazakhstan
United Nations Statistics Division
Rudi Seljak, Aleš Krajnc
Session 8 Data Processing Estonian case study
Post Enumeration Survey Census
28 November - 1 December 2016, Amman, Jordan
TURKISH STATISTICAL INSTITUTE Demography Statistics Department Population and Migration Statistics Group 1.
Population Census in the Republic of Belarus: Experience and Perspectives Natalia Bandarenka State Institute of Management and Social Technologies of the.
Quality Assurance in Maldives Population and Housing Census 2014
Statistical Centre of Iran (SCI)
LFS ad hoc module 2009 “Entry of young people into the labour market”
2011 POPULATION AND HOUSING CENSUS OF TURKEY
Quality Assurance Plans of Turkey for 2021PHC
28 November - 1 December 2016, Amman, Jordan
ESTP COURSE ON PRODCOM STATISTICS
Multi-Mode Data Collection Approach
Use of handheld electronic devices for data collection in GeoStat
2011 POPULATION AND HOUSING CENSUS PREPARATORY WORKS
Communication challenges faced in the organisation of the 2011 Population and Housing Census Anne Nuka Head of the Marketing and Dissemination Department.
Towards 2021 Population Census in the Republic of Serbia
CENSUS 2013 Key experiences Budapest, June 2017.
The Computer-Assisted Personal
2011 Population and Housing Census of Turkey
Technical Coordination Group for the next Census round in South East Europe EUROSTAT PREPARATION FOR CENSUS 2020 MONTENEGRO Budapest Jun 2017.
Main lessons learnt from the censuses in Hungary
2011 POPULATION AND HOUSING CENSUS PREPARATORY WORKS
Albania 2021 Population and Housing Census - Plans
2014 General Population Census of Georgia
Multi-Mode Data Collection Approach
UF Directory Coordinator Training
Eurostat WG on Population and Housing Censuses
Towards Census 2021 in Hungary
Data validation in Liechtenstein
Basic preconditions The next round of population and housing censuses is scheduled for the start of the new decade (2021), both in the EU and in the partner.
Key Considerations for Planning and Management of Census Operations
Lao in Census Quality Assurance
Barış DULKADİR TURKSTAT Expert
Multi-Mode Data Collection
DATA COLLECTION WITH INTERNET
Technical Coordination Group, Zagreb, Croatia, 26 January 2018
Management and Monitoring Systems
USE OF ELECTRONIC DATA COLLECTION TECHNOLOGIES: MAIN DRIVERS AND
PRODCOM Working Group JMO M November 2012
Data Validation practice in Statistics Lithuania
Turkish Statistical Institute Demographic Statistics Department
Kaija Ruotsalainen Statistics Finland
International Standards and Contemporary Technologies,
Presentation transcript:

Session 8 Data Processing United Nations Technical Meeting on Use of Technology in Population and Housing Censuses 28t h - 1st December 2016 Statistics Estonia Diana Beltadze

New dimensions Using data sources – some data is taken from registers and not asked from respondents personally, some data is asked personally. Response method – self-enumeration, personal interview and proxy interview are used at the same time. Response technology – via internet and using computers, paper questionnaires are prepared as backup in order to reduce risks. UNECE meeting in Amman

Different prognoses Only experience in Estonia – Doctors’ career study (pilot) >50% Estonian pilot census (2009—2010) -- 21% Statistical prognosis – 27% Expert prognoses – from 10% up to – 75% Statistics Estonia’s official prognosis was 25%. Using this prognosis the number of enumerators was decided. Using this prognosis the technical parameters of data channels were calculated 01.01.2006

PHC 2011: methodology For the first time in Estonia, PHC 2011 used a combined methodology for the census. Use of registers: In the preparation of the census; In the conduct of the census; In the processing of census data. e-census Interview census better quality of the Census. Persons covered by the Census comprise those whose permanent place of residence on 31 December 2011 is in Estonia, who for the moment of Census have been living or are going to live in Estonia for at least 12 months (incl. illegals and persons with residence permits, except for the diplomatic staff of foreign countries, military and marine personnel of foreign countries and family members living with them in foreign countries), as well as persons who have left Estonia for a foreign country by the moment of Census, but who are not planning to stay away from Estonia more than 12 months; also the diplomatic staff of the Republic of Estonia and military and marine personnel and family members living with them in foreign countries during the Census. During the Census, Statistics Estonia co-operates with 2,236 institutions to enumerate members of institutional households and homeless people. To get better results, all Census employees are trained, because as mentioned already at the beginning of the article, “Everyone counts”. 1.12.2018 UNECE meeting in Amman

The additional opportunities resulting from the use of computers: By taking into account the information entered in real time, the length of questionnaires to be filled in by individual users can be minimized, as people will not see the questions that they do not have to answer; Implementation of logical controls enables immediate detection of mistakes in answers and presentation of suggestions for correction; The amount of information shown to a person can be optimized by placing additional information in help texts which can be opened only if needed; Classifications displayed in the form of menus can be used for questions with a long list of response options. UNECE meeting in Amman

When the texts were used? Occupation (description of the work) Area of activity (EMTAK) All other variables: 3—7 values (answers) in questionnaire Long list (classificator) Possibility to add text, when did not find from list In occupation sometimes ironic or humoristic answers were given (internet). Texts were better written in the case of internet

Data collection process 5 UNECE meeting in Amman

Data collection on the Web 6 UNECE meeting in Amman

Progress of the interview census The enumerators were on schedule: 23.02 – 8 % 11 % 01.03 – 30 % 38 % 08.03 – 60 % 66 % 29.03 – 100 % 99 % Officially, there were 32 people who refused to be interviewed. 1.12.2018 UNECE meeting in Amman

Logical controls Hard controls (father is younger than son). You cannot continue without correction! Soft control (difference of spouse’s ages is more than 19 years) Is this true? Possibility to continue. Also a control (in most cases soft): You did not answer the question… Internet version did not have the answers „Do not know“ and „No answer“, interviewers had such buttons.

Helpdesk statistics UNECE meeting in Amman

Changes in data collection Pre-filling was used in the e-census. Practically paperless – during the interview census, enumerators entered the answers on a laptop and the data were sent to a raw database. This meant saving on data entry resources. Logical controls were used during the census to prevent most of the possible entry mistakes. The room coordinates of each residential building were determined using GPS. 1.12.2018 UNECE meeting in Amman

Current issues after data collection Identification of persons (missing personal codes) Identification of addresses Elimination of duplicates Encoding Majority of identification and encoding exercises are completed by the end of April Data processing up to the 31 of October 2012. 1.12.2018 UNECE meeting in Amman

Data arrangement UNECE meeting in Amman

Our system Population register VVIS Buildings register, Land Board Data collection system, primary data correction Education register VAIS Databases for EUROSTAT and local customers Data correction system, imputations, controls etc UNECE meeting in Amman

Data processing The first stage of data processing, primary data arrangement, is a parallel activity during data collection and immediately after that and as a result the collected data will be prepared for the next stages of data processing. The primary data arrangement includes automatic data control and manual processing by operators (handling of duplicates, coding). A separate step is controlling the eliminated questionnaires, which is carried out by quality control managers. 1.12.2018 UNECE meeting in Amman

Data arrangement Automatically and manual arrangement of addresses - do all the entered addresses exist identification of persons - is the personal identification number of every person valid coding separating duplicates Separating incomplete and interrupted questionnaires from complete questionnaires Adding incomplete and interrupted questionnaires to the enumerators work tasks Data arrangement started on the first week of eCensus and was proceeded during the whole eCensus period, UNECE meeting in Amman

Quality of raw data Response rate of variables more than 99%. In general, in internet higher than in the case of interview.

Primary data correction team Data correction manager Operator for ID codes Coders Operator for addresses Operator for duplicates Coder Senior coder UNECE meeting in Amman

Data correction manager shares roles and tasks in IT system shares tasks by dictionary between coders helps operators to solve more difficult problems activates automatic controls for finding duplicates 19.08.2010 UNECE meeting in Amman

Different operators The operators are responsible for resolving any conflicts in the received data: ID operators check and add personal ID codes from Population register data; address operators check and correct addresses of buildings; duplicate operators solve automatically unsolvable duplicate tasks; coders encode text answers according to prescribed rules. 19.08.2010 UNECE meeting in Amman

Encoding of text answers Variable Manual % Automat % Occupation 98,1 1,9 EMTAK 91,9 8,1 Religion 68,0 32,0 VS-RTK 50,4 49,6 Citizenship 71,4 28,6 Language 0,4 99,6 Nationality 73,3 26,7 Dialect 52,9 47,1 Address 100,0 0,0

Conclusion:different ways to avoid and correct mistakes using data from population register controls in time of filling in questionnaire using classifications in questionnaire identification persons and addresses coding removing duplicates additional controls imputation UNECE meeting in Amman

Thank You! 1.12.2018 UNECE meeting in Amman

UNECE meeting in Amman