1 Statistics Canada Research Data Centre Program* Facilities across Canada housing detailed confidential microdata and documentation files from Statistics.

Slides:



Advertisements
Similar presentations
Chuck Humphrey, Alberta Research Data Centre Canadian RDC Report Where are we now?
Advertisements

DLI & Research Data Centres Creating a better understanding of these two programs Chuck Humphrey Data Library University of Alberta April 2004.
Statistics Canada Statistique Canada September 2008/1 From a seed to a forest The Research Data Centre Program.
Toward improved health for Ontario’s First Nations: The Aboriginal People’s Survey A presentation to the Association of Local Public Health Agencies February.
1 Statistics Canada Research Data Centre Program* Facilities across Canada housing detailed confidential microdata and documentation files from Statistics.
Prince George’s County Human Services Coalition Funders Panel Presenter: Renette Oklewicz Director, Foundation Programs January 11, 2012.
Dissemination of U.S. Census Data and Results: The role of ICPSR First Conference of Al-Khawarezmi Committee on Statistics Doha, Qatar 6-8 December 2010.
Data Access and Data Use: the Missing Link? Elizabeth Hamilton University of New Brunswick Chuck Humphrey University of Alberta Data and Knowledge Transfer.
Citizenship andStrategic Research Immigration Canadaand Statistics 1 Citizenship & Immigration Canada Information Sources.
Data resources for MBC Researchers. Summary: Data uniquely available to Metropolis researchers: a) Longitudinal Immigrant Database (LIDS), crom Citizenship.
First Year in Focus at Canadian Colleges and Universities.
Meeting the Challenge The National Population Health Survey and Data Access E. Hamilton UNB Libraries IASSIST 2003.
1 Statistics Canada Research Data Centre Program* Facilities across Canada housing detailed confidential microdata and documentation files from Statistics.
Introducing ICPSR An Electronic Brochure. Our Mission ICPSR provides leadership and training in data access, curation, and methods of analysis for a diverse.
St. Lucia Country Report By Edwin St Catherine Director, Central Statistical Office Presented to IPUMS Workshop August 24 th, 2007.
British Columbia Inter- university Research Data Centre – Sociology 502 October 24, 2003.
Statistics Canada Statistique Canada mai 2005 / 1.
Presented by Tim Mark, Executive Director, Canadian Association of Research Libraries (CARL) In association with Kathleen Shearer, Coordinator of the CARL.
The ONS Longitudinal Study. © London School of Hygiene and Tropical Medicine The Office for National Statistics Longitudinal Study (LS) o What is it o.
Methodology for a school- leavers’ survey Irena Kogan MZES, University of Mannheim.
Women in Canadian Astronomy: Brenda C. Matthews (HIA) Michael A. Reid (SMA)
Immigration Data Collection: Context, Process and Challenges Immigration Data Collection: Context, Process and Challenges Margaret Michalowski Statistics.
Satya Brink, Ph. D. Learning Policy Directorate, HRSD A presentation prepared for the Symposium: Trends, Shifts, Cliffs – Program Renewal in Colleges and.
Using the Census (and other data sources) in the Social Sciences Walter Giesbrecht (Data Librarian, Scott Library) Jennifer Dekker (Reference Librarian,
UW Senate President’s Report September 19, Ontario Election – October 6, 2011 Liberal Party Platform 30 per cent tuition grant Students from families.
An Overview of ICPSR Fall What is “ICPSR”? Established in 1962, the Inter-university Consortium for Political and Social Research (ICPSR) is the.
STATISTICSSTATISTIQUECANADA Aboriginal Labour Force Survey Province of Alberta.
Statistics Canada Statistique Canada February 2007/1 The Canadian Statistical System February 22, 2007 Gustave Goldmann.
6. Implications for Analysis: Data Content. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2.
CANSIM ISR Training Committee January 21, CANSIM CANSIM is Statistics Canada's key socioeconomic database Updated daily Provides fast and easy access.
Searching for Statistics Why can’t we find the data we need? Where should we even start?
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
A Study on Completion Rates and Time to Completion of Graduate Students Methodology Adopted by the G10 Data Exchange.
Chuck Humphrey Data Library Co-ordinator University of Alberta May 16, Capitalising on Metadata Tool development plans IASSIST 2007.
Issda Irish Social Science Data Archive James McBride Director.
Longitudinal Data Recent Experience and Future Direction August 2012.
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
The Research Data Centre Program Microdata Access Division Heather Hobson April 23, 2009.
Finding Microlevel Data for Economists at Princeton University: Education and Labor.
Statistics Canada Citizenship and Immigration Canada Longitudinal Survey of Immigrants to Canada Ryerson University April 16, 2004.
October 2008 Getting to Know Data Sources SOC 3140 Prof. Sylvie Lafrenière Susan Mowers, GSG / Library.
Creating Something from Nothing: Synthetic and Dummy files Bo Wandschneider University of Guelph Chuck Humphrey University of Alberta DLI Training: Ottawa,
The Grade 9 Cohort of Fall 2000: Graduation and Post-secondary Pathways Montreal, November 2009 Paul Anisef York University Robert S. Brown Toronto DSB/York.
January 20089SOC4112 Getting to Know Data Sources Geographic, Statistical and Government Information Centre GSG Team Susan Mowers.
MCRDC Michigan Census Research Data Center The MCRDC is a joint project of the U.S. Bureau of the Census and the University of Michigan to enable qualified.
SOC 503 Techniques & Methods of Social Science Data Resources at Princeton University.
2008 NCHS Data Users’ Conference Omni Shoreham Hotel Washington, DC Wednesday, August 13, 2008.
DATA and STATISTICS … at your service! S.Mowers & the GSG team ©2009, University of Ottawa.
Creating Something from Nothing: Working with Synthetic Files ACCOLEDS /DLI Training: December 2003 Chuck Humphrey University of Alberta.
An RDC in Your Backyard Bo Wandschneider University of Guelph CAPDU 2004.
Disclosure Analysis: What do RDC Analysts do? Research Data Centre Program, Statistics Canada James Chowhan Ontario DLI Training, Queen's University
David Price October 2011 Real Time Remote Access (RTRA) #10.
DLI and EQUINOX Question 1 How do I find out what survey datasets are available from Statistics Canada ?
Health Statistics 2016 DLI Atlantic Training
Real Time Remote Access: Educational resources Susan Mowers, University of Ottawa.
The Research Data Centre Program October 14, 2015 DLI- EAC Donna Dosman.
Finding Data Files at the U of S Library Sociology 398, Social Inequality and Health Kiran Doranalli Lucy Li Data & GIS Library Services, U of S Library.
“Data from national surveys: access, analysis, and sharing”
Accessing data – a user’s perspective
Canadian Election Study
Creating Something from Nothing: Working with Synthetic Files
The Research Data Centre Program
Research Data Centre DLI Workshop (December, 2001)
The Atlantic Research Data Centre
LISA, Anticipating the Next Generation of Longitudinal Data
Susan Mowers, Data Librarian, GSG Centre - UOttawa
LISA, Anticipating the Next Generation of Longitudinal Data
Telling Canada’s story in numbers Marie-Josée Major
Capitalising on Metadata
Transnational access HIVA KU Leuven) Expertise in indicators and comparative analysis about quality of work at EU level research infrastructure About.
Presentation transcript:

1 Statistics Canada Research Data Centre Program* Facilities across Canada housing detailed confidential microdata and documentation files from Statistics Canada Statistics Canada released data that would otherwise not be available into “secure” sites.

About statistics Canada data: Canadian Community Health Survey (CCHS) Ethnic Diversity Survey (EDS) General Social Survey (GSS selected cycles) –Access to and Use of Information Communication TechnologyAccess to and Use of Information Communication Technology –Education, Work and RetirementEducation, Work and Retirement –FamilyFamily –HealthHealth –Social EngagementSocial Engagement –Social Support and AgingSocial Support and Aging –Time UseTime Use –VictimizationVictimization Longitudinal Survey of Immigrants to Canada (LSIC) National Longitudinal Survey of Children and Youth (NLSCY) National Population Health Survey (NPHS) Survey of Labour and Income Dynamics (SLID) Workplace and Employee Survey (WES) Youth in Transition Survey and the Programme for International Student Assessments (YITS-PISA) Health Services Access Survey 2

New data sources coming to the Research Data Centre Censuses of Population now available (20% sample – long questionnaire) Censuses will soon be available Health administrative data in development Residential Care Facility Survey now available: longitudinal business survey of all long-term care facilities in Canada Contact the BCIRDC office for information on any Statistics Canada surveys

National Population Health Survey Longitudinal survey of health of Canadians (smaller sample size) Data collected every second year since 1994 Detailed information on disease and disability Non-repeated modules on selected conditions such as asthma

National Survey of Children and Youth Longitudinal survey of children and parents Data collected every second year since 1994 Detailed child development measurements Detailed family and household data School level data available for some years

Youth in Transition Survey Longitudinal survey on the school to work pathways of youth Data collected every second year since 2000 Two cohorts selected in 2000: –15 year-olds (includes data from the Programme for International Student Assessment –18-20 year-olds

Survey of Labour and Income Dynamics Longitudinal survey with annual data collection Each panel has six years of data A new panel begins every third year Detailed data on family dynamics, education, income and labour Cross-sectional data from 1976 to present

Workplace and Employee Survey Longitudinal survey from 1999 to 2006 Annual data collection from employer for selected workplaces Selected workers within each workplace interviewed annually for two years Detailed information on technology, innovation and human resource practices in workplaces Allows analyses of both employer and worker characteristics

9 Other surveys adult literacy and lifestyles survey canada's alcohol and other drug survey cross national equivalency file consumer price survey Canadian tobacco use monitoring survey Employment Insurance Coverage Survey Foreign direct investment Health Promotion Survey Information and Communications Technologies in Schools Survey labour market activity survey Statistics Canada Survey of Literacy Skills Used in Daily Activities National Private Vehicle Use Survey Ontario Adult Literacy Survey Ontario Child Health Study ontario first nation regional health survey Post-Secondary Education Participation Survey Residential care facilities survey survey of displaced workers survey of independent workers School Leavers Follow-up Survey School Leavers Survey Public service employees survey survey of repeat users of EI Status vector file us health and retirement survey united stats national health interview survey

Future data sources that may become available Employment Insurance beneficiary records (10% sample) Record of Employment (10% sample) Historical Censuses of Population from 1961 to 1986 Canada Pension Plan Disability beneficiary records Health administrative records and population linked data

Other Future Developments Plans are in development to add the following to RDC dataset collection: –Cancer Registry (pilot project in progress at BCIRDC) –HRSDC administrative data –Homicide data (Cdn. Centre for Justice Statistics) [under review: pilots only] –Business data: (selected datasets from Small Business & Special Surveys Division) 11

Stats Canada data are released to universities through the “Data Liberation Initiative” Most Cdn. universities part of this Data housed in university data library (at Uvic: Kathleen Matthews, library) and copies are made available to any researcher requesting it as long as: –a) agree to terms (no re-dissemination; etc.) –b) bona fida member of the university community 12

DLI Restrictions No longitudinal data (in some cases, cross-sectional waves, not linked and with unique identifiers stripped, are available, but in other cases survey not available at all) Many variables treated as “confidential” and deleted from dataset or coarsely categorized 13

14 censored variables Full versions of datasets with censored variables + datasets not otherwise available can be worked on in a “Research Data Centre”

15

16 There are RDCs across Canada at most major universities with doctoral programs: New Brunswick, Dalhousie, Moncton Toronto (York has a “branch” which will soon develop into a full-blown centre) Waterloo (Guelph, WLU participate) McMaster (Brock participates) Western (Windsor is just opening a branch) Queen’s (part-time site) Carleton (U of Ottawa participates) Manitoba U of Saskatchewan (part-time site) 2 Alberta sites: U of Alberta; Calgary (various Prairie universities participate; Lethbridge will have a branch soon) Manitoba (branch opening in Yellowknife) Consortium (U de Montreal) with branches at UQAM, Sherbrooke, Laval McGill BC universities consortium BC consortium: UBC, SFU, UVic, Vancouver Island Univ., UNBC Statistics Canada Research Data Centre Program

17 The UVic branch works within the British Columbia Interuniversity Research Data Centre network “main” site is at UBC; open 9-5 M-F UVic site has more restrictive hours (arranged term- by-term in consultation with researchers). –Currently 15.5 hours/week (sometimes a bit less in summer) –Exact hours worked out in consultation with users

18 Support: Capital costs: –Canadian Foundation for Innovation – Office of the Provost Operating costs: – Dean of Social Science –Vice-President, Research –Dean of Graduate Studies –Dean of Business –SSHRC, CIHR “network” grant funding –Past support & seeking support for present year: Dean of Humanities; Assoc. Dean, Island Med. Pgm. Dean of Human and Social Development Dean of Education

19 What is the relationship between the RDC network and the “Data Liberation Initiative”? often users work with the DLI version of a dataset before progressing to work using the RDC StatCan will only approve projects if it can be demonstrated that DLI data is insufficient or there are no DLI files for the survey of interest contact person on campus for DLI: Kathleen Matthews

20 RDCDLI Files available -Those listed on RDC site -Other files if arragements can be made - Those listed on DLI site – see UVic library web page under Data Acquisition Files not available Any linked longitudinal dataset Recent waves of NCLSY Some newly available surveys Information on files not avail. Cluster numbersCluster numbers, Geographic detail Demographic detail Other

21 RDCDLI Who may access - Faculty with approved projects -Graduate students with approved projects (+faculty co-investigator) - Any member of UVic community with NetLink ID Where files may be worked on In Data Centre onlyMay be downloaded to be used anywhere, with agreement not to redistributed Initial contactDoug Baer or Lee Grenon (StatCan Analyst located in Vancouver at UBC) Kathleen Matthews, Data Librarian, UVic

22 & other data can be arranged There is presently a project involving BC Administrative Health data (to be linked to Stats Can survey data) For a very large list of StatCan Surveys, see the DLI website (UVic library)  click on “DLI collection” future plans: see below

23 What is the process for gaining access?

24 Application process works through SSHRC Graduate students must have faculty member as co- investigator

25 Project proposal Proposal evaluation by SSHRC peer review and Statistics Canada Very few are turned down… though must establish that confidential data are required to complete project –Does project have scientific merit? is access to confidential microdata necessary? Does researcher have expertise to conduct research? –Takes 6-8 weeks Proposals that are part of SSHRC or CIHR grants forgo the SSHRC peer review process –Approvals typically 3-4 weeks

26 Process: Submit proposal Proposal approved Security check on applicant oath, investigator becomes “deemed employee” of statistics canada Orientation session at UVic Issued access card for card reader

27 UVic facilities: 6 workstation lab with room for expansion to up to 10 workstations workstations now have widescreen monitors or dual screen configuration Server for data Most commonly used statistical software packages Some highly specialized software packages Hours are worked out to suit the needs of active researchers. Fall 2009 hours: Tues 10am-5pm; Wed 10am-4pm Thurs 12 noon – 4pm

28 Software Standard stats packages: SPSS (18), SAS (9.2) STATA (11)** [Stata/SE on 2 machines & Stata/IC on 3) Open-source stats: R Multilevel models: HLM, LISREL, MPlus SEM models: LISREL, Mplus, AMOS Specialized (Bayesian, MCMC etc.): WinBugs Other software can be obtained if demand exists.

29 Security process No output or notes can be taken out of the room Users have file drawers and access to printer inside the centre Output listings and notes (if typed into a computer file) can be released after they are “vetted” by a Statistics Canada Analyst at the main BC site Files are sent via encrypted CD to Vancouver (2-3 days) Files that are approved for release are ed back to researcher Pass card works only during centre hours (swipe in, swipe out protocol)

30 Can I work at other RDCs too? Can I work with other researchers? What about other researchers at other universities? Access is “network wide” Files are stored on a “project” basis (researchers, RAs, etc. have own account but access to shared files) UVic researchers are part of the BC consortium and could go to the UBC site if more intense periods of research are required (35 hrs/week vs. 15); project files can be sent to and from the branch (3-6 days)

31 Preparation: Check to see if dataset is one of standard RDC datasets: check –Extensive data documentation provided for listed datasets –If what you are interested is not on the list, check with Doug Baer or Lee Grenon Is a public use file available? Check with Kathleen Matthews or on library web site. Verify that variables needed for research are not on public use file. If possible, use public use file to explore data, etc. If further dataset documentation required, ask Doug Baer or Lee Grenon Go to SSHRC web page to put together application. Don’t hesitate to consult Doug Baer for help. Be prepared to specify variables to be used. Where a public use version of the dataset is available, be prepared to make clear why RDC access is needed (e.g., “a needed variable is suppressed on the public use file”).

32 Statistics Training Summer Institutes: –SPIDA (York University) –ICPSR (U Michigan) –Prairie school? (Calgary) –Possible BC initiatives –Seminar at the Congress for the Humanities & Social Sciences (this year: multilevel models) Special workshops and seminars (Baer): –Possible: SEM, survival/event history models, longitudinal data, multi-level data

33 Contact information: Doug Baer, Academic Director (Sociology) (721) – 7581 Cornett, A365 RDC (Assistant Lorraine Dame) (853) 3196 RDC Analyst at UBC: Lee Grenon, Centre web site (shows hours): web.uvic.ca/rdc