An Introduction to the Social Science Databases Social Sciences Research and Instructional Council California State University, Fullerton September 24, 2005
Facilitators Ed Nelson, Sociology CSU Fresno John Korey, Political Science Cal Poly, Pomona Mike Reibel, Geography Cal Poly, Pomona
Acknowledgements {Add names and affiliations}
Workshop Agenda Overview Using ICPSR data and its web site Using Field data and the Berkeley archive Using Roper data Using the data for teaching A Brief Introduction to SPSS Lunch Individual questions and help
SSRIC The Social Sciences Research and Instructional Council Oldest CSU discipline council Founded in 1972 Representatives from CSU campuses meet three times per year Negotiates with data providers for access to data Promotes use of data analysis in research and teaching
SSRIC The Social Sciences Research and Instructional Council 1. Sponsors an annual spring student research conference (at CSUN in 2006) 2. Sponsors travel to ICPSR summer workshops in Ann Arbor, MI http://www.icpsr.com/training/summer/index.html 3. Works with Field Research Allocates question credits to California Field Poll Selects a faculty fellow Selects a student intern
Social Science Databases The California State University subscribes to a number of social science databases To support Teaching Research Administration Available to campuses by annual subscription
SSRIC Website http://www.ssric.org
Social Science Databases Assistance from the SSRIC Note: SSRIC members are not staff data consultants or technical assistance providers, but we will do what we can to help students, faculty and staff to access data Campus representative: {Name & contact for specific campus} Other council members: {update link to web page w/ SSRIC contact info}
Social Science Data Bases Inter-university Consortium for Political and Social Research (ICPSR) Field data Roper data
ICPSR Primary social science data archive for the U.S. Founded in 1962, CSU joined in 1972 Currently over 500 member institutions worldwide Services Vast data archive accessible via web engine (ICPSR Direct) Summer program Instructional resources
http://www.icpsr.com
ICPSR Data Access Data use tutorial Finding data Social science variables database Special topic archives Online Data Analysis (DAS) SIMI
Using ICPSR Direct Locate the data you want to download Click on Data Access & Analysis Under Search, type in the key words Census
Using ICPSR Direct Locate the data you want to download Select search options: all fields, title, or study/no. Click on search Census
Using ICPSR Direct
Using ICPSR Direct Advanced Search Census
Selecting the Study to Download Select the study you want to download For each study, you will see links to description, downloads, and related literature Click on download
Downloading the Data You will be asked for your email address and your password If you don’t have a password, click on create account and create your password
Downloading the Data You will be asked to complete a brief form the first time you use ICPSR Direct If you forget your password, there is a place to click and it will be emailed to you
Select the files to download under Step 2 Select the files to download under Step 2. by clicking on the box or boxes Select Step 3. Add to data cart Select Step 4. Review cart
If all is OK select the Download Data Cart button
Accessing ICPSR Direct from Off Campus {Only If Set Up Locally} This works just like library data bases such as J-Stor or Lexis-Nexis Academic Universe. Go to: {Insert campus specific library database access URL} Click on “ICPSR,” and log in with your name, ID, and Library PIN.
Resources Data use tutorial http://www.icpsr.com/help/newuser.html
Resources Ilona Einowski’s article on ICPSR Direct http://www.icpsr.umich.edu/org/publications/bulletin/spr03.pdf. Ilona Einowski’s article on ICPSR Direct http://www.icpsr.umich.edu/org/publications/bulletin/spr03.pdf.
Accessing the Field Poll
The Field (California) Poll http://www.field.com Established in 1947 Independent, non-partisan survey of California public opinion Mervin Field Wide range of political and social topics Multiple polls each year
Accessing Field Data News Releases and California Opinion Indexes are text summaries Working with the poll data requires obtaining the poll data sets CSU accesses Field data sets through UCData at U.C. Berkeley
Accessing Field Data Accessing data involves: Setting up your browser Finding polls and data sets Downloading data sets
Setting up your browser Requires setting passive FTP access Optionally set up for folder view or FTP directory view See the handout for details You will only need to do this once for your browser and user account
Accessing UCData for Download Go to the UCData web site (http://ucdata.berkeley.edu) Click the Data link on the left side Click the CSU Only/Field Polls link in the FTP navigation column (left side)
Accessing UCData for Download tuition
For folder view Open the corresponding folder/directory in the Retrieve Data window Open the files by double clicking Point out the different elements on the page. The CSU FTP page will contain include the UC San Diego CALPOLL Index page showing in the center frame, and the Retrieve Data frame showing a list of poll folders/directories.
Downloading the Data Sets For folder view Download the open file by choosing the Save option *You may be able to download without opening by dragging the file icons to a folder on your hard drive.
For FTP directory view Select the folder/directory in the Retrieve Data window. From the Folder Tasks menu select the Copy this folder option. Other Copy options will also work. Point out the different elements on the page. The CSU FTP page will contain include the UC San Diego CALPOLL Index page showing in the center frame, and the Retrieve Data frame showing a list of poll folders/directories.
Finding Data Sets Field News Releases and California Opinion Indexes The American Public Opinion Index UC San Diego CALPOLL Index searchable web site News Releases and California Opinion Indexes provide information about poll questions and results The American Public Opinion Index also contains information about Field Polls in which specific questions were asked Can also search the UC San Diego CALPOLL Index site, accessible through UCData
Finding the Data Sets Find the poll number News Releases American Public Opinion Index The CALPOLL window Open the corresponding folder/directory in the Retrieve Data window
Using Roper Data
Roper Center Home Page http://www.ropercenter.uconn.edu/ Some links might be useful for class discussions on polling and sampling
Searching for Data Sets Point to Quick Links and then to Search for Data Sets
Searching for Data Sets Specify Keywords Date range Country Survey organization religion 1/1/2000 Search for religion from 1/1/2000 to today’s date in the United States
Reviewing the Data Sets Click on the study number USPEW2003-RELIG for more information Get more information for Click on the codebook and look at it If you want to order this data set, write down the survey number and the title
Reviewing the Data Sets To get more information Click on the codebook in the Documentation table and look at it If you want to order this data set, write down the survey number and the title
Ordering the Data Set Coming October 1: RoperExpress Meantime: Email Ed Nelson at ednelson@csufresno.edu Include Study number Title
iPOLL Data base of over 500,000 questions Includes frequency distributions allowing students to do trend analysis online You will not need a user name or password since the CSU will be IP authenticated
Searching iPOLL Search for keyword Click on Boolean or Wildcard for help Let’s search for relig% Select the topic of religion Use date from 1/1/2000 to today’s date
Reviewing iPOLL Results Click on the questions you want to view Click on view selections for the question wording and frequencies
Using the Data for Teaching
Survey Documentation and Analysis (SDA) Program Written at UC Berkeley Used by ICPSR and others-- referred to as DAS (Data Analysis System) Data files must be converted to SDA format before use. ICPSR has converted a number of data sets in their topical archives into SDA format and are converting more.
Sources of Data for SDA at ICPSR (http://www.icpsr.com) Topical Archives sponsored by agencies external to ICPSR Focus on specific subject areas such as aging and crime Provide information on data that ICPSR holds and other data sets external to ICPSR Includes data that can be analyzed online using SDA/DAS
National Archive of Computerized Data on Aging (NACDA) http://www.icpsr.com/NACDA/ Funded by the National Institute on Aging Can search ICPSR’s data holdings. http://www.icpsr.com/NACDA/archive.html Online data analysis using SDA/DAS (http://www.icpsr.com/NACDA/das.html)
National Archive of Criminal Justice Data (NACJD) http://www.icpsr.com/NACJD/ Funded by the Bureau of Justice Statistics and the National Institute of Justice Can search data holdings http://www.icpsr.com/NACJD/archive.html Online data analysis using SDA/DAS http://www.icpsr.com/NACJD/SDA/das.html
Substance Abuse and Mental Health Data Archive (SAMHDA) http://www.icpsr.com/SAMHDA/ Funded by the Office of Applied Studies, Substance Abuse and Mental Health Administration Can search holdings http://www.icpsr.com/SAMHDA/archive.html Online data analysis using SDA/DAS http://www.icpsr.com/SAMHDA/das.html
General Procedure Select study Open window to browse codebook Select what you want to do Click on START
What Can You Do? Browse codebook Subset data Download data and documentation Run statistical procedures
Statistical Procedures Frequencies Crosstabs Comparison of means Correlation matrix Comparison of correlations Multiple regression
What Else Can You Do? Recode (temporarily) Use control variables Use filter variables Use weight variable
Example: Monitoring the Future Note: search for monitoring the future as a phrase Explores values, behavior, and lifestyles of American youth; focuses on drug use 1975 to present Investigators: Lloyd D. Johnson, Jerald G. Bachman, Patrick M. O’Malley, and John Schulenberg Self-administered questionnaire 8th, 10th, and 12th graders Multistage area probability sample Students randomly assigned to one of six questionnaires Core questions -- demographics and drug use
Select Study – 2002 Monitoring the Future You can also find this study by going to the SAMSHA topical archive’s home page: http://www.icpsr.com/SAMHDA/ Click on Online Analysis Click on Monitoring the Future and click again on Online Analysis Select Monitoring the Future 2002 (12th grade survey) (Note: This is ICPSR study number 3753)
Monitoring the Future -- Codebook Open extra codebook window Click on Group Headings to see variables categorized by topic Look for variables listed on next slide Click on variable name (e.g., V150) to see questions and frequencies
Monitoring the Future Variables of Interest Demographics: V150 (sex) V151 (race) V163 (father’s educational level) V164 (mother’s educational level) v148 (age) Educational aspirations: V183 (graduate from college) Recreation: V194 (# of times go out in evening per week) V195 (# of dates) Substance use: V103 to V108 (alcohol) V112 to V114 (Marijuana) V124 to V126 (Cocaine) Religious variables: V169 (attend religious services) V170 (importance of religion)
Running Analyses in SDA on SAMHDA Leave the codebook and go back to the SAMHSA SDA page Select an action by clicking on the run frequencies circle and clicking on start
Teaching Resources Depository http://www.ssric.org/trd
POWERMUTT Politically-Oriented, Web-Enhanced Research Methods for Undergraduates: Topics and Tools
A Brief Introduction to
SPSS Data File Types System Files (*.sps) - run directly in SPSS Portable files – ascii format; platform independent File imports – from Excel, Dbase, etc.
Syntax Files Batch processing of commands – no Graphic User Interface For advanced users, provide a quicker way to process commands