15b. Accessing Data: Frequencies in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.

Slides:



Advertisements
Similar presentations
4. NLTS2 Data Sources: Parent and Youth Surveys. 4. Sources: Parent and Youth Surveys Prerequisites Recommended modules to complete before viewing this.
Advertisements

Chapter 4 Joining Multiple Tables
Prerequisites Recommended modules to complete before viewing this module 1. Introduction to the NLTS2 Training Modules 2. NLTS2 Study Overview 3. NLTS2.
12. NLTS2 Documentation: Quick References. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
13.Analysis Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data.
11. NLTS2 Documentation: Data Dictionaries. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2.
10. NLTS2 Documentation Overview. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training Modules.
16b. Accessing Data: Means in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
9. Weighting and Weighted Standard Errors. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
19.Multivariate Analysis Using NLTS2 Data. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Simple Logistic Regression
Bivariate Analysis Cross-tabulation and chi-square.
7.Implications for Analysis: Parent/Youth Survey Data.
Recap of basic SPSS and statistics 5 th - 9 th December 2011, Rome.
Funded through the ESRC’s Researcher Development Initiative Department of Education, University of Oxford Session 3.3: Inter-rater reliability.
SAS Programming: Working With Variables. Data Step Manipulations New variables should be created during a Data step Existing variables should be manipulated.
17a.Accessing Data: Manipulating Variables in SPSS ®
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
Categorical Data Analysis: Stratified Analyses, Matching, and Agreement Statistics Biostatistics March 2007 Carla Talarico.
15a.Accessing Data: Frequencies in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
PROC FREQ 1SHRUG November 28, What good is Proc FREQ It Counts! Answers question how many Display data (error checks), descriptive Analyze categorical.
Data Management: Quantifying Data & Planning Your Analysis
Problem 1: Relationship between Two Variables-1 (1)
Introduction to SPSS (For SPSS Version 16.0)
SW388R6 Data Analysis and Computers I Slide 1 Chi-square Test of Goodness-of-Fit Key Points for the Statistical Test Sample Homework Problem Solving the.
The Math Studies Project for Internal Assessment A good project should be able to be followed by a non-mathematician and be self explanatory all the way.
Introduction to SAS Essentials Mastering SAS for Data Analytics
SAS PROC REPORT PROC TABULATE
Chapter 9 Producing Descriptive Statistics PROC MEANS; Summarize descriptive statistics for continuous numeric variables. PROC FREQ; Summarize frequency.
Introduction to SAS Essentials Mastering SAS for Data Analytics Alan Elliott and Wayne Woodward SAS ESSENTIALS -- Elliott & Woodward1.
6. Implications for Analysis: Data Content. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2.
Data Analysis Using SPSS
8.Implications for Analysis: School Survey, Student Assessment, and Transcript Data.
Introduction to SAS BIO 226 – Spring Outline Windows and common rules Getting the data –The PRINT and CONTENT Procedures Manipulating the data.
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
2. NLTS2 Study Overview. 1 Prerequisites Recommended module to complete before viewing this module  1. Introduction to the NLTS2 Training Modules.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Knowing Understanding the Basics Writing your own code SAS Lab.
1.Introduction to the NLTS2 Training Modules Jose Blackorby Renee Cameto Camille Marder Christopher Sanford Kathryn Valdes James Van Campen SRI International.
SW318 Social Work Statistics Slide 1 Compare Central Tendency & Variability Group comparison of central tendency? Measurement Level? Badly Skewed? MedianMeanMedian.
Using Weighted Data Donald Miller Population Research Institute 812 Oswald Tower, December 2008.
WINKS 7 Tutorial 7 – Advanced Topic: Labels and Formats Permission granted for use for instruction and for personal use. © Alan C. Elliott,
CADA Final Review Assessment –Continuous assessment (10%) –Mini-project (20%) –Mid-test (20%) –Final Examination (50%) 40% from Part 1 & 2 60% from Part.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Priya Ramaswami Janssen R&D US. Advantages of PROC REPORT -Very powerful -Perform lists, subsets, statistics, computations, formatting within one procedure.
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
Stat1510: Statistical Thinking and Concepts Two Way Tables.
RESEARCH METHODS IN TOURISM Nicos Rodosthenous PhD 25/04/ /4/20131Dr Nicos Rodosthenous.
1 EPIB 698C Lecture 4 Raul Cruz-Cano Summer 2012.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
14b. Accessing Data Files in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
SW388R6 Data Analysis and Computers I Slide 1 Comparing Central Tendency and Variability across Groups Impact of Missing Data on Group Comparisons Sample.
16a. Accessing Data: Means in SPSS ®. 16a. Accessing Data: Means in SSPS ® 1 Prerequisites Recommended modules to complete before viewing this module.
1 Chapter 3: Getting Started with Tasks 3.1 Introduction to Task Dialogs 3.2 Creating a Listing Report 3.3 Creating a Frequency Report 3.4 Creating a Two-Way.
Multiple Imputation using SAS Don Miller 812 Oswald Tower
17b.Accessing Data: Manipulating Variables in SAS ®
Education 793 Class Notes Cross-tabulations Presentation 3.
Using Data from the National Survey of Children with Special Health Care Needs Centers for Disease Control and Prevention National Center for Health Statistics.
1 EPIB 698C Lecture 1 Instructor: Raul Cruz-Cano
Based on Learning SAS by Example: A Programmer’s Guide Chapters 1 & 2
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 16 & 17 By Tasha Chapman, Oregon Health Authority.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 5 & 6 By Ravi Mandal.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 3 & 4 By Tasha Chapman, Oregon Health Authority.
Introduction to SPSS.
ECONOMETRICS ii – spring 2018
Producing Descriptive Statistics
Week 3 Lecture Notes PSYC2021: Winter 2019.
Presentation transcript:

15b. Accessing Data: Frequencies in SAS ®

1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training Modules  2. NLTS2 Study Overview  3. NLTS2 Study Design and Sampling  NLTS2 Data Sources, either 4. Parent and Youth Surveys or 5. School Surveys, Student Assessments, and Transcripts  9. Weighting and Weighted Standard Errors

15b. Accessing Data: Frequencies in SAS ® 2 Prerequisites Recommended modules to complete before viewing this module (cont.)  NLTS2 Documentation 11. Overview 12. Data Dictionaries 13. Quick References  14b. Accessing Data Files in SAS

15b. Accessing Data: Frequencies in SAS ® 3 Overview  Purpose  Exploring existing data Frequencies Crosstabs  Missing values options  Weights  Closing  Important information

15b. Accessing Data: Frequencies in SAS ® 4 NLTS2 restricted-use data NLTS2 data are restricted. Data used in these presentations are from a randomly selected subset of the restricted-use NLTS2 data. Results in these presentations cannot be replicated with the NLTS2 data licensed by NCES.

15b. Accessing Data: Frequencies in SAS ® 5 Purpose Learn to  Run simple statistical procedures Frequency distributions Crosstabulations  Watch for Missing values n’s Weighted vs. unweighted data

15b. Accessing Data: Frequencies in SAS ® 6 Frequencies How to run a frequency Frequencies are usually run on categorical or ordinal variables as opposed to continuous variables. Missing values are excluded in the percentages by default. Suggestion: As a preliminary step, look at variable in a PROC CONTENTS Syntax PROC FREQ data = sasdb.n2w2paryouth; TABLES W2_GendHdr2003; run ;  Multiple variables can be listed on the TABLES statement. PROC FREQ data = sasdb.n2w2paryouth ; TABLES w2_GendHdr2003 w2_IncomeHdr2003 ; run ; These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 7 Frequencies: Example PROC FREQ data = sasdb.n2w2paryouth; TABLES W2_GendHdr2003; run ; These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 8 Missing values options Missing values are excluded in the percentages unless an option to include missing values is specified. In this example, the number of missing values is not calculated in the percentages, but the number of missing values is footnoted. PROC FREQ data = sasdb.n2w2paryouth ; TABLES W2_IncomeHdr2003 ; These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 9 Missing values options “MISSING” option on TABLES statement  A “MISSING” option on the TABLES statement includes missing values in the percentages. PROC FREQ data = sasdb.n2w2paryouth ; TABLES W2_IncomeHdr2003/MISSING ;

15b. Accessing Data: Frequencies in SAS ® 10 Missing values options “MISSPRINT” option on TABLES statement  A MISSPRINT option lists missing values but does not include them in the percentages. PROC FREQ data = sasdb.n2w2paryouth ; TABLES W2_IncomeHdr2003/MISSPRINT ;

15b. Accessing Data: Frequencies in SAS ® 11 Frequencies: Example Frequencies  Use the NLTS2 Wave 1 parent interview file.  Run a frequency distribution on np1B5b and np1G5a.  What percentage had a little trouble communicating? Are the percentages evenly distributed?  What percentage of youth never fixed their own breakfast? These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 12 Frequencies: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 13 Frequencies: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 14 Crosstabs How to run a crosstab. Frequency distributions show a breakdown of a single variable for all respondents in the file. Crosstabs produce separate frequency distributions for subgroups.  For example, compare percentages for boys versus girls or other demographic groups such as income categories, race/ethnicity, age, grade level, etc. The comparison, or by-variable, must be categorical or ordinal. Note: The table statement order is  [page variable] * [row variable] * [column variable] Column percentages add up to 100% in each column. Row percentages add up to 100% in each row. Options on the table statement control the crosstabulation output statistics. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 15 Crosstabs Syntax PROC FREQ data = sasdb.n2w1parent ; TABLES np1G5a * w1_Gendhdr2001 ; run ; TABLES statement options can control which percentages are included in the table. /* just cell counts and column percents */ TABLES np1G5a * w1_Gendhdr2001 /nopercent norow ; /* just cell counts and row percents */ TABLES np1G5a * w1_Gendhdr2001 /nopercent nocol ; Suggestion: Choose the format that works best for you, but be consistent. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 16 Crosstabs Column percent: 26.3% of males and 29.2 % of females always fix their own breakfast. Row percent: 62.5% who always fix their own breakfast are male and 37.5% are female. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 17 Crosstabs Column percentages only Row percentages only These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 18 Crosstabs: Example Crosstabs  Use the Wave 1 teacher survey file.  Run a crosstab of nts1D4a BY w1_dis12.  Are the results what you expected to see?  What is the percentage in the Total column for “Strongly agree”? Would you report this percentage? These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 19 Crosstabs: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 20 Crosstabs: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 21 Crosstabs: Example With the large number of categories for disability, the previous table was split.  An option is to reverse the table order and request row percentages. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 22 Crosstabs: Example Overall, 12.7% strongly agree.  Would you report this? These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 23 Weights How to weight data. Running procedures unweighted is useful for exploratory analysis and recommended for becoming familiar with the data.  It is important to review the unweighted n’s. However, unweighted results must never be reported. The procedures we used to calculate frequencies and crosstabs can be run with weights.  The weighted percentages will be correct.  The weighted standard errors will not be correct using these procedures; do not report them. Weighting in SAS requires a WEIGHT statement.  The procedures themselves do not change.  Weights are applied when there is a WEIGHT statement. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 24 Weights Syntax PROC FREQ data = sasdb.n2w1tchr ; WEIGHT wt_nts1 ; TABLES w1_dis12 * nts1D4a /nopercent nocol ; To turn the weight off, omit the statement or comment it out. PROC FREQ data = sasdb.n2w1tchr ; * WEIGHT wt_nts1 ; TABLES w1_dis12* nts1D4a /nopercent nocol ; run ; These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 25 Weights: Example Weights: Run the earlier examples with a weight.  Run the frequency example weighted with np1Weight. NLTS2 W1 parent interview weight np1Weight np1B5b and np1G5a  Run crosstab example weighted with wt_nts1. W1 Teacher weight wt_nts1 nts1D4a BY w1_Dis12 These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 26 Weighted example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 27 Unweighted example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 28 Weighted example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 29 Unweighted example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

15b. Accessing Data: Frequencies in SAS ® 30 Closing Topics discussed in this module  Exploring existing data Frequencies Crosstabs  Missing values options  Weights Next module  16b. Accessing Data: Means in SAS

15b. Accessing Data: Frequencies in SAS ® 31 Important information  NLTS2 website contains reports, data tables, and other project-related information  Information about obtaining the NLTS2 database and documentation can be found on the NCES website  General information about restricted data licenses can be found on the NCES website  address: