The Distribution of Single Variables. Two Types of Variables Continuous variables – Equal intervals of measurement – Known zero-point that is meaningful.

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Describing Quantitative Variables
Statistical Techniques I EXST7005 Start here Measures of Dispersion.
Chi-Square Test. Most of the previous techniques presented so far have been for NUMERICAL data. So, what do we do if the data is CATEGORICAL? Ex: Information.
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
5-Minute Check on Activity 7-2 Click the mouse button or press the Space Bar to display the answers. 1.What were the graphs examined in the last lesson?
Experimental Design & Analysis
Chapter 1 Introduction Individual: objects described by a set of data (people, animals, or things) Variable: Characteristic of an individual. It can take.
Descriptive statistics (Part I)
Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Standard Deviation Interquartile Range (IQR)
GS/PPAL Section N Research Methods and Information Systems A QUANTITATIVE RESEARCH PROJECT - (1)DATA COLLECTION (2)DATA DESCRIPTION (3)DATA ANALYSIS.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Week 3 Topic - Descriptive Procedures Program 3 in course notes Cody & Smith (Chapter 2)
Statistical analyses. SPSS  Statistical analysis program  It is an analytical software recognized by the scientific world (e.g.: the Microsoft Excel.
Distributions & Graphs. Variable Types Discrete (nominal) Discrete (nominal) Sex, race, football numbers Sex, race, football numbers Continuous (interval,
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
1 Chapter 2: Methods for Describing Sets of Data (Page 19-98) Homework:14ab, 36, 43, 45, 51, 56, 64abc, 71, 79, 85, 89, 96.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 1 Exploring Data
Stats 95 Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures.
Data Presentation.
Chapter 9 Producing Descriptive Statistics PROC MEANS; Summarize descriptive statistics for continuous numeric variables. PROC FREQ; Summarize frequency.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Measures of Variation. For discrete variables, the Index of Qualitative Variation.
3-2 Random Variables In an experiment, a measurement is usually denoted by a variable such as X. In a random experiment, a variable whose measured.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Corrected page numbers for optional readings in SAS ® Languages and Procedures Aug 30pp and Sept 6pp Sept 13pp Sept 27pp
T- and Z-Tests for Hypotheses about the Difference between Two Subsamples.
Normal Distribution The Bell Curve. Questions What are the parameters that drive the normal distribution? What does each control? Draw a picture to illustrate.
NCHS Research Data Center Vijay Gambhir Computer Scientist (301) Kenneth W. Harris Acting Director (301)
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
Aim: How do we analyze data with a two-way table?
Announcements First quiz next Monday (Week 3) at 6:15-6:45 Summary:  Recap first lecture: Descriptive statistics – Measures of center and spread  Normal.
I. Introduction to Data and Statistics A. Basic terms and concepts Data set - variable - observation - data value.
Lesson 8 - Topics Creating SAS datasets from procedures Using ODS and data steps to make reports Using PROC RANK Programs in course notes LSB 4:11;5:3.
Describing Association for Discrete Variables. Discrete variables can have one of two different qualities: 1. ordered categories 2. non-ordered categories.
Today: March 7 Data Transformations Rank Tests for Non-Normal data Solutions for Assignment 4.
Introduction to Statistics Measures of Central Tendency and Dispersion.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Customize SAS Output Using ODS Joan Dong. The Output Delivery System (ODS) gives you greater flexibility in generating, storing, and reproducing SAS procedure.
1 Chapter 10: Describing the Data Science is facts; just as houses are made of stones, so is science made of facts; but a pile of stones is not a house.
24 Nov 2007Data Management1 Data Summarization and Exploratory Data Analysis Objective: Describe or Examine Data Sets in Term of Key Characteristics.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
StatisticsStatistics Did you hear about the statistician who put her head in the oven and her feet in the refrigerator? She said, "On average, I feel just.
A QUANTITATIVE RESEARCH PROJECT -
Measurements Statistics
Box Plots & Distributions
제 5장 기술통계 및 추론 PROC MEANS 절차 PROC MEANS <options> ;
Chapter 2: Methods for Describing Data Sets
Descriptive Statistics:
Descriptive Statistics
Descriptive Statistics
Descriptive Statistics
Descriptive Statistics
Welcome!.
Warm-Up (Add to your notes!)
Methods of Acquiring Information
Sit in Groups Lab5_8.31 Effects on Adolescents’ Romantic Ideals
Descriptive Statistics
Math 145 September 5, 2007.
Descriptive Statistics
Math 145 January 24, 2007.
Math 145 May 28, 2009.
Math 341 January 24, 2007.
Biostatistics Lecture (2).
Univariate Description
Presentation transcript:

The Distribution of Single Variables

Two Types of Variables Continuous variables – Equal intervals of measurement – Known zero-point that is meaningful Discrete variables – Simply counts of attributes – Generate category frequencies

For continuous variables, Equal intervals means that the distance between any two adjacent values is identical e.g., the difference between 21 and 22 years of age is identical in years to the difference between 33 and 34 years of age Meaningful zero-point means it makes sense e.g., a GPA of 0.00 means you have completed no coursework successfully

For discrete variables, All we can do is count the number of observations that fall into its various categories e.g., the number of males and females in this class

Using SAS to Produce Statistics for Single Variables Using SAS to Produce Statistics for Single Variables libname mydata 'a:\'; libname library 'a:\'; options ps=66 nodate nonumber; proc freq data=mydata.cities; table citysize; title1 'One-Way Frequency Distribution'; title2; title3 'PPD 404'; run;

One-Way Frequency Distribution PPD 404 SIZE OF CITY, DICHOTOMY Cumulative Cumulative CITYSIZE Frequency Percent Frequency Percent Small Large

libname mydata 'a:\'; libname library 'a:\'; options ps=66 nodate nonumber; proc chart data=mydata.cities; vbar spending / discrete; title1 'Vertical Bar Chart'; run;

Vertical Bar Chart Frequency 23 + ***** | ***** 22 + ***** ***** | ***** ***** 21 + ***** ***** | ***** ***** 20 + ***** ***** | ***** ***** 19 + ***** ***** | ***** ***** 18 + ***** ***** ***** | ***** ***** ***** 17 + ***** ***** ***** | ***** ***** ***** 16 + ***** ***** ***** | ***** ***** ***** 15 + ***** ***** ***** | ***** ***** ***** 14 + ***** ***** ***** | ***** ***** ***** 13 + ***** ***** ***** | ***** ***** ***** 12 + ***** ***** ***** | ***** ***** ***** 11 + ***** ***** ***** | ***** ***** ***** 10 + ***** ***** ***** | ***** ***** ***** 9 + ***** ***** ***** | ***** ***** ***** 8 + ***** ***** ***** | ***** ***** ***** 7 + ***** ***** ***** | ***** ***** ***** 6 + ***** ***** ***** | ***** ***** ***** 5 + ***** ***** ***** | ***** ***** ***** 4 + ***** ***** ***** | ***** ***** ***** 3 + ***** ***** ***** | ***** ***** ***** 2 + ***** ***** ***** | ***** ***** ***** 1 + ***** ***** ***** | ***** ***** ***** Low Medium High POLICE EXPENDITURE, TRICHOTOMY

libname mydata 'a:\'; libname library 'a:\'; options ps=66 nodate nonumber; proc univariate data=mydata.cities plot normal; var populat; title1 'Univariate and EDA Statistics'; run;

Univariate and EDA Statistics PPD 404 Univariate Procedure Variable=POPULAT NUMBER OF RESIDENTS, IN 1,000S Moments N 63 Sum Wgts 63 Mean Sum Std Dev Variance Skewness Kurtosis USS CSS CV Std Mean T:Mean= Pr>|T| Num ^= 0 63 Num > 0 63 M(Sign) 31.5 Pr>=|M| Sgn Rank 1008 Pr>=|S| W:Normal Pr<W

Quantiles(Def=5) 100% Max % % Q % % Med % % Q % 72 0% Min 56 5% 60 1% 56 Range 7840 Q3-Q1 541 Mode 56 Extremes Lowest Obs Highest Obs 56( 30) 1511( 56) 56( 24) 1949( 55) 58( 46) 2816( 54) 60( 21) 3367( 53) 65( 51) 7896( 52)

Univariate and EDA Statistics PPD 404 Stem Leaf # Boxplot * * | *-----* Multiply Stem.Leaf by 10**+3

Univariate and EDA Statistics PPD 404 Univariate Procedure Variable=POPULAT NUMBER OF RESIDENTS, IN 1,000S Normal Probability Plot * | | | | | | * ++++ | *++++ | ** | * | ++++********** 250+ * * *** ******************

Exercise 2 Identify which of the following are variables and which are constants. a. Size of cities b. Denver c. Gender status d. Computer literacy e. Male f. College graduate g. Political party preference h. Grades on an examination

Answers Identify which of the following are variables and which are constants. V a. Size of cities C b. Denver V c. Gender status V d. Computer literacy C e. Male C f. College graduate V g. Political party preference V h. Grades on an examination

Identify which of the following are discrete variables and which are continuous variables. a. Region of the country: North, South, etc. b. TV viewing: number of hours per week c. Agency size: number of full-time employees d. Crime rate: serious crimes per 1,000 population e. Your hometown: Pasadena, Newport Beach, etc. f. Political conservatism: percent voting Republican g. Contest results: first place, second place, etc. h. Opinion: five-point scale from "Strongly Agree" to "Strongly Disagree"

Answers Identify which of the following are discrete variables and which are continuous variables. Da. Region of the country: North, South, etc. Cb. TV viewing: number of hours per week Cc. Agency size: number of full-time employees Cd. Crime rate: serious crimes per 1,000 population De. Your hometown: Pasadena, Newport Beach, etc. Cf. Political conservatism: percent voting Republican C or Dg. Contest results: first place, second place, etc. C or Dh. Opinion: five-point scale from "Strongly Agree" to "Strongly Disagree"