Hsien-Ming Lien Dept of Public Finance, NCCU


Examine the data

1.1 Read the data
  Read the ASCII file
    infile: must provide the variable name, width, and format
  Read the Excel file (saved as comma- or tab-delimited text)
    insheet: variable names need to be specified
  Read the Stata file
    use c:\regstata\elemapi
    or from the internet

Commands: cd dir (change the working directory), use (load a dataset), save (save a dataset)

use http://www.ats.ucla.edu/stat/stata/webbooks/reg/elemapi
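The commands on the slides above can be combined into a short session. This is only a sketch: the working directory comes from the path shown on the slide, while the file names elemapi.csv and elemapi_work are hypothetical and used purely for illustration.

```stata
* Change to the folder that holds the data (path taken from the slide; adjust as needed)
cd "c:\regstata"

* Load the Stata dataset, clearing whatever is currently in memory
use elemapi, clear

* Alternative for a comma-delimited export of a spreadsheet (hypothetical file name)
* insheet using "elemapi.csv", comma clear

* Save a working copy under a new, hypothetical name so the original stays untouched
save elemapi_work, replace
```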

1.2 Describe the data
  describe
    Data size
    Observations
    Variable name
    Variable type (string, byte, float, etc.)

Click OK directly.

Variables
  api00: academic performance of the school
  acs_k3: the average class size in kindergarten through 3rd grade
  meals: the percentage of students receiving free meals
  full: the percentage of teachers who have full teaching credentials
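The menu steps have a command-line equivalent. A minimal sketch, assuming elemapi is already loaded in memory:

```stata
* Whole dataset: data size, number of observations, variable names and storage types
describe

* Only the four variables used in these slides
describe api00 acs_k3 meals full
```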

list
  All observations
  Some observations
  Some variables

Select the variables.

Notice the missing values of meals. 
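The three uses of list can be typed directly as commands. A sketch, again assuming elemapi is in memory; the observation ranges are arbitrary choices for illustration:

```stata
* Some observations: the first five, all variables
list in 1/5

* Some variables: the four of interest, first ten observations
list api00 acs_k3 meals full in 1/10

* How many observations have a missing value for meals
count if missing(meals)
```

Typing list with no arguments prints every observation and every variable, which is usually too much output for a dataset of this size.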

codebook
  Number of values
  Missing values
  Distribution of values

Select the variables, then click OK.
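The same result is available from the command line. A sketch, assuming elemapi is loaded:

```stata
* Unique values, missing values, and a summary of the distribution for each variable
codebook api00 acs_k3 meals full
```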

summarize
  Provides concise information about variables
  Observations
  Basic statistics (mean, s.d., min, max)
  Option: detail

Select the variables, then click OK.
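As commands, the basic and detailed summaries might look like the following sketch (elemapi assumed to be in memory):

```stata
* Observations, mean, standard deviation, min and max
summarize api00 acs_k3 meals full

* The detail option adds percentiles, skewness, kurtosis, and the extreme values
summarize acs_k3, detail
```

The smallest values reported by the detail option are a quick way to spot implausible entries such as a negative class size.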

1.3 Tab the data
  tabulate
    Tabulate the class size (acs_k3)

Look at the school and district numbers of the observations with negative class sizes to check whether they come from the same district.
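A sketch of this check follows. The variable names snum (school number) and dnum (district number) are assumptions about how the identifiers are named in elemapi; substitute the names that describe reports.

```stata
* Frequency table of class size; negative values stand out at the top
tabulate acs_k3

* List the identifiers for the suspicious observations
* (snum and dnum are assumed variable names)
list snum dnum acs_k3 if acs_k3 < 0
```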

1.4 Graph the data
  Use graphs to examine the data
    Histogram
    Stem-and-leaf plot

A stem-and-leaf plot would also have helped to identify these observations.   This plot shows the exact values of the observations, indicating that there were three -21s, two -20s, and one -19.
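Both plots can be produced with one command each. A sketch for the class-size variable, assuming elemapi is loaded:

```stata
* Histogram of class size; the negative values appear as a separate cluster on the left
histogram acs_k3

* Stem-and-leaf plot, which prints the exact values of the observations
stem acs_k3
```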

Quiz 1: do a histogram on full
Quiz 2: do a stem-and-leaf plot on full

Let's look at the frequency distribution of full to see if we can understand this better. The values go from 0.42 to 1.0, then jump to 37 and go up from there. It appears that some of the percentages were actually entered as proportions, e.g., 0.42 was entered instead of 42, or 0.96 instead of 96.

Again, let's see which districts these data came from.

We note that all 104 observations in which full was less than or equal to one came from district 401. Let's count how many observations there are in district 401 using the count command.
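The steps just described could be run as follows. This is a sketch: dnum is an assumed name for the district-number variable.

```stata
* Frequency distribution of full
tabulate full

* Which districts the suspicious values (entered as proportions) come from
* (dnum is an assumed variable name)
tabulate dnum if full <= 1

* Number of observations in district 401
count if dnum == 401
```

If the count equals the number of observations with full at or below one, every school in that district was entered as a proportion.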

Two-way graphs
  Scatterplot: shows the joint distribution of two variables
  Let's look at the scatterplot matrix for the variables:
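A possible command for the scatterplot matrix is sketched here. The choice of these four variables is an assumption based on the variable list given earlier in the slides.

```stata
* Scatterplot matrix; the half option draws only the lower triangle
graph matrix api00 acs_k3 meals full, half

* A single two-way scatterplot for one pair of variables
scatter api00 meals
```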

Correct the variable mistakes
  acs_k3: replace the negative values with positive ones
    replace acs_k3=-acs_k3 if acs_k3<0
  full: change from proportion to percentage
    replace full=full*100 if full<=1

save elemapi, replace
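After saving, the corrections can be checked. A minimal sketch, assuming the two replace commands were run before the save:

```stata
* Reload the corrected file
use elemapi, clear

* No negative class sizes should remain (missing values are left alone)
assert acs_k3 >= 0 if !missing(acs_k3)

* Every non-missing value of full should now be on the percentage scale
assert full > 1 if !missing(full)

* Inspect the new minimum and maximum
summarize acs_k3 full
```

Stata treats missing values as larger than any number, so the conditions acs_k3<0 and full<=1 in the replace commands never touch missing observations.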