EMPA Statistical Analysis

Slides:



Advertisements
Similar presentations
Introduction to Statistics
Advertisements

Chapter 1 The Where, Why, and How of Data Collection
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Sociology 601: Class 1, September Syllabus Course website Objectives Prerequisites Text Homeworks Class time Exams Grading Schedule.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
Chapter 1 DATA AND PROBLEM SOLVING. Section 1.1 GETTING STARTED.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Variables, sampling, and sample size. Overview  Variables  Types of variables  Sampling  Types of samples  Why specific sampling methods are used.
Chap 1-1 Chapter 1 Introduction and Data Collection Business Statistics.
1 M02-Data Collection/Types  Department of ISM, University of Alabama, Lesson Objectives - M02  Understand “Scales of Measurement.”  Understand.
INTRODUCTION TO STATISTICS CHAPTER 1: IMPORTANT TERMS & CONCEPTS.
1 PAUF 610 TA 1 st Discussion. 2 3 Population & Sample Population includes all members of a specified group. (total collection of objects/people studied)
Introduction and Data Collection Basic Business Statistics 10 th Edition.
Lesson 3 Measurement and Scaling. Case: “What is performance?” brandesign.co.za.
2 NURS/HSCI 597 NURSING RESEARCH & DATA ANALYSIS GEORGE MASON UNIVERSITY.
Introduction to Statistics
SAMPLING Purposes Representativeness “Sampling error”
SESSION 1 & 2 Last Update 15 th February 2011 Introduction to Statistics.
Statistics Terminology. What is statistics? The science of conducting studies to collect, organize, summarize, analyze, and draw conclusions from data.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
The rise of statistics Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain understanding from.
Elementary Statistics
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Chapter 1 Introduction and Data Collection
ScWk 298 Quantitative Review Session
Module 9: Choosing the Sampling Strategy
INTRODUCTION AND DEFINITIONS
Statistics in Management
Overview of probability and statistics
Running models and Communicating Statistics
Introduction to Statistics
SAMPLING Purposes Representativeness “Sampling error”
Chapter 1 Introduction to Statistics
Graduate School of Business Leadership
Introduction to Statistics
statistics Specific number
Applied Statistical Analysis
1 Chapter.
Understandable Statistics
Chapter Eight: Quantitative Methods
Chapter 1 Getting Started Understandable Statistics Ninth Edition
The Language of Sampling
Unit 7: Statistics Key Terms
Introduction to Statistics
Chapter 1: Introduction to Statistics
The Nature of Probability and Statistics
Introduction to Statistics
statistics Specific number
The Nature of Probability and Statistics
Definitions Covered Statistics Individual Variable
Quantitative Methods for Business Studies
Chapter 1 The Where, Why, and How of Data Collection
Use your Chapter 1 notes to complete the following warm-up.
Overview of Statistics
Statistics Workshop Tutorial 1
Chapter 1 The Where, Why, and How of Data Collection
6A Types of Data, 6E Measuring the Centre of Data
Introduction to Statistics
1.) Come up with 10 examples of how statistics are used in the real life. Be specific and unique. 2.) Video.
The Nature of Probability and Statistics
Business Statistics: A First Course (3rd Edition)
Gathering and Organizing Data
Displaying Data – Charts & Graphs
Understanding Basic Statistics
Understanding Basic Statistics
The Where, Why, and How of Data Collection
Introduction to Statistics
Chapter 1 The Where, Why, and How of Data Collection
Presentation transcript:

EMPA Statistical Analysis Week 1 Eva Witesman, Ph.D. Romney Institute of Public Management Brigham Young University

Course format Canvas RSS feed for calendar Daily class schedule Readings Participation Application assignments (view first assignment) Final project (examples, template)

Why statistics? How do you do your job better? What information would be useful? What information do you collect? Three main roles for public managers Consumers of data Producers of data Analysts

Types of Questions for Analysis Descriptive analysis: Describing the sample Examples?

Types of Questions for Analysis Inferential analysis: Describing the population based on the sample Extrapolating from a sample to a population Benchmarking Examples?

Types of Questions for Analysis Bivariate and multivariate analysis: Testing relationships between variables Does X cause Y even in the presence of Z? Which of several X factors have the greatest impact on Y? Predicting Y using a variety of X variables Comparing pre-test/post-test values Examples?

Sources/types of data Data you (or others) collect already as part of your administrative processes (administrative data) Data you collect on purpose (survey data, observations, measurements) Data others have collected on purpose (Public or purchased data)

Data Generation Exercise Let’s create some data based on the three principles/one outcome model Take the survey here: https://byu.az1.qualtrics.com/jfe/form/SV_37y1mYB2Xxxq22N

Construct operationalization Construct: The broad idea or concept to be measured (e.g. “academic success”) Definition: The specific manner in which the concept is to be defined (e.g. “cumulative grade point average from undergraduate education”) Operational definition: The specific data source for observing the construct (e.g. “self-reported undergraduate GPA on question 4 of the intake survey”)

Levels of measurement Text (string): The data is text only and each entry is completely unique. Nominal: Each entry is completely unique and without common groupings or shared meaning (usually ID variables, addresses, phone numbers, etc.). Can be text or numeric.

Levels of measurement Categorical: Unordered but finite categories (e.g. MPA emphasis, shirt color, race, region, country of origin, etc.) Binary (dummy/dichotomous): Categorical variable with exactly two categories.

Levels of measurement Ordinal: Ordered categories but with nonstandard units or “bins” of data (e.g. age or income ranges, Likert scales)

Levels of measurement Interval: Data measured in ordered, standard units (e.g. height in inches, age in years, weight in pounds, costs in USD, etc.) Continuous, ratio, count, etc.: Other, more specific, subsets of interval-level data.

Dataset characteristics Unit of analysis: Defines the way in which each row in the database is unique; identifies the type of unit or object being studied. Examples include individuals, households, states, cities, months, state/year, programs, etc. This is the level at which data is collected. What is the unit of analysis for our example data?

Dataset characteristics Observations (observational units): Each row in the database should be a unique observation or observational unit. For example, if you are studying households, each row would contain data about a single household, and each household is an “observation” or “observational unit.” Where/who are the observations in our practice data?

Dataset characteristics Variables: Each column identifies a specific characteristic of, or piece of information about, the observations. Variables have values that are not the same for all observations in the dataset (otherwise they would be “constants” and should be excluded from the analysis). What are the variables in our practice data? What constructs are they operationalizing?

Sampling terminology Population of interest: The population is the large group of observations about which you want to make claims or inferences. This is the group you are ultimately trying to study.

Sampling terminology Sampling frame: This is the representation of the population from which you will draw a sample. The sampling frame is usually a list of contact information or another method of accessing people in the population.

Sampling terminology Sample: This is the subset of the population you have selected out of the sampling frame to examine or gather data about.

Sampling terminology Response rate: This is the percent of sampled people, in a survey, who provided responses to the survey. This is an indication of the level of participation in the study and can be a measure of potential bias. Effective response rate: This is the percent of sampled people, in a survey, who provided complete and useable responses (as determined following data cleaning).

Sampling terminology Random sample: A sample that was taken by randomly (true randomness!) selecting a sample from the sampling frame. This is generally acknowledged to be the least biased method, though stratified random sampling may result in more representative data.

Sampling terminology Nonrandom sample: A sample taken in any way other than random selection. Convenience sampling: Selecting based on convenience to the researcher Snowball sampling: Having study participants suggest other individuals to participate in the study Purposive sampling: Identifying specific targets to be sampled into the study

Sampling terminology Stratified random sampling: Dividing the sampling frame into subsets based on key characteristics and then randomly sampling within the subsets Census Attempting to sample every member of the sampling frame.

Sampling terminology Bias: Anything that may cause your study to fail at being accurately representative of the population of interest.

Installing R Install R, RStudio, and Rcommander Install MPA plugin version 3

Application Identify a data source Clean data Generate a codebook Begin application assignment