Initial Data Analysis DISTINCTIONS. Some Distinctions Population vs. Sample Descriptive vs. Inferential stats Variables Types of data  Quantitative versus.

Slides:



Advertisements
Similar presentations
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Advertisements

Introduction to Statistics
Reliability and Validity
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Introduction to Statistics Quantitative Methods in HPELS 440:210.
Chapter 1 A First Look at Statistics and Data Collection.
Introduction to Statistics and Research
Introduction to Quantitative Research
QUANTITATIVE DATA ANALYSIS
Intro to Statistics for the Behavioral Sciences PSYC 1900
47.269: Research I: The Basics Dr. Leonard Spring 2010
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Measurement and Measurement Scales Measurement is the foundation of any scientific investigation Everything we do begins with the measurement of whatever.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Understanding Research Results
Chapter 1: Introduction to Statistics
AP Statistics Overview and Basic Vocabulary. Key Ideas The Meaning of Statistics Quantitative vs. Qualitative Data Descriptive vs. Inferential Statistics.
Statistical Analyses & Threats to Validity
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
CHAPTER 4 Research in Psychology: Methods & Design
Chapter 1: Introduction to Statistics
Jargon & Basic Concepts Howell Statistical Methods for Psychology.
Introduction to Statistics What is Statistics? : Statistics is the sciences of conducting studies to collect, organize, summarize, analyze, and draw conclusions.
Biostatistics Ibrahim Altubasi, PT, PhD The University of Jordan.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Biostatistics-short course Introduction Anwar Ahmad.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
1.What is this graph trying to tell you? 2.Do you see anything misleading, unclear, etc.? 3.What is done well?
Introduction. The Role of Statistics in Science Research can be qualitative or quantitative Research can be qualitative or quantitative Where the research.
Associate Professor Arthur Dryver, PhD School of Business Administration, NIDA url:
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
The What and the Why of Statistics The Research Process Asking a Research Question The Role of Theory Formulating the Hypotheses –Independent & Dependent.
Chapter 1: The What and the Why of Statistics  The Research Process  Asking a Research Question  The Role of Theory  Formulating the Hypotheses  Independent.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 3: The Foundations of Research 1.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
An Introduction to Statistics and Research Design
Unit 1 – Intro to Statistics Terminology Sampling and Bias Experimental versus Observational Studies Experimental Design.
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Introduction to Statistics. What’s it all about?  Why is statistical analysis important?  What all do we do with statistics?  What are the problems/limitations.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Introduction To Statistics. Statistics, Science, ad Observations What are statistics? What are statistics? The term statistics refers to a set of mathematical.
Statistics AP Psychology 2010 J. Mulder. Why are statistics important? “Proof is virtually impossible for psychology researchers to attain because controlling.
Introduction. The Role of Statistics in Science Research can be qualitative or quantitative Research can be qualitative or quantitative Where the research.
Lecture 1 Stat Applications, Types of Data And Statistical Inference.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Introduction to Statistics Chapter 1. § 1.1 An Overview of Statistics.
Introduction To Statistics
RESEARCH METHODS IN INDUSTRIAL PSYCHOLOGY & ORGANIZATION Pertemuan Matakuliah: D Sosiologi dan Psikologi Industri Tahun: Sep-2009.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
2 NURS/HSCI 597 NURSING RESEARCH & DATA ANALYSIS GEORGE MASON UNIVERSITY.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Psy B07 Chapter 1Slide 1 BASIC CONCEPTS. Psy B07 Chapter 1Slide 2  Population  Random Sampling  Random Assignment  Variables  What do we do with.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Introduction to Quantitative Research
Pharmaceutical Statistics
Introduction to Statistics and Research
Jargon & Basic Concepts
Basic Statistical Terms
The Nature of Probability and Statistics
Presentation transcript:

Initial Data Analysis DISTINCTIONS

Some Distinctions Population vs. Sample Descriptive vs. Inferential stats Variables Types of data  Quantitative versus Categorical  Measurement scales

Population The entire collection of events that you are interested in generalizing to.  For example, our population could be the students in this class, UNT students, all students in U.S., people in general. Although we wish to make claims about the entire population, it is often too large to deal with, and so we will take a portion of it to study.  Random sampling

Random Sampling Choose a subset (sample) of the population ensuring that each member of the population has an equivalent chance of being sampled. Examine that sample and use your observations to draw inferences about the population.  Example : Voting polls, television ratings, rolling a die.

Random Sampling Note, however, that the inferences drawn are only as good as the representativeness of the sample. If the sample is not random, it may not be representative of the population. When a sample is not representative of its parent population, the external validity of any inference is called into question i.e. how well will we be able to generalize?  Example : Most psychology studies and freshman psych students.

Random Assignment When studying the effects of some treatment variable in an experimental fashion, it is also important to randomly assign subjects to treatments.  Control 1 vs. Experimental group  Oftentimes we want to look at the effects of some treatment e.g. a drug, teaching strategy, memory technique etc.  To study the effects of the treatment we’ll often give one or more groups the treatment and one group no treatment and then compare the groups Random assignment reduces the likelihood that groups differ in some critical way other than the treatment since everyone has an equal chance to be put in one of the treatment groups.

Random Assignment If random assignment is not used then the internal validity of the experimental results may be compromised i.e. are our results due to the treatment we’ve imposed or something else?  Example: don’t randomly assign male/females to receive treatment -> effects seen due to gender rather than treatment

Variables Assume we have a random sample of subjects that we have randomly assigned to treatment groups.  Example: Stop-smoking study. Now we must select the variables we wish to study, with the term variable referring to a property of an object or event that can take on different values.  Example: # of cigs smoked, abstinence after one week (yes or no). Note the distinction; # of cigarettes smoked is a continuous variable, whereas abstinence is a categorical variable. A variable is to be contrasted with a constant, that which only takes on one value. 1

Types of Data Measurement (quantitative, magnitude) Data  Continuous vs. Discrete  Example: GPA during college vs. GPA for class  Example: 9 point “Likert” scale- continuous or discrete? 20 point? Categorical (frequency, nominal, qualitative) Data  Named data e.g. different brands, political party, race, gender How you think about your data and what scale of measurement your variables are is very important. What you decide about the variable will have a say on the analyses available, and even possibly even have vast effect on the theory itself.  Early developmental theories suggested clear cut stages which imply categories of development

A Note about Categorical Variables Consider the grouping variable in which people are classified as In-patient, Out-patient, and Control groups. There is only one variable in a theoretical sense, and our goal is to determine the relationship between the grouping variable and the outcome, and an ANOVA table in this sense would speak to the overall effect group membership has on the outcome. Statistically speaking there are actually two coded variables if we are applying the general linear model  See dummy coding, effects coding etc. 1 What you want to keep in mind is that a single group has no relationship with the outcome, as membership in a single group is a constant.

Variables Another distinction related to variables concerns those that we are interested in understanding and explaining (dependent, criterion, outcome variables) versus those we expect to have an effect (perhaps causal) on that outcome, and which we may be able to manipulate or have control over experimentally (independent, predictor variables).  Which one is predicting and which is being predicted? Both predictor and dependent variables can be quantitative or categorical  Example: Whether or not we give a subject the stop-smoking treatment would be the independent variable, and the # of cigarettes smoked would be a dependent variable.  Other examples: age:income, shoe size:intelligence, gender:hostility, intelligence:voting outcome

Parameters and Statistics Parameters are simply values associated with the population and as such are often inferred rather than actually known  They are designated with Greek notation e.g.  for the population mean “Statistics” in this sense speak specifically to the data set at hand (the sample) and make no reference to values outside the sample. Using statistics we have collected we will then infer the population values (parameters) e.g. use the sample mean to infer  Most commonly employed methods assume a fixed population parameter

What Do We Do With The Data? Descriptive statistics are used to describe the data set itself without reference to the population from which it is derived.  Examples: graphing, calculating, averages, looking for extreme scores. Exploratory/Initial data analysis (Tukey, Chatfield, others) typically relies on descriptive information most

What Do We Do With The Data? Inferential statistics allow you to infer something about the parameters of the population based on the statistics of the sample, and the various tests we perform on the sample.  Examples: Chi-Square, T-Tests, Correlations, ANOVA

Measurement Scales Nominal  category labels assigned in some meaningful way (e.g. gender, political party) Ordinal  orders or ranks objects on some continuum (e.g. military ranks) Interval  Can speak of differences between scale points, arbitrary zero point (Fahrenheit scale- 30°-20°=20°-10°, but 20°/10° is not twice as hot!)  Probably most common in psych Ratio  Same as interval but with true zero point (distance, weight, Kelvin- physical measurements). Ratios are interval scales too but not the other way around.

Scales There is much debate with regard to scale distinction and how to deal with different data types.  Even some types of data seem to qualify as more than one type. Although some analyses will result in the same outcome whatever you want to call your data, which analysis you perform may be affected by what you see the underlying construct to be, and so it is important that you give it some thought.

Decision tree

Decision Trees While decision trees might be helpful, they are, at best, a suggestion, and should never be used as a definitive statement on what analysis to do Example: political party (republican vs. democrats) and 3 ‘continuous’ attitude measures (gun control, abortion, Iraq)  Simple enough right? Possible analyses  Purely descriptive assessment  Simple correlations  3 t-tests  Classical, Non-parametric, or Robust? Bootstrap, Mann-Whitney, Differences on M-estimators rather than means etc.  MANOVA  Differences on the linear combination of continuous  Discriminant Function Analysis or Logistic regression  Predicting party based on attitude (classification)  Factor analysis on continuous, t-test on factor scores

Analyses The point is you will always have an option for how to both understand and describe, as well as analyze the data What you must do is work this out BEFORE collecting the data While you will still have the options on how to analyze the data in the end, those options should be known before anything is collected To collect without an analysis in mind that will test your theoretical model is to waste one’s time, and suggests a lack of truly understanding just what your theory/hypothesis is