Why do we need statistics? Chapter 1. The Research Process.

Slides:



Advertisements
Similar presentations
Jamil A. Malik (PhD) National Institute of Psychology Quaid-e-Azam Univeristy.
Advertisements

Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Independent & Dependent Variables
Introduction to Summary Statistics
CHAPTER 20 Psychological Research and Statistics.
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Statistical Tests Karen H. Hagglund, M.S.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Descriptive Statistics
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Analysis of Research Data
Introduction to Educational Statistics
Data observation and Descriptive Statistics
Central Tendency and Variability
2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo,
Today: Central Tendency & Dispersion
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Lies, damned lies & statistics
Objective To understand measures of central tendency and use them to analyze data.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Chapter 3 Statistical Concepts.
Psychometrics.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
CHAPTER 1 Basic Statistics Statistics in Engineering
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Skewness & Kurtosis: Reference
Chapter 2 Statistical Concepts Robert J. Drummond and Karyn Dayle Jones Assessment Procedures for Counselors and Helping Professionals, 6 th edition Copyright.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Research Methods Chapter 14 pages Measures of Central Tendencies Descriptive statistics that summarise data by identifying a score that suggests.
Why Do We Need Statistics?
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
Chapter Eight: Using Statistics to Answer Questions.
Statistics Without Fear! AP Ψ. An Introduction Statistics-means of organizing/analyzing data Descriptive-organize to communicate Inferential-Determine.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Department of Cognitive Science Michael J. Kalsher PSYC 4310 COGS 6310 MGMT 6969 Unit 1A: Review of Research Process.
LIS 570 Summarising and presenting data - Univariate analysis.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Chapter 12 Understanding Research Results: Description and Correlation
Measurements Statistics
Different Types of Data
Why do we need statistics?
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Central Tendency and Variability
Descriptive Statistics
Introduction to Statistics
Basic Statistical Terms
15.1 The Role of Statistics in the Research Process
Chapter Nine: Using Statistics to Answer Questions
Descriptive Statistics
Why do we need statistics?
PSYCHOLOGY AND STATISTICS
Descriptive and elementary statistics
Presentation transcript:

Why do we need statistics? Chapter 1

The Research Process

Types of Data Analysis Quantitative Methods ◦ Testing theories using numbers Qualitative Methods ◦ Testing theories using language  Magazine articles/Interviews  Conversations  Newspapers  Media broadcasts

Initial Observation Find something that needs explaining ◦ Observe the real world ◦ Read other research Test the concept: collect data ◦ Collect data to see whether your hunch is correct ◦ To do this you need to define variables  Anything that can be measured and can differ across entities or time.

Example Find something that needs explaining: ◦ Favorite colors for males versus females Test the concept: collect data ◦ Please list your favorite color. Variables ◦ Male versus female ◦ Favorite color

The Research Process

Generating and Testing Theories Theories ◦ An hypothesized general principle or set of principles that explain known findings about a topic and from which new hypotheses can be generated. Hypothesis ◦ A prediction from a theory. ◦ E.g. the number of people turning up for a Big Brother audition that have narcissistic personality disorder will be higher than the general level (1%) in the population. Falsification ◦ The act of disproving a theory or hypothesis.

Example Theory ◦ Males and females are different, therefore they like different things. Hypothesis ◦ There will be different favorite colors for males and females ◦ There will be a different number of options for males and females.

The Research Process

Collect Data to Test Your Theory Independent Variable ◦ The proposed cause ◦ A predictor variable ◦ A manipulated variable (in experiments) Dependent Variable ◦ The proposed effect ◦ An outcome variable ◦ Measured not manipulated (in experiments)

Example Independent variable ◦ Gender ** Dependent variable ◦ Most popular option ◦ Number of options

Levels of Measurement Categorical (entities are divided into distinct categories): ◦ Binary variable: There are only two categories  e.g. dead or alive. ◦ Nominal variable: There are more than two categories  e.g. whether someone is an omnivore, vegetarian, vegan, or fruitarian. ◦ Ordinal variable: The same as a nominal variable but the categories have a logical order  e.g. whether people got a fail, a pass, a merit or a distinction in their exam.

Levels of Measurement Continuous (entities get a distinct score): ◦ Interval variable: Equal intervals on the variable represent equal differences in the property being measured  e.g. the difference between 6 and 8 is equivalent to the difference between 13 and 15. ◦ Ratio variable: The same as an interval variable, but the ratios of scores on the scale must also make sense  e.g. a score of 16 on an anxiety scale means that the person is, in reality, twice as anxious as someone scoring 8.

Example IV = Gender = Binary categorical variable ◦ Many research designs require categorical IVs (like ANOVA) ◦ Levels DV ◦ Most popular option = Nominal categorical variable ◦ Range of options = Ratio scale

Measurement Error Measurement error ◦ The discrepancy between the actual value we’re trying to measure, and the number we use to represent that value. Example: ◦ You (in reality) weigh 80 kg. ◦ You stand on your bathroom scales and they say 83 kg. ◦ The measurement error is 3 kg.

Example What might be measurement error for our favorite color example?

Validity Whether an instrument measures what it set out to measure. Content validity ◦ Evidence that the content of a test corresponds to the content of the construct it was designed to cover Ecological validity ◦ Evidence that the results of a study, experiment or test can be applied, and allow inferences, to real-world conditions.

Reliability Reliability ◦ The ability of the measure to produce the same results under the same conditions. Test-Retest Reliability ◦ The ability of a measure to produce consistent results when the same entities are tested at two different points in time.

How to Measure Correlational research: ◦ Observing what naturally goes on in the world without directly interfering with it. Experimental research: ◦ One or more variable is systematically manipulated to see their effect (alone or in combination) on an outcome variable. ◦ Statements can be made about cause and effect.

Example Therefore, our study is somewhere between correlational and experimental. ◦ We didn’t really manipulate anything … ◦ But we are using different groups … ◦ Often called quasi-experimental design.

How to Measure Cross-sectional research: ◦ This term implies that data come from people at different age points with different people representing each age point. Longitudinal research: ◦ Data come from the same people over different time points in their lives.

Experimental Research Methods Cause and Effect (Hume, 1748) 1.Cause and effect must occur close together in time (contiguity); 2.The cause must occur before an effect does; 3.The effect should never occur without the presence of the cause.

Experimental Research Methods Confounding variables: the ‘Tertium Quid’ ◦ A variable (that we may or may not have measured) other than the predictor variables that potentially affects an outcome variable. Ruling out confounds (Mill, 1865) ◦ An effect should be present when the cause is present and that when the cause is absent the effect should be absent also. ◦ Control conditions: the cause is absent.

Example What are some potential confound variables for our color study?

Methods of Data Collection Between-group/Between- subject/independent ◦ Different entities in experimental conditions

Methods of Data Collection Repeated measures (within-subject) ◦ The same entities take part in all experimental conditions. ◦ Economical ◦ Practice effects ◦ Fatigue

Example Our study is between-subjects because each person is only in one group (level).

Types of Variation Systematic Variation (treatment) ◦ Differences in performance created by a specific experimental manipulation. Unsystematic Variation (error) ◦ Differences in performance created by unknown factors.  Age, Gender, IQ, Time of day, Measurement error etc.

Types of Variation So what do you to do control for variation? Randomization ◦ Minimizes unsystematic variation. In a BN design – randomize to groups In a RM design – randomize order of levels ◦ Counterbalancing

The Research Process

Analysing Data: Histograms Frequency Distributions (aka Histograms) ◦ A graph plotting values of observations on the horizontal axis, with a bar showing how many times each value occurred in the data set. The ‘Normal’ Distribution ◦ Bell shaped ◦ Symmetrical around the centre

The Normal Distribution The curve shows the idealized shape.

Properties of Frequency Distributions Skew ◦ The symmetry of the distribution. ◦ Positive skew (scores bunched at low values with the tail pointing to high values). ◦ Negative skew (scores bunched at high values with the tail pointing to low values).

Skew

Properties of Frequency Distributions Kurtosis ◦ The ‘heaviness’ of the tails. ◦ Leptokurtic = heavy tails. ◦ Platykurtic = light tails.

Kurtosis

Kurtosis

Central tendency: The Mode Mode ◦ The most frequent score Bimodal ◦ Having two modes Multimodal ◦ Having several modes

Bimodal and Multimodal Distributions

Central Tendency: The Median The Median is the middle score when scores are ordered:

Central Tendency: The Mean Mean ◦ The sum of scores divided by the number of scores. ◦ Number of friends of 11 Facebook users.

The Dispersion: Range The Range ◦ The smallest score subtracted from the largest ◦ For our Facebook friends data the highest score is 234 and the lowest is 22; therefore the range is: 234 − 22 = 212

Deviance We can calculate the spread of scores by looking at how different each score is from the center of a distribution e.g. the mean:

Example Deviance from the average number of responses for each group ◦ I know this is two numbers but take my word for it mathematically it’s the same for lots of them.

Sum of Squared Errors, SS Indicates the total dispersion, or total deviance of scores from the mean: It’s size is dependent on the number of scores in the data. More useful to work with the average dispersion, known as the variance:

Standard Deviation The variance gives us a measure in units squared. This problem is solved by taking the square root of the variance, which is known as the standard deviation:

Same Mean, Different SD

Slide 48 Important Things to Remember The Sum of Squares, Variance, and Standard Deviation represent the same thing: ◦ The ‘Fit’ of the mean to the data ◦ The variability in the data ◦ How well the mean represents the observed data ◦ Error

The Normal Probability Distribution

Going beyond the data: Z-scores Z-scores ◦ Standardising a score with respect to the other scores in the group. ◦ Expresses a score in terms of how many standard deviations it is away from the mean. ◦ The distribution of z-scores has a mean of 0 and SD = 1.

Properties of z-scores Mean = 0 ◦ This number is incredibly useful, like for serious. Standard deviation = 1

Properties of z-scores 1.96 cuts off the top 2.5% of the distribution. − 1.96 cuts off the bottom 2.5% of the distribution. As such, 95% of z-scores lie between − 1.96 and % of z-scores lie between − 2.58 and 2.58, 99.9% of them lie between − 3.29 and 3.29.

Probability Density function of a Normal Distribution

Reporting Data Dissemination of Research ◦ Findings are presented at conferences and in articles published in scientific journals. Knowing how to report data ◦ Chose a mode of presentation that optimizes the understanding of the data; ◦ If you present three or fewer numbers then try using a sentence; ◦ If you need to present between 4 and 20 numbers consider a table; ◦ If you need to present more than 20 numbers then a graph is often more useful than a table.