Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Chapter 1 Displaying the Order in a Group of Numbers
Basic Statistical Concepts
QUANTITATIVE DATA ANALYSIS
Scales of Measurement S1-1. Scales of Measurement: important for selecting stat's (later on) 1. Nominal Scale: number is really a name! 1 = male 2 = female.
Descriptive Statistics
Analysis of Research Data
Introduction to Educational Statistics
Edpsy 511 Homework 1: Due 2/6.
Data observation and Descriptive Statistics
Central Tendency and Variability
Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories Ordinal measurement Involves sorting objects.
Chapter 3: Central Tendency
Measures of Central Tendency
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Today: Central Tendency & Dispersion
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
Psychometrics.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 16 Descriptive Statistics.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Types of data and how to present them 47:269: Research Methods I Dr. Leonard March 31, :269: Research Methods I Dr. Leonard March 31, 2010.
Chapters 1 & 2 Displaying Order; Central Tendency & Variability Thurs. Aug 21, 2014.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Introduction to Descriptive Statistics Objectives: 1.Explain the general role of statistics in assessment & evaluation 2.Explain three methods for describing.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
Skewness & Kurtosis: Reference
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
INVESTIGATION 1.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Introduction to Statistics Santosh Kumar Director (iCISA)
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Statistics for Psychology CHAPTER SIXTH EDITION Statistics for Psychology, Sixth Edition Arthur Aron | Elliot J. Coups | Elaine N. Aron Copyright © 2013.
Central Tendency & Dispersion
Chapter Eight: Using Statistics to Answer Questions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Introduction to Statistics. Objectives: Understand certain statistical concepts & terminology Describe types of measurement scales Differentiate between.
Chapter 2 Describing and Presenting a Distribution of Scores.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Descriptive Statistics(Summary and Variability measures)
Descriptive Statistics Printing information at: Class website:
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Descriptive Statistics
Central Tendency and Variability
Description of Data (Summary and Variability measures)
An Introduction to Statistics
Chapter 1 Displaying the Order in a Group of Numbers.
Introduction to Statistics
Basic Statistical Terms
Chapter Nine: Using Statistics to Answer Questions
The Mean Variance Standard Deviation and Z-Scores
Presentation transcript:

Statistics and Research methods Wiskunde voor HMI Betsy van Dijk

Introduction  Statistics is about – Systematically studying phenomena in which we are interested – Quantifying variables in order to use mathematical techniques – Summarizing these quantities in order to describe and make inferences – Using these descriptions and inferences to make decisions or understand

The Two Branches of Statistical Methods  Descriptive statistics (beschrijvende statistiek) – Used to summarize, organize and simplify data  Inferential statistics (toetsende statistiek) – Draw conclusions/make inferences that go beyond the numbers from a research study – Techniques that allow us to study samples and then make generalizations about the populations from which they were selected

Descriptive Statistics  Numbers that describe the characteristics of a particular data set – “The average age in the class is 27 years” – “The range of ages in class is 22 years, from a minimum of 20 to a maximum of 42”

Inferential Statistics  Descriptive statistics from a sample that are used to make inferences about the characteristics of a population. – “The average age of people taking Research Statistics is 27 years.” People taking Research Statistics A sample of people taking Research Statistics a “parameter”

Basic Concepts - Variables  Things that change – Environmental events or conditions – Personal characteristics or attributes – Behaviors  Anything that takes on different values in different situations (even just through time)

Basic Concepts  Value – A possible number or category that a score can have  Score – A particular person’s value on a variable  Data – Scores or measurements of phenomena, behaviors, characteristics, etc.  A Statistic – A number that summarizes a set of data in some way

Populations and Samples  Population – Set of all the individuals of interest in a population study  Sample – Set of individuals selected from the population  Sampling error – Discrepancy, or amount of error that exists between the sample statistic and population parameter

Measurement  Measurement is the process of assigning numbers to variables following a set of rules  There are different levels of measurement – Nominal – Ordinal – Interval – Ratio

Nominal Measurement  Places data in categories  Non-quantitative (e.g. qualitative), even though there might be numbers involved  Nominal (categorical) variables  Examples – Male/Female M,F (0,1) – Voting precinct Alucha, Dade, Palm Beach (023, 095, 167)

Ordinal Measurement  Places data in order  Quantitative as far as ranking goes  Rank-order (ordinal) variables  Distance between values varies  Examples – First, second, third  (1,2,3) (2.7, 2.8, 7.6) – Young, Middle Age, Old – Very Good, Good, Intermediate, Bad, Very Bad  (1,2,3,4,5)

Interval Measurement  Has all the characteristics of ordinal data  Additionally, the differences between values represents a specific amount of whatever is being measured (equal intervals represent equal amounts)  Examples – Temperature (the difference between 20C and 40C is the same as 60C and 80C, but 0 is not the absence of temperature)  Note: Many rating scales are treated like interval measurements

Ratio Measurement  Has all the characteristics of interval data  Additionally, has a true zero which represents the absence of whatever is being measured  Examples – Time (e.g. reaction time) – Distance  The zero point allows you to make statements about ratios (e.g. 100 feet is twice as far as 50 feet)

A Few More Things  Continuous variables – Take on an infinite number of values between two measured levels (e.g. time measurements)  Discrete variables – Have no intermediate values (e.g. number of people in class)

Math Warm-Up  Order of operations – Parentheses, exponents, multiplication/division, addition/subtraction – PEMDAS, or “please excuse my dear aunt sally” – Summation using the summation statistic before other addition/substraction  Proportion – Some portion of some total amount – Expressed by a fraction or a decimal – To calculate, divide the portion by the total amount  Percentage – A proportion that is scaled to be out of 100 (instead of some other total amount) – To calculate, first calculate the proportion, then multiply by 100  Mathematical operators – Exponents, square roots, parentheses, summation, indexing

Math Warm-Up Practice problems

Frequency Tables  Used to summarize data  Steps in making a frequency table 1. Make a list of each possible value 2. Count up the number of scores with each value 3. Make a table  Frequency table shows how often each value occurs

A Frequency Table

Histogram -- Stress-rating Data

Grouped Frequency Table  A frequency table that uses intervals

Frequency Graphs  Histogram

Frequency Graphs  Frequency polygon

Shapes of Frequency Distributions  Unimodal, bimodal, and rectangular

Shapes of Frequency Distributions Unimodal – there is a single most frequent value or “peak” Bimodal – there are two most-frequent values or peaks Rectangular – there is no peak; all values are about equally frequent

Shapes of Frequency Distributions  Symmetrical and skewed distributions

Shapes of Frequency Distributions Symmetrical – left and right halves of the distribution have approximately the same shape Skewed – left and right halves of the distribution do not have the same shape “skew” is towards the side with the fewer cases Right (or positive) skew = few cases with large scores Left (or negative) skew = few cases with small scores

Skewed distributions may be caused by: “Ceiling effects” – limitation in the high end of the scale “Floor effects” – limitation in the low end of the scale

Sometimes skewed distributions occur because of the nature of the variable itself…

Shapes of Frequency Distributions  Normal and kurtotic distributions

Measures of Central Tendency Median – The value in the middle Mode – The most common value Mean – The average value

The Mean M = the mean X = the scores N = the number of scores

The Median Rank the scores from lowest to highest Median is the score in the middle – if even number of scores, by convention take the average of the two middle ones Median is not as sensitive to extreme values as the mean

The Mode The most frequent score To compute the mode: look at a frequency table and find the most frequent score. In a symmetrical, unimodal distribution, the mean, median and mode are all the same.

Symmetrical Distribution FrequencyFrequency Mean Median Mode

Question Negative Skew FrequencyFrequency Where (approximately) will Mean, Median and Mode be situated?

Problem with the Mean The mean can be strongly influenced by outliers – This distorts the mean as a measure of central tendency The median and mode are less affected by outliers

Measures of Variance – A single number that tells you how spread out a distribution is All M = 15.0

Measures of Variance Range: difference between the maximum and minimum observed values Variance: a measure of the amount that values differ from the mean of their distribution Standard deviation: the average amount (approximately) that values differ from the mean of their distribution

Formula for the sample variance: Estimate of the population variance: Unbiased estimate of population variance Degrees of freedom: df = N-1 Variance

Describing Individual Values Sometimes observations have values that people are familiar with – Rating 1 to 10, Age, Temperature, SAT But sometimes values are on an unfamiliar scale – Score on the Wisconsin Card Sorting Task – APGAR score How can you communicate the relative value of a given observation? – Is that a very high value? very low? somewhere in the middle?

Z Scores Characterize a score in relation to the distribution The number of standard deviations the score is above or below the mean is called the Z score Formula for Z score:

Standard and Raw Scores Z scores are also called “standard scores” The original scores are called “raw scores” For a distribution of Z scores, always M = 0... and always SD = 1