DCAL Stats Workshop Bodo Winter.

Slides:



Advertisements
Similar presentations
Richard M. Jacobs, OSA, Ph.D.
Advertisements

DENSITY CURVES and NORMAL DISTRIBUTIONS. The histogram displays the Grade equivalent vocabulary scores for 7 th graders on the Iowa Test of Basic Skills.
Measures of Dispersion or Measures of Variability
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Measures of Spread The Range, Variance, and Standard Deviation.
Chapter 13 Analyzing Quantitative data. LEVELS OF MEASUREMENT Nominal Measurement Ordinal Measurement Interval Measurement Ratio Measurement.
Chapter 14 Analyzing Quantitative Data. LEVELS OF MEASUREMENT Nominal Measurement Nominal Measurement Ordinal Measurement Ordinal Measurement Interval.
Introduction to Educational Statistics
Measures of Dispersion
Today: Central Tendency & Dispersion
STAT 13 -Lecture 2 Lecture 2 Standardization, Normal distribution, Stem-leaf, histogram Standardization is a re-scaling technique, useful for conveying.
Quiz 2 Measures of central tendency Measures of variability.
Chapters 1 & 2 Displaying Order; Central Tendency & Variability Thurs. Aug 21, 2014.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistics Recording the results from our studies.
Some Useful Continuous Probability Distributions.
And the Rule THE NORMAL DISTRIBUTION. SKEWED DISTRIBUTIONS & OUTLIERS.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
KNR 445 Statistics t-tests Slide 1 Variability Measures of dispersion or spread 1.
A tour of fundamental statistics introducing Basic Statistics.
INVESTIGATION 1.
Agenda Descriptive Statistics Measures of Spread - Variability.
Hotness Activity. Descriptives! Yay! Inferentials Basic info about sample “Simple” statistics.
Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Normal Distribution. Normal Distribution: Symmetric: Mean = Median = Mode.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Measures of variability: understanding the complexity of natural phenomena.
Statistics What is statistics? Where are statistics used?
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Descriptive Statistics(Summary and Variability measures)
Data Analysis. Statistics - a powerful tool for analyzing data 1. Descriptive Statistics - provide an overview of the attributes of a data set. These.
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
7 th Grade Math Vocabulary Word, Definition, Model Emery Unit 4.
Exploratory Data Analysis
SFB stats workshop Bodo Winter.
Advanced Quantitative Techniques
Statistical Methods Michael J. Watts
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
Statistical Methods Michael J. Watts
CHAPTER 1 Exploring Data
Measures of Central Tendency
Basic Statistics Measures of Variability.
Univariate Analysis/Descriptive Statistics
Central Tendency and Variability
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Do-Now-Day 2 Section 2.2 Find the mean, median, mode, and IQR from the following set of data values: 60, 64, 69, 73, 76, 122 Mean- Median- Mode- InterQuartile.
Descriptive Statistics
Description of Data (Summary and Variability measures)
Measures of Central Tendency and Dispersion
(12) students were asked their SAT Math scores:
Describing Location in a Distribution
Summary Statistics 9/23/2018 Summary Statistics
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores
Central tendency and spread
Central Tendency.
Statistical Evaluation
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Descriptive Statistics: Describing Data
Data analysis and basic statistics
What would be the typical temperature in Atlanta?
Section 2.1 Density Curves & the Normal Distributions
MATH 2400 – Ch. 2 Vocabulary Mean: the average of a set of data sum/n
Statistics for a Single Measure (Univariate)
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Lecture 4 Psyc 300A.
Central Tendency & Variability
Presentation transcript:

DCAL Stats Workshop Bodo Winter

Outline Two learning curves Friday Jan 19 Saturday Jan 20

Outline Two learning curves Friday Jan 19 Saturday Jan 20

Outline Two learning curves Friday Jan 19 Saturday Jan 20

What is statistics? “Math-assisted thinking” “Statistics, more than most other areas of mathematics, is just formalized common sense, quantified straight thinking.” Paulos (1992: 58) Paulos, J. A. (1992). Beyond numeracy: Ruminations of a numbers man. New York: Vintage Books.

Statistics is part of the entire research cycle Theory/Hypothesis Publish paper, data and scripts Data collection Write-up ALL OF THAT IS STATISTICS Preprocessing/ Data Preparation Statistical Analysis

Statistics is part of the entire research cycle Theory/Hypothesis Publish paper, data and scripts “confirmatory statistics” Data collection Write-up ALL OF THAT IS STATISTICS Preprocessing/ Data Preparation Statistical Analysis

Statistics is part of the entire research cycle “confirmatory statistics” = hypothesis-testing “exploratory statistics” ALL OF THAT IS STATISTICS = hypothesis- generating

“Getting meaning from data” Descriptive Statistics Michael Starbird Inferential Statistics

“Getting meaning from data” Word Emotional Valence minty +1.52 juicy +1.56 smelly -1.87 sweet +2.12 putrid -1.78 delicious +1.82 stinky -1.49 rancid -2.11 Descriptive Statistics Michael Starbird Inferential Statistics Winter (2016), Language, Cognition and Neuroscience

“Getting meaning from data” Word Emotional Valence minty +1.52 juicy +1.56 smelly -1.87 sweet +2.12 putrid -1.78 delicious +1.82 stinky -1.49 rancid -2.11 Winter (2016), Language, Cognition and Neuroscience

“Getting meaning from data” Word Emotional Valence sweet +2.12 delicious +1.82 juicy +1.56 minty +1.52 stinky -1.49 putrid -1.78 smelly -1.87 rancid -2.11 M = 1.8 M = -1.8 Winter (2016), Language, Cognition and Neuroscience

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution”

Everything is grounded in the notion of a “distribution” “uniform distribution”

Everything is grounded in the notion of a “distribution” “uniform distribution” Inspired by Cartoon Guide to Statistics

Everything is grounded in the notion of a “distribution” “uniform distribution” Inspired by Cartoon Guide to Statistics

Everything is grounded in the notion of a “distribution” Inspired by Cartoon Guide to Statistics

Everything is grounded in the notion of a “distribution” “normal distribution” Inspired by Cartoon Guide to Statistics

Everything is grounded in the notion of a “distribution” “Gaussian distribution” Inspired by Cartoon Guide to Statistics

Everything is grounded in the notion of a “distribution” “distribution with positive skew” Inspired by Cartoon Guide to Statistics

Ways continuous distributions differ Location Spread Shape Mean Median Mode Range Variance Standard deviation Inter-Quartile Range

Differences in location Warriner et al. (2013), Behavior Research Methods

Differences in location -4 +4 Warriner et al. (2013), Behavior Research Methods

Differences in location -4 +4 Warriner et al. (2013), Behavior Research Methods

Differences in location M = 0.2 -4 +4 Warriner et al. (2013), Behavior Research Methods

Differences in location M = -0.6 -4 +4 Warriner et al. (2013), Behavior Research Methods

Differences in location M = -0.6 -4 +4 Warriner et al. (2013), Behavior Research Methods

Differences in location sum of all the numbers (from the first number to the nth number) Differences in location divided by how many numbers you have +4 Warriner et al. (2013), Behavior Research Methods

Example: the mean of three response times 300ms 200ms 400ms Sum: 300 + 200 + 400 = 900 Divided by N: 900 / 3 = 300ms

The mean is a “balance point”. The median is a “half-way point”.

The mean is a “balance point”. The median is a “half-way point”.

The mean is a “balance point”. The median is a “half-way point”. 50% 50%

The mean is a “balance point”. The median is a “half-way point”. 50% 50%

Differences in spread: range -2.11 +1.56 -4 +4 Warriner et al. (2013), Behavior Research Methods

Differences in spread: standard deviation -4 +4 SD = 1.21 Warriner et al. (2013), Behavior Research Methods

Differences in spread: standard deviation -4 +4 SD = 1.21 Warriner et al. (2013), Behavior Research Methods

the mean Warriner et al. (2013), Behavior Research Methods

differences from the mean Warriner et al. (2013), Behavior Research Methods

squared differences from the mean Warriner et al. (2013), Behavior Research Methods

sum of squared differences from the mean Warriner et al. (2013), Behavior Research Methods

conceptually: “average” of sum of squared differences from the mean Warriner et al. (2013), Behavior Research Methods

conceptually: “undoing” the squaring Warriner et al. (2013), Behavior Research Methods

You can think of the standard deviation conceptually as the “average deviation” from the mean* * it is not technically the average deviation, but the basic idea is right Warriner et al. (2013), Behavior Research Methods

Differences in spread: SD -4 +4 SD = 1.21 Warriner et al. (2013), Behavior Research Methods

Differences in spread: SD -4 +4 SD = 0.41

The 68%-95% rule of thumb

The 68%-95% rule of thumb If the distribution is approximately normal 68% of the data fall within the interval: [ mean - SD, mean + SD ] 95% of the data fall within the interval: [ mean + 2 * SD, mean + 2 * SD ]

The 68%-95% rule of thumb Imagine a paper reports these two numbers: M = 600 ms, SD = 50 ms Between which two numbers do you expect 68% of the data? 550ms – 650 ms

The 68%-95% rule of thumb Imagine a paper reports these two numbers: M = 600 ms, SD = 50 ms Between which two numbers do you expect 95% of the data? 500ms – 700 ms

In R, computing all of this is easy... mean(yournumbers) sd(yournumbers) median(yournumbers) range(yournumbers)

Approaching R: Having the right attitude “I have been writing R code for years, and every day I still write code that doesn’t work!” Wickham & Grolemund (2017: 7) Wickham, H. & Grolemund, G (2017). R for Data Science. Sebastopol, CA: O’Reilly.