DCAL Stats Workshop Bodo Winter.

Outline: Two learning curves. Friday Jan 19, Saturday Jan 20.

What is statistics? “Math-assisted thinking” “Statistics, more than most other areas of mathematics, is just formalized common sense, quantified straight thinking.” Paulos (1992: 58) Paulos, J. A. (1992). Beyond numeracy: Ruminations of a numbers man. New York: Vintage Books.

Statistics is part of the entire research cycle: Theory/Hypothesis, Data collection, Preprocessing/Data Preparation, Statistical Analysis, Write-up, Publish paper, data and scripts. ALL OF THAT IS STATISTICS.

“Confirmatory statistics” = hypothesis-testing. “Exploratory statistics” = hypothesis-generating.

“Getting meaning from data” (Michael Starbird): Descriptive Statistics, Inferential Statistics

Word        Emotional Valence
minty       +1.52
juicy       +1.56
smelly      -1.87
sweet       +2.12
putrid      -1.78
delicious   +1.82
stinky      -1.49
rancid      -2.11
Winter (2016), Language, Cognition and Neuroscience

The same words sorted by valence, with the mean of each group:

Word        Emotional Valence
sweet       +2.12
delicious   +1.82
juicy       +1.56
minty       +1.52    M = 1.8
stinky      -1.49
putrid      -1.78
smelly      -1.87
rancid      -2.11    M = -1.8
Winter (2016), Language, Cognition and Neuroscience
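As a rough check of the two group means reported on this slide, the following R sketch recomputes them from the eight valence values listed above. The variable names (`valence`, `positive`, `negative`) and the grouping by sign are illustrative assumptions, not part of the original slides.

```r
# Emotional valence values as listed on the slide (Winter 2016)
valence <- c(sweet = 2.12, delicious = 1.82, juicy = 1.56, minty = 1.52,
             stinky = -1.49, putrid = -1.78, smelly = -1.87, rancid = -2.11)

# Split into positive and negative words (illustrative grouping by sign)
positive <- valence[valence > 0]
negative <- valence[valence < 0]

# Group means, rounded to one decimal place as on the slide
mean_positive <- round(mean(positive), 1)
mean_negative <- round(mean(negative), 1)
print(c(positive = mean_positive, negative = mean_negative))
```

Two numbers are enough here to summarize eight data points, which is the sense in which descriptive statistics "get meaning from data".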

Everything is grounded in the notion of a “distribution”. Shapes shown: “uniform distribution”, “normal distribution” (also called “Gaussian distribution”), and “distribution with positive skew”. Inspired by Cartoon Guide to Statistics.
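To get a feel for these shapes, one could simulate them in R. This is only a sketch: the seed, the sample size, and the choice of an exponential distribution as an example of positive skew are assumptions made for illustration, not taken from the slides.

```r
set.seed(42)   # arbitrary seed so the sketch is reproducible
n <- 10000     # arbitrary sample size

uniform_x <- runif(n, min = 0, max = 1)   # flat: all values equally likely
normal_x  <- rnorm(n, mean = 0, sd = 1)   # bell-shaped (Gaussian)
skewed_x  <- rexp(n, rate = 1)            # long right tail (positive skew)

# With positive skew, the long right tail pulls the mean above the median
print(c(mean = mean(skewed_x), median = median(skewed_x)))

# Histograms of the three shapes (only drawn in an interactive session)
if (interactive()) {
  par(mfrow = c(1, 3))
  hist(uniform_x, main = "uniform")
  hist(normal_x,  main = "normal")
  hist(skewed_x,  main = "positive skew")
}
```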

Ways continuous distributions differ:
Location: mean, median, mode
Spread: range, variance, standard deviation, inter-quartile range
Shape

Differences in location (figure: distributions on an axis from -4 to +4, with M = 0.2 and M = -0.6). Warriner et al. (2013), Behavior Research Methods

Differences in location: the mean is the sum of all the numbers (from the first number to the nth number) divided by how many numbers you have: $\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i$. Warriner et al. (2013), Behavior Research Methods

Example: the mean of three response times: 300 ms, 200 ms, 400 ms. Sum: 300 + 200 + 400 = 900. Divided by N: 900 / 3 = 300 ms.
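The same arithmetic can be sketched in R, first by hand and then with the built-in `mean()`. The variable names (`rt`, `total`, `n`, `mean_by_hand`) are illustrative.

```r
rt <- c(300, 200, 400)       # the three response times in ms from the example

total <- sum(rt)             # 300 + 200 + 400 = 900
n <- length(rt)              # 3 numbers
mean_by_hand <- total / n    # 900 / 3 = 300

# The built-in function gives the same result
print(mean_by_hand == mean(rt))
```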

The mean is a “balance point”. The median is a “half-way point”: 50% of the data lie below it and 50% above.
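Both ideas can be checked numerically. In the sketch below the data are invented for illustration (five response times with one deliberately large value); they do not come from the slides.

```r
# Hypothetical response times in ms; the last value is an invented outlier
rt <- c(200, 300, 400, 500, 1600)

m   <- mean(rt)     # balance point
med <- median(rt)   # half-way point

# Balance point: deviations below and above the mean cancel out exactly
balance <- sum(rt - m)

# Half-way point: as many values lie below the median as above it
n_below <- sum(rt < med)
n_above <- sum(rt > med)

print(c(mean = m, median = med, balance = balance,
        n_below = n_below, n_above = n_above))
```

In this made-up sample the single large value pulls the mean (600) well above the median (400), while the median stays in the middle of the ordered values.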

Differences in spread: range (figure: axis from -4 to +4, with -2.11 and +1.56 marked). Warriner et al. (2013), Behavior Research Methods

Differences in spread: standard deviation (figure: axis from -4 to +4; SD = 1.21). Warriner et al. (2013), Behavior Research Methods

The standard deviation is built up in steps:
1. the mean
2. differences from the mean
3. squared differences from the mean
4. sum of squared differences from the mean
5. conceptually: “average” of the sum of squared differences from the mean
6. conceptually: “undoing” the squaring

You can think of the standard deviation conceptually as the “average deviation” from the mean.* (* It is not technically the average deviation, but the basic idea is right.) Warriner et al. (2013), Behavior Research Methods
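The build-up of the standard deviation can be followed step by step in R. This is a minimal sketch using the three response times from the earlier example; the variable names are illustrative. Note that R's `sd()` divides the sum of squares by n - 1 rather than n, so the hand computation does the same.

```r
x <- c(300, 200, 400)              # response times in ms from the earlier example

m        <- mean(x)                # the mean
devs     <- x - m                  # differences from the mean
sq_devs  <- devs^2                 # squared differences from the mean
ss       <- sum(sq_devs)           # sum of squared differences
variance <- ss / (length(x) - 1)   # "average" (dividing by n - 1, as sd() does)
sd_hand  <- sqrt(variance)         # "undoing" the squaring

# The hand computation matches the built-in function
print(c(by_hand = sd_hand, built_in = sd(x)))
```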

Differences in spread: SD (figure: axis from -4 to +4; one distribution with SD = 1.21, another with SD = 0.41). Warriner et al. (2013), Behavior Research Methods

The 68%-95% rule of thumb: If the distribution is approximately normal, 68% of the data fall within the interval [ mean - SD, mean + SD ], and 95% of the data fall within the interval [ mean - 2 * SD, mean + 2 * SD ].

The 68%-95% rule of thumb. Imagine a paper reports these two numbers: M = 600 ms, SD = 50 ms. Between which two numbers do you expect 68% of the data? Answer: 550 ms to 650 ms.

The 68%-95% rule of thumb. Imagine a paper reports these two numbers: M = 600 ms, SD = 50 ms. Between which two numbers do you expect 95% of the data? Answer: 500 ms to 700 ms.
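The two intervals can be reproduced in R, and the percentages themselves can be checked against the normal distribution with `pnorm()`. This is a sketch; the variable names are illustrative.

```r
m <- 600   # reported mean in ms
s <- 50    # reported SD in ms

# Intervals from the rule of thumb
interval_68 <- c(m - s, m + s)           # mean plus/minus one SD
interval_95 <- c(m - 2 * s, m + 2 * s)   # mean plus/minus two SDs

# Proportion of a normal distribution within 1 and 2 SDs of the mean
p_1sd <- pnorm(1) - pnorm(-1)
p_2sd <- pnorm(2) - pnorm(-2)

print(interval_68)
print(interval_95)
print(round(c(within_1sd = p_1sd, within_2sd = p_2sd), 3))
```

The exact proportions are about 0.683 and 0.954, which is why 68% and 95% are described as a rule of thumb rather than exact values.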

In R, computing all of this is easy:
    mean(yournumbers)
    sd(yournumbers)
    median(yournumbers)
    range(yournumbers)
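For a runnable version, `yournumbers` needs to hold some data. The sketch below fills it with the eight valence values from the earlier slide, which is an illustrative choice. The last two calls are not on the slide; `var()` and `IQR()` are base R functions for the variance and the inter-quartile range.

```r
# Valence values from the earlier slide, standing in for "yournumbers"
yournumbers <- c(1.52, 1.56, -1.87, 2.12, -1.78, 1.82, -1.49, -2.11)

print(mean(yournumbers))
print(sd(yournumbers))
print(median(yournumbers))
print(range(yournumbers))   # smallest and largest value

# Not on the slide: the other measures of spread
print(var(yournumbers))     # variance (the square of the SD)
print(IQR(yournumbers))     # inter-quartile range
```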

Approaching R: Having the right attitude. “I have been writing R code for years, and every day I still write code that doesn’t work!” Wickham & Grolemund (2017: 7). Wickham, H. & Grolemund, G. (2017). R for Data Science. Sebastopol, CA: O’Reilly.