ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Slides:

Advertisements

Similar presentations

Request Dispatching for Cheap Energy Prices in Cloud Data Centers

Advertisements

SpringerLink Training Kit

Luminosity measurements at Hadron Colliders

From Word Embeddings To Document Distances

Choosing a Dental Plan Student Name

Virtual Environments and Computer Graphics

Chương 1: CÁC PHƯƠNG THỨC GIAO DỊCH TRÊN THỊ TRƯỜNG THẾ GIỚI

THỰC TIỄN KINH DOANH TRONG CỘNG ĐỒNG KINH TẾ ASEAN –

D. Phát triển thương hiệu

NHỮNG VẤN ĐỀ NỔI BẬT CỦA NỀN KINH TẾ VIỆT NAM GIAI ĐOẠN

Điều trị chống huyết khối trong tai biến mạch máu não

BÖnh Parkinson PGS.TS.BS NGUYỄN TRỌNG HƯNG BỆNH VIỆN LÃO KHOA TRUNG ƯƠNG TRƯỜNG ĐẠI HỌC Y HÀ NỘI Bác Ninh 2013.

Nasal Cannula X particulate mask

Evolving Architecture for Beyond the Standard Model

HF NOISE FILTERS PERFORMANCE

Electronics for Pedestrians – Passive Components –

Parameterization of Tabulated BRDFs Ian Mallett (me), Cem Yuksel

L-Systems and Affine Transformations

CMSC423: Bioinformatic Algorithms, Databases and Tools

Some aspect concerning the LMDZ dynamical core and its use

Bayesian Confidence Limits and Intervals

实习总结（Internship Summary)

Current State of Japanese Economy under Negative Interest Rate and Proposed Remedies Naoyuki Yoshino Dean Asian Development Bank Institute Professor Emeritus,

Front End Electronics for SOI Monolithic Pixel Sensor

Face Recognition Monday, February 1, 2016.

Solving Rubik's Cube By: Etai Nativ.

CS284 Paper Presentation Arpad Kovacs

انتقال حرارت 2 خانم خسرویار.

Summer Student Program First results

Theoretical Results on Neutrinos

HERMESでのHard Exclusive生成過程による核子内クォーク全角運動量についての研究

Wavelet Coherence & Cross-Wavelet Transform

yaSpMV: Yet Another SpMV Framework on GPUs

Creating Synthetic Microdata for Higher Educational Use in Japan: Reproduction of Distribution Type based on the Descriptive Statistics Kiyomi Shirakawa.

MOCLA02 Design of a Compact L-band Transverse Deflecting Cavity with Arbitrary Polarizations for the SACLA Injector Sep. 14th, 2015 H. Maesaka, T. Asaka,

Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,

Fuel cell development program for electric vehicle

Overview of TST-2 Experiment

Optomechanics with atoms

داده کاوی سئوالات نمونه

Inter-system biases estimation in multi-GNSS relative positioning with GPS and Galileo Cecile Deprez and Rene Warnant University of Liege, Belgium

ლექცია 4 - ფული და ინფლაცია

10. predavanje Novac i financijski sustav

Wissenschaftliche Aussprache zur Dissertation

FLUORECENCE MICROSCOPY SUPERRESOLUTION BLINK MICROSCOPY ON THE BASIS OF ENGINEERED DARK STATES* *Christian Steinhauer, Carsten Forthmann, Jan Vogelsang,

Particle acceleration during the gamma-ray flares of the Crab Nebular

Interpretations of the Derivative Gottfried Wilhelm Leibniz

Advisor: Chiuyuan Chen Student: Shao-Chun Lin

Widow Rockfish Assessment

SiW-ECAL Beam Test 2015 Kick-Off meeting

On Robust Neighbor Discovery in Mobile Wireless Networks

Chapter 6 并发：死锁和饥饿 Operating Systems: Internals and Design Principles

You NEED your book!!! Frequency Distribution

Y V =0 a V =V0 x b b V =0 z

Fairness-oriented Scheduling Support for Multicore Systems

Climate-Energy-Policy Interaction

Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,

Ch48 Statistics by Chtan FYHSKulai

The ABCD matrix for parabolic reflectors and its application to astigmatism free four-mirror cavities.

Measure Twice and Cut Once: Robust Dynamic Voltage Scaling for FPGAs

Online Learning: An Introduction

Factor Based Index of Systemic Stress (FISS)

What is Chemistry? Chemistry is: the study of matter & the changes it undergoes Composition Structure Properties Energy changes.

THE BERRY PHASE OF A BOGOLIUBOV QUASIPARTICLE IN AN ABRIKOSOV VORTEX*

Quantum-classical transition in optical twin beams and experimental applications to quantum metrology Ivano Ruo-Berchera Frascati.

The Toroidal Sporadic Source: Understanding Temporal Variations

FW 3.4: More Circle Practice

ارائه یک روش حل مبتنی بر استراتژی های تکاملی گروه بندی برای حل مسئله بسته بندی اقلام در ظروف

Decision Procedures Christoph M. Wintersteiger 9/11/2017 3:14 PM

Limits on Anomalous WWγ and WWZ Couplings from DØ

Presentation transcript:

ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong General notes: This is a big-picture presentation. Given the time constraints, try not get lost in the details (exceeding the primer level) related to a concept that may potentially confuse the attendees. Stick to one practical example throughout so people can follow. (Reading achievement/ability?) Coordinate with Daniela to have the slides sent out to all attendees (both onsite and remote) prior to the event. Go over the work process on the white board even if it is available on the slides - goes for all computations, so bring a calculator Incorporate visual aids to content delivery as much as possible. Using the pointer to connect the content you are verbally delivering with the content on the slides for the attendees. This will facilitate them to follow you both verbally and visually.

Quick Overview of Statistics

Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data (central tendency, variability, skewness) Inferential Statistics: procedure for making inferences about population parameters using sample statistics Sample Population

Measures of Central Tendency Mode: the most frequently occurring value in a distribution Select the value(s) with the highest frequency Median: the value representing the middle point of a distribution Order data Determine the median position = (n + 1) / 2 Locate the median based on step 2 Mean: the arithmetic average of a distribution Sum all the data values and divide by the number of values Σ𝑥 𝑛

Measures of Variability Range: difference between the largest and smallest values in the data Mean deviation: measure of the average absolute deviations from the mean – uncommonly used These measures are not very descriptive of a distribution’s variability, need better measures… 5 ( 𝑋 𝐻 − 𝑋 𝐿 ) |Σ 𝑥 − 𝑥 | 𝑛

Measures of Variability Cont. Sum of squares: sum of the squared deviation scores, used to compute variance and standard deviation Variance: the average squared deviations from the mean Standard deviation: square root of the variance - commonly used 𝑆𝑆= Σ 𝑥 − 𝑥 2 𝑠 2 = Σ 𝑥 − 𝑥 2 𝑛 −1 Example on the following slides 𝑠= Σ 𝑥 − 𝑥 2 𝑛 −1

Variance and Sum of Squares Student 𝑥 (𝑥− 𝑥 ) (𝑥− 𝑥 ) 2 Girl #1 90 Girl #2 23 Girl #3 26 Boy #1 83 Boy #2 48 Boy #3 24 Average = Sum =

Empirical Rule The empirical rule states that symmetric or normal distribution with population mean μ and standard deviation σ have the following properties. Remember z-scores?

Sampling Distribution Theoretical distribution of sample statistics (e.g., the mean, standard deviation, Pearson’s r), as opposed to individual scores NOT the same thing as a sample distribution or a population distribution Used to help generalize the findings of our sample statistics back to our populations Tough to understand, concrete example on next slide Provide an example (students’ reading achievement scores) to explain how one would be constructed theoretically, practical example on next slide

Sampling Distribution All possible outcomes are shown below in Table 1. Table 1. All possible outcomes when two balls are sampled with replacement. Outcome Ball 1 Ball 2 Mean 1 1.0 2 1.5 3 2.0 4 5 6 2.5 7 8 9 3.0 We have a hat with 3 pool balls in it – 1, 2, and 3 – and we want to generate a sampling distribution of the mean for sample size = 2. What are all possible combinations of balls that could be drawn with replacement? Draw 2 balls, compute their mean, plot it on a line, create a distribution of means. This is a concrete example. Imagine doing this with IQ scores, math scores, or any other type of continuous variable – would be impossible to do, which is why it is a theoretical distribution, and we don’t actually create it.

Sampling Error As has been stated before, inferential statistics involve using a representative sample to make judgments about a population. Lets say that we wanted to determine the nature of the relationship between county and achievement scores among Texas students. We could select a representative sample of say 10,000 students to conduct our study. If we find that there is a statistically significant relationship in the sample we could then generalize this to the entire population. However, even the most representative sample is not going to be exactly the same as its population. Given this, there is always a chance that the things we find in a sample are anomalies and do not occur in the population that the sample represents. This error is referred as sampling error. Samples are not always exactly representative of the population – this is called sampling error. Sampling error is a result of using samples to approximate our populations of interest.

Sampling Error A formal definition of sampling error is as follows: Sampling error occurs when random chance produces a sample statistic that is not equal to the population parameter it represents. Due to sampling error there is always a chance that we are making a mistake when rejecting or failing to reject our null hypothesis. Remember that inferential procedures are used to determine which of the statistical hypotheses is true. This is done by rejecting or failing to reject the null hypothesis at the end of a procedure.

Sampling Distribution and Standard Error (SE) https://www.youtube.com/watch?v=hvIDuEmWt2k Watch video up to about 2:45.

Hypothesis Testing Null Hypothesis Significance Testing (NHST) Testing p-values using statistical significance tests (image from cnx.org) Effect Size Measure magnitude of the effect (e.g., Cohen’s d) NHSST – z-test, t-test, ANOVA, etc. Many effect size measures, eta squared, r squared, Cohen’s d, etc. Talk about confidence intervals here?

Null Hypothesis Significance Testing Statistical significance testing answers the following question: Assuming the sample data came from a population in which the null hypothesis is exactly true, what is the probability of obtaining the sample statistic one got for one’s sample data with the given sample size? (Thompson, 1994) Alternatively: Statistical significance testing is used to examine a statement about a relationship between two variables. Under Alternatively: discuss causal versus correlational relationships?

Hypothetical Example Is there a difference between the reading abilities of boys and girls? Null Hypothesis (H0): There is not a difference between the reading abilities of boys and girls. Alternative Hypothesis (H1): There is a difference between the reading abilities of boys and girls. Alternative hypotheses may be non-directional (above) or directional (e.g., boys have a higher reading ability than girls). Formulate 2 hypotheses regarding this hypothesis: the null and the alternative. Null hypothesis always assumes no relationship between the variables. Alternative hypotheses’ directionality is dependent on theory – must have good theoretical reason to hypothesize directionality.

Testing the Hypothesis Use a sampling distribution to calculate the probability of a statistical outcome. pcalc = likelihood of the sample’s result pcalc < pcritical: reject H0 pcalc ≥ pcritical: fail to reject H0 This slide assumes the sampling distribution is discussed elsewhere… P-calc --- the likelihood of an outcome occurring

Level of Significance (pcrit) Alpha level (α) determines: The probability at which you reject the null hypothesis The probability of making a Type I error (typically .05 or .01) True Outcome in Population Reject H0 is true H0 is false Observed Outcome Reject H0 Type I error (α) Correct Decision Fail to reject H0 Type II error (β)

Example: Independent t-test Research Question: Is there a difference between the reading abilities of boys and girls? Hypotheses: H0: There is not a difference between the reading abilities of boys and girls. H1: There is a difference between the reading abilities of boys and girls.

Dataset Reading test scores (out of 100) Boys Girls 88 82 90 70 95 92 81 80 93 71 86 73 79 85 89 87

Significance Level α = .05, two-tailed test df = n1 + n2 – 2 = 10 + 10 – 2 = 18 Use t-table to determine tcrit tcrit = ±2.101 Explain what df is?

Decision Rules If tcalc > tcrit, then pcalc < pcrit Reject H0 Fail to reject H0 -2.101 2.101 p = .025

Computations Boys Girls Frequency (N) 10 Sum (Σ) 807 881 Mean ( 𝑋 ) 80.70 88.10 Variance (S2) 55.34 26.54 Standard Deviation (S) 7.44 5.15 Skip most computational stuff. Algebra they can read on their own/computer does most of the work for them anyway.

Computations cont. Pooled variance Standard Error = 40.944 = 2.862

Computations cont. = -2.586 𝑡= 𝑋 1 − 𝑋 2 𝑆𝐸 𝑋 1 − 𝑋 2 Compute tcalc Decision: Reject H0. Girls scored statistically significantly higher on the reading test than boys did. 𝑡= 𝑋 1 − 𝑋 2 𝑆𝐸 𝑋 1 − 𝑋 2 = -2.586 Refer back to means and bell curve if necessary to explain the decision.

Confidence Intervals CI95 = 𝑥 ± tcrit (SE) Sample means provide a point estimate of our population means. Due to sampling error, our sample estimates may not perfectly represent our populations of interest. It would be useful to have an interval estimate of our population means so we know a plausible range of values that our population means may fall within. 95% confidence intervals do this. Can help reinforce the results of the significance test. CI95 = 𝑥 ± tcrit (SE) = -7.4 ± 2.101(2.862) = [-13.412, -1.387]

Statistical Significance vs. Importance of Effect Does finding that p < .05 mean the finding is relevant to the real world? Not necessarily… https://www.youtube.com/watch?v=5OL1RqHrZQ8 Effect size provides a measure of the magnitude of an effect Practical significance Cohen’s d, η2, and R2 are all types of effect sizes Watch about 7 minutes of the video Effect size - Can have a statistically significant effect that has no practical implications for the real world, or can have a non-significant effect that has a large effect size but was ns due to sample size or other reasons. Important to look at ES’s. and NHSST results. Each is only a piece of the puzzle, and need to look at both to better understand the whole picture.

Cohen’s d = -1.16 Equation: Guidelines: d = .2 = small d = .5 = moderate d = .8 = large Not only is our effect statistically significant, but the effect size is large. = -1.16 Standardized mean difference.