Steven Gollmer Cedarville University

Slides:



Advertisements
Similar presentations
Chapter 23: Inferences About Means
Advertisements

Hypothesis Test II: t tests
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
Single Sample t-test Purpose: Compare a sample mean to a hypothesized population mean. Design: One group.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
T-Tests Lecture: Nov. 6, 2002.
Chapter 11: Inference for Distributions
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Claims about a Population Mean when σ is Known Objective: test a claim.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
- Interfering factors in the comparison of two sample means using unpaired samples may inflate the pooled estimate of variance of test results. - It is.
Created by Erin Hodgess, Houston, Texas Section 7-5 Testing a Claim About a Mean:  Not Known.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Hypothesis Testing for Variance and Standard Deviation
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Testing Hypothesis That Data Fit a Given Probability Distribution Problem: We have a sample of size n. Determine if the data fits a probability distribution.
Section 8-5 Testing a Claim about a Mean: σ Not Known.
10.5 Testing Claims about the Population Standard Deviation.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Chapter Outline Goodness of Fit test Test of Independence.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Revision of basic statistics Hypothesis testing Principles Testing a proportion Testing a mean Testing the difference between two means Estimation.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Inferences Concerning Variances
Chapter 11: Additional Topics Using Inferences 11.1 – Chi-Square: Tests of Independence 11.2 – Chi-Square: Goodness of Fit 11.3 – Testing a Single Variance.
Big Data Steven Gollmer Cedarville University. Working with Large Data Accessing data Collection and calibration assumptions Selecting appropriate parameters.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
Inference ConceptsSlide #1 1-sample Z-test H o :  =  o (where  o = specific value) Statistic: Test Statistic: Assume: –  is known – n is “large” (so.
AP Statistics.  If our data comes from a simple random sample (SRS) and the sample size is sufficiently large, then we know that the sampling distribution.
Correlation. u Definition u Formula Positive Correlation r =
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
1 Testing Statistical Hypothesis The One Sample t-Test Heibatollah Baghi, and Mastee Badii.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Environmental Modeling Basic Testing Methods - Statistics II.
Chapter 9 Introduction to the t Statistic
Section 11.1 Inference for the Mean of a Population AP Statistics March 15, 2010 CASA.
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Lecture Nine - Twelve Tests of Significance.
STA 291 Spring 2010 Lecture 21 Dustin Lueker.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Math 4030 – 10a Tests for Population Mean(s)
Testing a Claim About a Mean:  Not Known
Hypothesis Tests: One Sample
Environmental Modeling Basic Testing Methods - Statistics
Inference on Mean, Var Unknown
Inference for distributions: - Optional topics in comparing distributions IPS chapter 7.3 © 2006 W.H. Freeman and Company.
Hypothesis Tests for a Population Mean in Practice
Sections 6-4 & 7-5 Estimation and Inferences Variation
The t distribution and the independent sample t-test
Recipe for any Hypothesis Test
Independent Samples: Comparing Means
7.3 Hypothesis Testing for the Mean ( unknown)
Introduction to the t Test
Statistical Inference for the Mean: t-test
Testing a Claim About a Standard Deviation or Variance
Inferences from Matched Pairs
Inference Concepts 1-Sample Z-Tests.
STA 291 Spring 2008 Lecture 21 Dustin Lueker.
Presentation transcript:

Steven Gollmer Cedarville University Big Data Steven Gollmer Cedarville University Picture from wikipedia - Dice

Working with Large Data Accessing data Collection and calibration assumptions Selecting appropriate parameters Formatting Calculation Testing hypothesis

Hipparcos Space Astrometry Main Page http://www.rssd.esa.int/index.php?project=HIPPARCOS Data Catalogues http://www.rssd.esa.int/index.php?project=HIPPARCOS&page=Overview http://cdsweb.u-strasbg.fr/ Software Desktop - http://www.rssd.esa.int/index.php?project=HIPPARCOS&page=Celestia2000 Search tool - http://www.rssd.esa.int/index.php?project=HIPPARCOS&page=multisearch2 Data Format Flexible Image Transport System (FITS) - http://fits.gsfc.nasa.gov/

Sloan Digital Sky Survey Main Page http://www.sdss.org/ Data 9th Data Release - http://www.sdss3.org/dr9/ Archive Server - http://dr9.sdss3.org/ Software IDL - http://www.sdss3.org/dr9/software/

Weather Data NOAA National Climatic Data Center http://www.ncdc.noaa.gov/ Popular Data - http://www.ncdc.noaa.gov/most-popular-data Environmental Modeling Center http://www.emc.ncep.noaa.gov/

TERRA/AQUA http://terra.nasa.gov http://aqua.nasa.gov Data Format LARC DAAC - http://eosweb.larc.nasa.gov/ LAADS Web - http://ladsweb.nascom.nasa.gov/index.html Format NetCDF - http://www.unidata.ucar.edu/software/netcdf/ HDF - http://www.hdfgroup.org/

Other Topics of Interest Extra-Solar Planets Asteroid Mapping and Near Earth Detection Earthquakes Agencies and Products NASA - http://www.nasa.gov/home/index.html ESA - http://www.esa.int/ESA USGS - http://www.usgs.gov/ GOES - http://www.goes.noaa.gov/ Paleoclimatology - http://www.ncdc.noaa.gov/paleo/pubs/pcn/pcn-proxy.html

Hypothesis Testing P-value T-test Probability of a value being found assuming the null hypothesis. Usually reject the null hypothesis if p < 0.05 or 0.01 (5% or 1%) May have more stringent criteria for rejection. T-test Assume a normal distribution One-sample test 𝑡= 𝑥 − 𝜇 0 𝑆/ 𝑛 Two-sample test 𝑡= 𝑀 𝑥 − 𝑀 𝑦 𝑆𝑥2 𝑛𝑥 + 𝑆𝑦2 𝑛𝑦 Check significance using T distribution table Compare t value and degrees of freedom 1 sample df = n-1 2 sample df = n1 + n2 – 2 S – Estimate of standard deviation M – Estimate of the mean n – Number of samples

Example Hypothesis Statistics Result 2 tail rejection Data is from a distribution with mean m = 2.5 Statistics X = 3.317 S = 0.7139 df = 5 Result T = 2.80 2 tail rejection p = 0.05 is 2.571 p = 0.02 is 3.365 Data 2.3 4.2 3.6 3.1 2.8 3.9

Z-Value Assume a normal random variable Z – Value x ~ (m, s2) m – mean s – standard deviation Z – Value z ~ (0, 1) If number of samples is large, then z-test will work on one-sample test instead of a t-test. erf(x)= 2 𝜋 0 𝑥 𝑒 − 𝑢 2 𝑑𝑢 One Tail: p=1/2(1+erf(z/ 2 ) Two Tail: p=erf(z/ 2 ) 𝑓 𝑥 = 1 𝜎 2𝜋 𝑒 − (𝑥−𝜇) 2 2 𝜎 2 𝑧~ 𝑥−𝜇 𝜎