Basic Statistics (for this class) Special thanks to Jay Pinckney (The HPLC and Statistics Guru) APOS.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Objectives (BPS chapter 24)
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter 12 Simple Regression
SIMPLE LINEAR REGRESSION
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Today Concepts underlying inferential statistics
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Correlation and Regression Analysis
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
Linear Regression/Correlation
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Correlation & Regression
Inferential Statistics
Lecture 16 Correlation and Coefficient of Correlation
Understanding Research Results
AM Recitation 2/10/11.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Introduction to Linear Regression and Correlation Analysis
Correlation and Linear Regression
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter 15 Correlation and Regression
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
INTRODUCTORY LINEAR REGRESSION SIMPLE LINEAR REGRESSION - Curve fitting - Inferences about estimated parameter - Adequacy of the models - Linear.
Statistical Analysis Topic – Math skills requirements.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Section 9-1: Inference for Slope and Correlation Section 9-3: Confidence and Prediction Intervals Visit the Maths Study Centre.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
I231B QUANTITATIVE METHODS ANOVA continued and Intro to Regression.
Correlation & Regression Analysis
I271B QUANTITATIVE METHODS Regression and Diagnostics.
PCB 3043L - General Ecology Data Analysis.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Video Conference 1 AS 2013/2012 Chapters 10 – Correlation and Regression 15 December am – 11 am Puan Hasmawati Binti Hassan
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
REGRESSION AND CORRELATION SIMPLE LINEAR REGRESSION 10.2 SCATTER DIAGRAM 10.3 GRAPHICAL METHOD FOR DETERMINING REGRESSION 10.4 LEAST SQUARE METHOD.
Appendix I A Refresher on some Statistical Terms and Tests.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
26134 Business Statistics Week 5 Tutorial
Correlation – Regression
Correlation and Regression
Chapter 10 Correlation and Regression
Simple Linear Regression
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Basic Statistics (for this class) Special thanks to Jay Pinckney (The HPLC and Statistics Guru) APOS

Some basic Definitions Population- the totality of individual observations about which inferences are to be made. Sample -collection of individual observations selected by a specified procedure. They are a subset of the population. – More observations are usually better than a few – Balanced designs (the same number of samples for all treatments) are preferred Variable (character) -the actual property measured by the individual observations Variance (aka Mean Square)- The mean of the squared deviations of observations from their arithmetic mean. Units are NOT the same as the original observations – AKA measures how far a set of numbers is spread out. Standard Deviation (SD)- The square root of variance. Units are the same as original observations 1 SD= 68.26%2 SD=95.46%3 SD=99.72%

Measurement Variables (3 Types) Categorical -observations in a limited number of categories which have no obvious scale (Diatom, dinoflagellate, cyanobacteria). Discrete - (discontinuous) a real scale, but not all values are possible. – Have only certain fixed numerical values with no intermediate values possible. (body segments, teeth, counts, number of offspring, eggs). – Discrete variables are usually integers. Continuous- any value is possible, only restricted by the measuring device (lengths, concentrations, etc.). – Includes areas, volumes, weights, angles, temperatures, time, percentages, rates, and sometimes behavior.

The Awesome Power of Statistics (APOS) Hypothesis Testing is the foundation for the scientific method Statistics is: the tool used to assign a level of confidence in hypothesis testing The Null Hypothesis (Ho) -The hypothesis that nothing is going on or that there is NO difference between the sets of observations – phytoplankton biomass is the same....or....pH is constant Statistics are used to test the null hypothesis....either accept or reject based on some predetermined level of confidence (usually p<.05) If you reject the null hypothesis, you must accept the alternate hypothesis (Ha) that there is a difference between the sets of observations example: phytoplankton biomass is different....or...pH is different

How confident are you in your decision? The P Value is the bottom line This is the probability that the null hypothesis is true A p-value of 0.05 (5% chance of the null hypothesis being true) is usually used as the critical level for the rejection of a null hypothesis. The smaller the p-value, the more confidence you can place in your conclusion. For example, a p-value of means that there is a 1 in 1000 chance that the null hypothesis is true. What is the best p-value to use?

T-Test 2 Basic Types Type 1 – Group Comparisons (unpaired) t-Test Tests the null hypothesis that the two sets of data have the same mean (μ1 = μ2) – When the Data consist of 2 Groups (Unpaired Data) – There is no logical PAIR of samples – Use when you have 2 treatments and want to determine if the mean values of the two treatment groups are significantly different Example: Are boys taller than girls?

T-Test 2 Basic Types Type 2 – Paired Comparisons t-Test – When the data are “Paired” – Each observation for one treatment is paired with one for the other treatment (2 measurements made on the same EU) – Usually applied when an individual is tested for two different factors – Before and After Experiments (make a measurement, do something, make another after) – Also, two treatments are applied to the same EU – This test controls for variability between EU’s Examples: – Arm length in humans (left arm vs. right arm) (EU is the individual) – Two vertical profiles of pH in a sediment core (EU is the core) – Surface and bottom measurements of dissolved oxygen at the same location (EU is the location) – Collect a sample, split into two parts, apply different treatments to each half

Example Problem Galveston Bay is usually described as a shallow estuary (<3 m) with a well-mixed water column. An investigator conducted a preliminary study to try to confirm this statement. Starting in January 2006, the investigator went to a representative location in the bay and obtained measurements of various water quality parameters from two depths: 0.5 m (surface) and 2.5 m (bottom). These measurements were repeated at weekly/biweekly intervals for one year to gather enough data to make a general conclusion about the mixing status (mixed or stratified) of the water column at this location.

Example

Answers

Correlations Correlation and Regression are often misused terms that are usually assumed to be synonymous Correlation Concerned mostly with whether two variables are interdependent, vary together Cannot express one as a function of the other (i.e., no lines or equations) No distinction between dependent and independent variables Cannot assume or imply Causality (one variable does not cause the other, no cause and effect) More valid assumption is that the two variables are both effects of a common cause Purpose of the analysis is to estimate the degree to two variables vary together Regression Purpose is to describe the dependence of a variable (y) on an independent variable (x) Independent Variable - the variable under control of the investigator, fixed and known without error Dependent Variable - the variable that is measured (with error) Used to support hypotheses regarding possible causation of changes in y by changes in x Used to predict values of y given a value of x Used to explain variation in y due to x, using x as a statistical control Implies a Cause (x) and an Effect (y) relationship between two variables

Cont… Correlation analysis is used to determine the degree of association between two variables Determine whether two sets of observations are associated or correlated, the strength of the correlation, and whether it is significant or not Standard Correlation - Pearson’s Product-Moment Correlation The statistic is denoted as r (true correlation is ρ) and is termed the correlation coefficient r ranges from -1 (perfect negative correlation) to 0 (no correlation) to +1 (perfect positive correlation) The p-value indicates the significance for testing the hypothesis: H0: ρ = 0 Ha: ρ ≠ 0 Results are usually reported as “Pearson product-moment correlation analysis indicates a significant positive association between y1 and y2 (r = 0.51, n=22, p < 0.05)”

Example

Answers

Regression Analysis Used to determine a Cause and Effect relationship between two variables Useful for quantifying the form and strength of a relationship between 2 variables Data consist of a Y value (effect, dependent) and X value (cause, predictor, independent) May have multiple Y values for each value of X Least-Squares Linear Regression Analysis will supply the slope and y-intercept of a “best fit” line based on a minimization of the squared differences (least squares) Equation: Y = a + bX a = constant y-intercept b = slope

Other Formulas

Example 1.

ANOVA –Analysis of Variance A Very Common parametric test used to determine if multiple means are significantly different Tests the H 0 that separate sets of data have the same mean Does this by testing if the variation within groups is the same as the variation between groups

Terminology Factor-The item that is being manipulated (e.g., nutrient concentration) Level-the different degrees of the factor (e.g., actual nutrient concentrations used in the experiment; control, 5 μM, 10 μM, 15 μM) Treatments-are the same as the factor levels

Example

Answer