Modifyuse bio. IB book IB Biology Topic 1: Statistical Analysis ary/Science/c4b/1/stat1.htm

Slides:



Advertisements
Similar presentations
STATISTICAL ANALYSIS Frequency Distribution # Indivi duals Median Mean MedianMean Median Figure 2.Frequency distributions of three different samples. ABC.
Advertisements

1 SPSS output & analysis. 2 The Regression Equation A line in a two dimensional or two-variable space is defined by the equation Y=a+b*X The Y variable.
“Students” t-test.
An investigation of shell length variation in a mollusc species A marine gastropod (Thersites bipartita) has been sampled from two different locations:A.
Correlation and regression
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Correlation and Regression
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Biol 500: basic statistics
Chapter 11: Inference for Distributions
Using Statistics in Research Psych 231: Research Methods in Psychology.
EDUC 200C Section 4 – Review Melissa Kemmerle October 19, 2012.
Relationships Among Variables
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Lecture 5 Correlation and Regression
Lecture 16 Correlation and Coefficient of Correlation
AM Recitation 2/10/11.
TOPIC 1 STATISTICAL ANALYSIS
Data Collection & Processing Hand Grip Strength P textbook.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Statistical Analysis Topic – Math skills requirements.
Research & Statistics Looking for Conclusions. Statistics Mathematics is used to organize, summarize, and interpret mathematical data 2 types of statistics.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Correlation Assume you have two measurements, x and y, on a set of objects, and would like to know if x and y are related. If they are directly related,
Statistical Analysis IB Topic 1. Why study statistics?  Scientists use the scientific method when designing experiments  Observations and experiments.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Statistical Analysis Topic – Math skills requirements.
Data Analysis.
PCB 3043L - General Ecology Data Analysis.
Intro to Psychology Statistics Supplement. Descriptive Statistics: used to describe different aspects of numerical data; used only to describe the sample.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Advanced Higher Biology Unit 3 Investigative Biology.
USING GRAPHING SKILLS. Axis While drawing graphs, we have two axis. X-axis: for consistent variables Y-axis: for other variable.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Welcome back!. May 12 th, 2014…TASIS Exam hall 3vI 3vI.
Statistical Analysis IB Topic 1. IB assessment statements:  By the end of this topic, I can …: 1. State that error bars are a graphical representation.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Regression and Correlation
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
PCB 3043L - General Ecology Data Analysis.
Understanding Results
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Correlation and Simple Linear Regression
STATISTICAL ANALYSIS.
Correlation and Regression
Statistical Analysis Error Bars
Correlation and Simple Linear Regression
Statistical Analysis IB Topic 1.
Statistical Analysis - Mean(Average), Median, Mode, Range - Standard Deviation - T-test/ANOVA - Correlation - Chi Test - Percent Change.
STATISTICS Topic 1 IB Biology Miss Werba.
STATISTICAL ANALYSIS.
Simple Linear Regression and Correlation
Inferential Statistics
Topic 8 Correlation and Regression Analysis
Descriptive Statistics
Spearman’s Rank Correlation Coefficient
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Modifyuse bio. IB book IB Biology Topic 1: Statistical Analysis ary/Science/c4b/1/stat1.htm ary/Science/c4b/1/stat1.htm

An investigation of shell length variation in a mollusc species A marine gastropod (Thersites bipartita) has been sampled from two different locations:A marine gastropod (Thersites bipartita) has been sampled from two different locations: –Sample A: Shells found in full marine conditions –Sample B: Shells found in brackish water conditions. sample size = 10 shellssample size = 10 shells length of the shell measured as shownlength of the shell measured as shown

Analysis of Gastropod Data measured height of shells (ruler)measured height of shells (ruler) Units: mm + / - 1 mm (ERROR)Units: mm + / - 1 mm (ERROR) Significant digitsSignificant digits UncertaintyUncertainty –all measuring devices! –reflects the precision of the measurement There should be no variation in the precision of raw dataThere should be no variation in the precision of raw data must be consistent.

1.1.1 Error bars and the representation of variability in data. Biological systems are subject to a genetic program and environmental variationBiological systems are subject to a genetic program and environmental variation collect a set of data it shows variationcollect a set of data it shows variation Graphs: show variation using error barsGraphs: show variation using error bars –show range of the data or –standard deviation

Mean & Range for each group MarineMarine BrackishBrackish

Graph Mean & Range for each group Quick comparison of the 2 data setsQuick comparison of the 2 data sets

1.1.2 Calculation of Mean and Std Dev 3 classes of data3 classes of data MeanMean –arithmetic mean (avg): measure of the central tendency (middle value) Std DevStd Dev –Measures spread around the mean –Measure of variation or accuracy of measurement

1.1.2 Calculation of Mean and Std Dev Std Dev of sample = sStd Dev of sample = s is for the sample not the total populationis for the sample not the total population Pop 1. Mean = 31.4Pop 1. Mean = 31.4 s = 5.7 Pop 2. Mean =41.6 s = 4.3Pop 2. Mean =41.6 s = 4.3

Graphing Mean and Std Dev: Error Bars Mean +/- 1 std devMean +/- 1 std dev no overlap between these two populationsno overlap between these two populations The question being considered is:The question being considered is: –Is there a significant difference between the two samples from different locations? oror –Are the differences in the two samples just due to chance selection?

Graphing Mean and Std Dev: Error Bars StdDev graph compares 68% of the population % begins to show that they look different. Range graph : misleads us to think the data may be similar misleads us to think the data may be similar

1.1.3 Standard deviation and the spread of values around the mean. 1. StdDev is a measure of how spread out the data values are from the mean. 2. Assume: 1.normal distribution of values around the mean 2.data not skewed to either end 3. 68% of all the data values in a sample can be found between the mean +/- 1 standard deviation

Science/c4b/1/stat1.htm#gastro Animation of mean and standard deviation

1.1.3 Standard deviation and the spread of values around the mean % of all the data values in a sample can be found between the mean + 2s and the mean -2s.

1.1.4 Comparing means and standard deviations of 2 or more samples. Sample w/ small StdDev suggests narrow variation Sample w/ larger StdDev suggests wider variation Example: molluscs Pop 1. Mean = 31.4 Standard deviation(s)= 5.7 Pop 1. Mean = 31.4 Standard deviation(s)= 5.7 Pop 2. Mean =41.6 Standard deviation(s) = 4.3 Pop 2. Mean =41.6 Standard deviation(s) = 4.3

1.1.4 Comparing means and standard deviations of 2 or more samples. Pop 2 has a greater mean shell length but slightly narrower variation. Why this is the case would require further observation and experiment on environmental and genetic factors.

1.1.5 Comparing 2 samples with t-Test Null Hypothesis: There is no significant difference between the two samples except as caused by chance selection of data. There is no significant difference between the two samples except as caused by chance selection of data. OR OR Alternative hypothesis: There is a significant difference between the height of shells in sample A and sample B. There is a significant difference between the height of shells in sample A and sample B.

1.1.5 Comparing 2 samples with t-Test For the examples you'll use in biology, tails is always 2, and type can be: 1, paired 2,Two samples equal variance 3, Two samples unequal variance

Good idea to graph it Bar chartBar chart Error barsError bars StatsStats

T-test: Are the mollusc shells from the two locations significantly different? T-test tells you the probability (P) that the 2 sets are basically the same. (null hypothesis)T-test tells you the probability (P) that the 2 sets are basically the same. (null hypothesis) P varies from 0 (not likely) to 1 (certain).P varies from 0 (not likely) to 1 (certain). –higher P = more likely that the two sets are the same, and that any differences are just due to random chance. –lower P = more likely that that the two sets are significantly different, and that any differences are real.

T-test: Are the mollusc shells from the two locations significantly different? In biology the critical P is usually 0.05 (5%) (biology experiments are expected to produce quite varied results)In biology the critical P is usually 0.05 (5%) (biology experiments are expected to produce quite varied results) –If P > 5% then the two sets are the same (i.e. accept the null hypothesis).(i.e. accept the null hypothesis). –If P < 5% then the two sets are different (i.e. reject the null hypothesis).(i.e. reject the null hypothesis). For t test, # replicates as large as possibleFor t test, # replicates as large as possible –At least > 5

Drawing Conclusions 1. State null hypothesis & alternative hypothesis (based on research ?) 2. Set critical P level at P=0.05 (5%) 3. Write the decision rule If P > 5% then the two sets are the same (i.e. accept the null hypothesis). If P > 5% then the two sets are the same (i.e. accept the null hypothesis). If P < 5% then the two sets are different (i.e. reject the null hypothesis). If P < 5% then the two sets are different (i.e. reject the null hypothesis). 4. Write a summary statement based on the decision. The null hypothesis is rejected since calculated P = (< 0.05; two-tailed test). The null hypothesis is rejected since calculated P = (< 0.05; two-tailed test). 5. Write a statement of results in standard English. There is a significant difference between the height of shells in sample A and sample B. There is a significant difference between the height of shells in sample A and sample B.

1.1.6 Correlation & Causation Sometimes youre looking for an association between variables.Sometimes youre looking for an association between variables. Correlations see if 2 variables vary togetherCorrelations see if 2 variables vary together +1 = perfect positive correlation 0 = no correlation 0 = no correlation -1 = perfect negative correlation -1 = perfect negative correlation Relations see how 1 variable affects anotherRelations see how 1 variable affects another

Pearson correlation (r) Data are continuous & normally distributedData are continuous & normally distributed

Spearmans rank-order correlation (r s) Data are not continuous & normally distributedData are not continuous & normally distributed Usually scatterplot for either type of correlationUsually scatterplot for either type of correlation both correlation coefficients indicate a strong + corr.both correlation coefficients indicate a strong + corr. –large females pair with large males –Dont know why, but it shows there is a correlation to investigate further.

Causative: Use linear regression Fits a straight line to dataFits a straight line to data Gives slope & interceptGives slope & intercept –m and c in the equation y = mx + c Doesnt PROVE causation, but suggests it...need further investigation!