Statistics allow biologists to support the findings of their experiments.

Slides:



Advertisements
Similar presentations
Statistical Analysis IB Diploma Biology Modified by Christopher Wilkinson from Stephen Taylor Image: 'Hummingbird Checks Out Flower'
Advertisements

Statistical Analysis IB Diploma BiologyIB Diploma Biology (HL/SL)
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
How Science Works Glossary AS Level. Accuracy An accurate measurement is one which is close to the true value.
1 STATISTICS!!! The science of data. 2 What is data? Information, in the form of facts or figures obtained from experiments or surveys, used as a basis.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Answering questions about life with statistics ! The results of many investigations in biology are collected as numbers known as _____________________.
Statistical Tests. Data Analysis Statistics - a powerful tool for analyzing data 1. Descriptive Statistics - provide an overview of the attributes of.
TOPIC 1 STATISTICAL ANALYSIS
Statistical Analysis Statistical Analysis
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Topic 1: Statistical Analysis
STATISTICS!!! The science of data. What is data? Information, in the form of facts or figures obtained from experiments or surveys, used as a basis for.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
STEM Fair Graphs & Statistical Analysis. Objectives: – Today I will be able to: Construct an appropriate graph for my STEM fair data Evaluate the statistical.
Statistical Tests.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
Beak of the Finch Natural Selection Statistical Analysis.
Statistical Analysis IB Diploma Biology Stephen Taylor Image: 'Hummingbird Checks Out Flower'
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Statistical Analysis Topic – Math skills requirements.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
STATISTICS!!! The science of data.
Experimental Psychology PSY 433 Appendix B Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Statistical Analysis Topic – Math skills requirements.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
Statistical Analysis Image: 'Hummingbird Checks Out Flower'
Statistical analysis Why?? (besides making your life difficult …)  Scientists must collect data AND analyze it  Does your data support your hypothesis?
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Excel How To Mockingbird Example BIO II Van Roekel.
MAKING MEANING OUT OF DATA Statistics for IB-SL Biology.
Advanced Higher Biology Unit 3 Investigative Biology.
Statistical Analysis adapted from the work of Stephen Taylor.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical Inferences for Variance Objectives: Learn to compare variance of a sample with variance of a population Learn to compare variance of a sample.
The Data Collection and Statistical Analysis in IB Biology John Gasparini The Munich International School Part II – Basic Stats, Standard Deviation and.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
Outline Sampling Measurement Descriptive Statistics:
AP Biology Intro to Statistics
Statistical Analysis IB Diploma Biology Stephen Taylor
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
Statistics (0.0) IB Diploma Biology
STEM Fair: Statistical Analysis
Statistical Analysis - IB Biology - Mark Polko
Inferential statistics,
STATISTICAL ANALYSIS.
EXAMPLES OF STATS FUNCTIONS
Statistical Analysis Determining the Significance of Data
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
STEM Fair Graphs & Statistical Analysis
TOPIC 1: STATISTICAL ANALYSIS
Statistical Analysis Error Bars
Topic 1 and Data analysis
STATISTICS Topic 1 IB Biology Miss Werba.
Writing the IA Report: Analysis and Evaluation
STATISTICAL ANALYSIS.
1.1 Statistical Analysis.
Presentation transcript:

Statistics allow biologists to support the findings of their experiments.

“Why is this Biology?” Variation in populations. Variability in results. affects Confidence in conclusions. The key methodology in Biology is hypothesis testing through experimentation. Carefully-designed and controlled experiments and surveys give us quantitative (numeric) data that can be compared. We can use the data collected to test our hypothesis and form explanations of the processes involved… but only if we can be confident in our results. We therefore need to be able to evaluate the reliability of a set of data and the significance of any differences we have found in the data. Image: 'Transverse section of part of a stem of a Dead-nettle (Lamium sp.) showing+a+vascular+bundle+and+part+of+the+cortex' Found on flickrcc.net

“Which medicine should I prescribe?” Image from: Donate to Medecins Sans Friontiers through Biology4Good:

“Which medicine should I prescribe?” Image from: Donate to Medecins Sans Friontiers through Biology4Good: Generic drugs are out-of-patent, and are much cheaper than the proprietary (brand-name) equivalents. Doctors need to balance needs with available resources. Which would you choose?

Hummingbirds are nectarivores (herbivores that feed on the nectar of some species of flower). In return for food, they pollinate the flower. This is an example of mutualism – benefit for all. As a result of natural selection, hummingbird bills have evolved. Birds with a bill best suited to their preferred food source have the greater chance of survival. Photo: Archilochus colubris, from wikimedia commons, by Dick Daniels.wikimedia commonsDick Daniels

Researchers studying comparative anatomy collect data on bill-length in two species of hummingbirds: Archilochus colubris (red-throated hummingbird) and Cynanthus latirostris (broadbilled hummingbird). To do this, they need to collect sufficient relevant, reliable data so they can test the Null hypothesis (H 0 ) that: “there is no significant difference in bill length between the two species.” Photo: Archilochus colubris (male), wikimedia commons, by Joe Schneidwikimedia commons

The Null hypothesis presumes that there is NO STATISTICAL DIFFERENCE between the two samples. The ALTERNATIVE hypothesis presumes that there is a STATISTICAL DIFFERENCE between the two samples. The t-test provides a probability that the two samples are the same. A P < 0.05 is accepted as a low enough probability of sameness to reject the NULL hypothesis.

The sample size must be large enough to provide sufficient reliable data and for us to carry out relevant statistical tests for significance. We must also be mindful of uncertainty in our measuring tools and error in our results. Photo: Broadbilled hummingbird (wikimedia commons).wikimedia commons

The mean is a measure of the central tendency of a set of data. Table 1: Raw measurements of bill length in A. colubris and C. latirostris. Bill length (±0.1mm) nA. colubrisC. latirostris Mean s Calculate the mean using: Your calculator (sum of values / n) Excel =AVERAGE(highlight raw data) n = sample size. The bigger the better. In this case n=10 for each group. All values should be centred in the cell, with decimal places consistent with the measuring tool uncertainty.

Standard deviation is a measure of the spread of most of the data. Table 1: Raw measurements of bill length in A. colubris and C. latirostris. Bill length (±0.1mm) nA. colubrisC. latirostris Mean s Standard deviation can have one more decimal place. =STDEV (highlight RAW data). Which of the two sets of data has: a.The longest mean bill length? a.The greatest variability in the data?

Standard deviation is a measure of the spread of most of the data. Table 1: Raw measurements of bill length in A. colubris and C. latirostris. Bill length (±0.1mm) nA. colubrisC. latirostris Mean s Standard deviation can have one more decimal place. =STDEV (highlight RAW data). Which of the two sets of data has: a.The longest mean bill length? a.The greatest variability in the data? C. latirostris A. colubris

Standard deviation is a measure of the spread of most of the data. Error bars are a graphical representation of the variability of data. Which of the two sets of data has: a.The highest mean? a.The greatest variability in the data? A B Error bars could represent standard deviation, range or confidence intervals.

The overlap of a set of error bars gives a clue as to the significance of the difference between two sets of data. Large overlap No overlap Lots of shared data points within each data set. Results are not likely to be significantly different from each other. Any difference is most likely due to chance. No (or very few) shared data points within each data set. Results are more likely to be significantly different from each other. The difference is more likely to be ‘real’.

Our results show a very small overlap between the two sets of data. So how do we know if the difference is significant or not? We need to use a statistical test. The t-test is a statistical test that helps us determine the significance of the difference between the means of two sets of data.

The Null Hypothesis (H 0 ): “There is no significant difference.” This is the ‘default’ hypothesis that we always test. In our conclusion, we either accept the null hypothesis or reject it. A t-test can be used to test whether the difference between two means is significant. If we accept H 0, then the means are not significantly different. If we reject H 0, then the means are significantly different. Remember: We are never ‘trying’ to get a difference. We design carefully-controlled experiments and then analyse the results using statistical analysis.

Excel can jump straight to a value of P for our results. One function (=ttest) compares both sets of data. As it calculates P directly (the probability that the difference is due to chance), we can determine significance directly. In this case, P= This is much smaller than 0.005, so we are confident that we can: reject H 0. The difference is unlikely to be due to chance. Conclusion: There is a significant difference in bill length between A. colubris and C. latirostris.

Two tails: we assume data are normally distributed, with two ‘tails’ moving away from mean. Type 2 (unpaired): we are comparing one whole population with the other whole population. (Type 1 pairs the results of each individual in set A with the same individual in set B).

Cartoon from: Correlation does not imply causation, but it does waggle its eyebrows suggestively and gesture furtively while mouthing "look over there."

Correlation does not imply causality. Pirates vs global warming, from

Correlation does not imply causality. Pirates vs global warming, from Where correlations exist, we must then design solid scientific experiments to determine the cause of the relationship. Sometimes a correlation exist because of confounding variables – conditions that the correlated variables have in common but that do not directly affect each other. To be able to determine causality through experimentation we need: One clearly identified independent variable Carefully measured dependent variable(s) that can be attributed to change in the independent variable Strict control of all other variables that might have a measurable impact on the dependent variable. We need: sufficient relevant, repeatable and statistically significant data. Some known causal relationships: Atmospheric CO 2 concentrations and global warming Atmospheric CO 2 concentrations and the rate of photosynthesis Temperature and enzyme activity

Flamenco Dancer, by Steve Corey