Statistical Analysis IB Topic 1. IB assessment statements:  By the end of this topic, I can …: 1. State that error bars are a graphical representation.

Slides:



Advertisements
Similar presentations
Statistical Analysis HL Biology 2010.
Advertisements

Standard Error for AP Biology
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
IB B IOLOGY Q UARTER 1 E XAM R EVIEW IB topics 1 (statistical analysis) & 2.1 (cell theory)
Answering questions about life with statistics ! The results of many investigations in biology are collected as numbers known as _____________________.
TOPIC 1 STATISTICAL ANALYSIS
Statistical Analysis Statistical Analysis
Assessment Statements – State that error bars are a graphical representation of the variability of data – Calculate the mean and standard deviation.
STATISTICS For Research. 1. Quantitatively describe and summarize data A Researcher Can:
STATISTICS For Research. Why Statistics? 1. Quantitatively describe and summarize data A Researcher Can:
Data Collection & Processing Hand Grip Strength P textbook.
Topic 1: Statistical Analysis
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
STATISTICS!!! The science of data. What is data? Information, in the form of facts or figures obtained from experiments or surveys, used as a basis for.
Statistical Analysis.  State that Error bars are a graphical representation of the variability of data.  To answer an IB question involving simply.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
6.1 Statistical Analysis.
Understanding Numerical Data
Statistical Analysis Topic – Math skills requirements.
Research & Statistics Looking for Conclusions. Statistics Mathematics is used to organize, summarize, and interpret mathematical data 2 types of statistics.
Statistical Analysis Topic 1. Statistics State that error bars are a graphical representation of the variability of data Calculate the mean.
MATH IN THE FORM OF STATISTICS IS VERY COMMON IN AP BIOLOGY YOU WILL NEED TO BE ABLE TO CALCULATE USING THE FORMULA OR INTERPRET THE MEANING OF THE RESULTS.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Statistical Analysis IB Topic 1. Why study statistics?  Scientists use the scientific method when designing experiments  Observations and experiments.
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Statistical analysis. Types of Analysis Mean Range Standard Deviation Error Bars.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
More statistics notes!. Syllabus notes! (The number corresponds to the actual IB numbered syllabus.) Put the number down from the syllabus and then paraphrase.
Statistical Analysis Topic – Math skills requirements.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
Statistical Analysis Topic 1. Statistics State that error bars are a graphical representation of the variability of data Calculate the mean.
Data Analysis.
PCB 3043L - General Ecology Data Analysis.
Statistical analysis Why?? (besides making your life difficult …)  Scientists must collect data AND analyze it  Does your data support your hypothesis?
MAKING MEANING OUT OF DATA Statistics for IB-SL Biology.
Chapter 4 Exploring Chemical Analysis, Harris
STATISICAL ANALYSIS HLIB BIOLOGY TOPIC 1:. Why statistics? __________________ “Statistics refers to methods and rules for organizing and interpreting.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
The pictures of Statistics.  Central Tendencies -  Mean –  Median –  Mode -  Statistics -
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
Introductory Psychology: Statistical Analysis
Statistical analysis.
AP Biology Intro to Statistics
Standard Error for AP Biology
TYPES OF GRAPHS There are many different graphs that people can use when collecting Data. Line graphs, Scatter plots, Histograms, Box plots, bar graphs.
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
Statistical analysis.
PCB 3043L - General Ecology Data Analysis.
Standard Error for AP Biology
STATISTICS For Research
AP Biology Intro to Statistics
Module 8 Statistical Reasoning in Everyday Life
TOPIC 1: STATISTICAL ANALYSIS
Statistics in Science Data can be collected about a population (surveys) Data can be collected about a process (experimentation)
Statistical Analysis Error Bars
Statistical Analysis IB Topic 1.
Standard Error for AP Biology
STATISTICS Topic 1 IB Biology Miss Werba.
Statistics.
Statistical analysis.
Topic 1 Statistical Analysis.
Introduction to statistics August 19, 2014
5 Week Assessment Review
Data Literacy Graphing and Statisitics
Presentation transcript:

Statistical Analysis IB Topic 1

IB assessment statements:  By the end of this topic, I can …: 1. State that error bars are a graphical representation of the variability of data 2. Calculate the mean and standard deviation of a set of values 3. State that the term standard deviation is used to summarize the spread of values around the mean, and that 68% of values fall within one standard deviation of the mean 4. Explain how the standard deviation is useful for comparing the means and spread of data between two or more samples 5. Deduce the significance of the difference between two sets of data using calculated values for t and the appropriate tables 6. Explain that the existence of a correlation does not establish that there is a causal relationship between two variables

Why study statistics?  Scientists use the scientific method when designing experiments  Observations and experiments result in the collection of measurable data  Statistics is a branch of mathematics which allows us to sample small portions and draw conclusions about the larger population

Words to know and love …  Mean Average of data points Sum divided by the total  Range Measures the spread of data Difference between the largest and smallest Very large or small values are called outliers

More words …  Standard deviation (SD) A measure of how data are dispersed or spread around the mean Determined by mathematical formula (which you do NOT need to know) Use your calculator or online program  Error bars Graphical representation of the variability of data Error bars can show either the range of data OR the SD Look at Figures 1.1 and 1.2 in your book (pgs. 3, 4)

Standard Deviation  In normal distribution, about 68% of all values lie within +/- 1 SD of the mean  This rises to about 95% for +/- 2 SD from the mean  The SD tells us how tightly the data points are clustered around the mean Clustered together = small SD Spread out = large SD

Graphical Interpretation

Why is this useful?  SD tells you how many extremes are in the data  Questions: What is the shape of the graph of a normal distribution of data points? If there are 100 bean plants represented by the bell curve, how many will be within one standard deviation of the mean?

Comparing the means and spread of data between two or more samples  Open your book to page 6 and look at the data table for the bean plants  First, calculate the mean  Look at the data – how would you describe the values for both sets of data?  How can we quantify your observations about the variability of the data? Find the standard deviation Use your calculator  Don’t worry about the equation (unless you want to)

Options …  TI 83, 84 at/StatTI-83.html at/StatTI-83.html  TI 86 at/StatTI-86.html at/StatTI-86.html  Online calculator m m

Back to the bean plants  SD in sunlight = cm  SD in shade = cm  Looking at the means alone, it appears there is no difference between the two sets of data  However, the high SD of the plants grown in the shade tells us what? How confident can we be in the data? What conclusions can we draw about just looking at the mean?

Question… If all the data values are equal, such as 7, 7, 7, 7, what is the standard deviation of this set of four data points?

Answer  0, if all values are the same, there’s no deviation from the mean

Question  If the daily temperatures of a city A range from 10 *C to 30*C for one month, the mean temperature may be 20*C. Another city B may also have a mean temperature of 20*C for the same month. However, the range of city B is only 15*C to 25*C.  Which city has a temperature with a higher standard deviation?  Which city can give a more accurate prediction of weather and why?

Answer  City A has a higher standard deviation  City B since is has a very narrow range of temperature or a very low standard deviation

Significant difference and the t-test  The t-test is used to determine whether or not the difference between two sets of data is a significant (real) difference  We use a Table of t values (page 8) You do not need to memorize this! This is a tool scientists use

How to navigate the table  Probability (p) Bottom of the table (p) that chance alone could make a difference P = 0.50 = difference is due to chance 50% of the time  This is not a significant difference P = 0.05 = the probability that the difference is due to chance is only 5%  Statisticians are never 100% certain, but like to be at least 95% certain  Degrees of freedom Sum of sample sizes of each of the two groups minus two

Practice  Looking at the table …  If the degree of freedom is 9, and the given value of t is 2.60, the table indicates that the t value is just greater than  Looking at the bottom of the table, probability that chance alone could produce the result is only 5% This means there is a 95% chance that the difference is significant

Worked example 1.4  Compare two groups of barnacles living on a rocky shore. Measure the width of their shells to see if a significant size difference is found depending on how close they live to the water. One group lives between 0 and 10 meters above the water level. The second group lives between 10 and 20 meters above the water level  Measurement was taken of the width of the shells in mm. 15 shells were measured from each group. The mean of the group closer to the water indicates that living closer to the water causes barnacles to have a larger shell. If the value of t is 2.25, is that a significant difference?

Answer to worked example 1.4  The degree of freedom is 28 ( – 2)  2.25 is just above 2.05  Refer to table …  P = 0.05, so the probability that chance alone could produce that result is only 5%  The confidence level is 95%  We are 95% confident that the difference between the barnacles is significant  Barnacles living nearer the water have a significantly larger shell than those living 10 meters or more away from the water

Worked example 1.5  The heights of 16 yr old girls from the UK and US were compared. The means indicate that the British girls are taller than the US girls. The sample size from each group was 50 girls from each country. The calculated t is  What are the degrees of freedom used to determine the probability that the differences between the two groups are due to chance?  Using the given t value of 2.00 with your calculated degrees of freedom, what is the probability that chance alone can produce a difference in the heights of these girls?  How confident are we that the British girls are taller than the US girls based on this sample size?

Answers   P = 0.05 or 5%  95% confident

Correlation and Causation  Observing something can suggest correlation  Experiments provide a test which shows cause  Observations without an experiment can only show a correlation.

Question  For years we have known that there is a high positive correlation between smoking and lung cancer. Does this high positive correlation prove that smoking causes lung cancer?  How can the cause of lung cancer be determined?

Answer  No, it does not prove that smoking causes lung cancer. Only data collected from a well designed experiment can show cause  For many years, scientists have performed experiments to study the cause of the correlation which was originally observed. Currently, scientific evidence shows that smoking increases the chances of contracting lung cancer. Experiments show cause.

Africanized Honey Bees (AHB)  Are there any volunteers who can summarize the relationship between correlation and causation using this example?

Cormorants  Example of using a mathematical correlation test  The value of r = correlation +1 (complete positive correlation) to 0 (no correlation) to -1 (complete negative correlation)

Exit slip  Tear and share paper  Name, date, period upper right hand corner  Title: Exit Slip

Exit Slip  1. What is the shape of the graph of a normal distribution of data points?  2. If there are 100 bean plants represented by the bell curve, how many will be within one standard deviation of the mean?  3. What is standard deviation used for?  4. What is an error bar?  5. If p = 0.05, how confident can I be in my data? Is this an acceptable level of confidence in science?