A Little Intro to Statistics What’s the chance of rolling a 6 on a dice? 1/6 What’s the chance of rolling a 3 on a dice? 1/6 Rolling 11 times and not getting.

Slides:



Advertisements
Similar presentations
KEY CONCEPT A population shares a common gene pool.
Advertisements

Linkage analysis Nasir Mehmood Roll No: Linkage analysis Linkage analysis is statistical method that is used to associate functionality of genes.
Section 5.3 ~ The Central Limit Theorem Introduction to Probability and Statistics Ms. Young ~ room 113.
Statistics review of basic probability and statistics.
Sampling distributions of alleles under models of neutral evolution.
Probability - 1 Probability statements are about likelihood, NOT determinism Example: You can’t say there is a 100% chance of rain (no possibility of.
Additional Topics for Exam 2 Week 6, Wednesday. Is the Binomial Model Appropriate? Situation #1: “How likely is it that in a group of 120 the majority.
Working with Random Variables. What is a Random Variable? A random variable is a variable that has a numerical value which arises by chance (ie – from.
Coalescence with Mutations Towards incorporating greater realism Last time we discussed 2 idealized models – Infinite Alleles, Infinite Sites A realistic.
Preview What does Recombination do to Sequence Histories. Probabilities of such histories. Quantities of interest. Detecting & Reconstructing Recombinations.
N-gene Coalescent Problems Probability of the 1 st success after waiting t, given a time-constant, a ~ p, of success 5/20/2015Comp 790– Continuous-Time.
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
MA 102 Statistical Controversies Monday, April 1, 2002 Today: Randomness and probability Probability models and rules Reading (for Wednesday): Chapter.
1 Trust and divorce Separated or Divorced trust | No Yes | Total Low | | 247 | |
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
Chapter Sampling Distributions and Hypothesis Testing.
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
Today Today: Chapter 8 Assignment: 5-R11, 5-R16, 6-3, 6-5, 8-2, 8-8 Recommended Questions: 6-1, 6-2, 6-4, , 8-3, 8-5, 8-7 Reading: –Sections 8.1,
11.1 Genetic Variation Within Population KEY CONCEPT A population shares a common gene pool.
The Central Limit Theorem For simple random samples from any population with finite mean and variance, as n becomes increasingly large, the sampling distribution.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
DNA Analysis Techniques for Molecular Genealogy Luke Hutchison Project Supervisor: Scott R. Woodward.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
1 More about the Sampling Distribution of the Sample Mean and introduction to the t-distribution Presentation 3.
1 Theoretical Physics Experimental Physics Equipment, Observation Gambling: Cards, Dice Fast PCs Random- number generators Monte- Carlo methods Experimental.
Extensions to Basic Coalescent Chapter 4, Part 2.
Lecture 3: population genetics I: mutation and recombination
Sampling Distributions Adapted from Exploring Statistics with the TI-83 by Gail Burrill, Patrick Hopfensperger, Mike Koehler.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Models and their benefits. Models + Data 1. probability of data (statistics...) 2. probability of individual histories 3. hypothesis testing 4. parameter.
Getting Parameters from data Comp 790– Coalescence with Mutations1.
Stat 1510: Sampling Distributions
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
How likely is it that…..?. The Law of Large Numbers says that the more times you repeat an experiment the closer the relative frequency of an event will.
Population genetics. coalesce 1.To grow together; fuse. 2.To come together so as to form one whole; unite: The rebel units coalesced into one army to.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Coalescent theory CSE280Vineet Bafna Expectation, and deviance Statements such as the ones below can be made only if we have an underlying model that.
Practice Page # 21 Practice X = Stanford-Binet Y = WAIS b =.80 (15 / 16) =.75 a = 100 – (.75)100 = 25 Y = 25 + (.75)X = 25 + (.75)65 It’s.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Fixed Parameters: Population Structure, Mutation, Selection, Recombination,... Reproductive Structure Genealogies of non-sequenced data Genealogies of.
What is the probability of picking an ace? Probability =
PROBABILITY DISTRIBUTIONS DISCRETE RANDOM VARIABLES OUTCOMES & EVENTS Mrs. Aldous & Mr. Thauvette IB DP SL Mathematics.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Intro CS – Probability and Random Numbers Lesson Plan 6a.
Warm Up The average amount of meat that an American consumes per year is lbs. Assume that the standard deviation is 25 and that the distribution.
Do Now Find the standard score (z-score) using the given information Mean=785 Standard deviation=134 Observed Score = 614 z = What percentile is.
KEY CONCEPT A population shares a common gene pool.
Sampling Distributions
Practice A research was interested in the relation between stress and humor. Below are data from 8 subjects who completed tests of these two traits.
Statistical Inference for the Mean Confidence Interval
KEY CONCEPT A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
Chapter 18 – Central Limit Theorem
KEY CONCEPT A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
Introduction to Probability & Statistics Expectations
Sampling Distributions
KEY CONCEPT A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
A population shares a common gene pool.
KEY CONCEPT A population shares a common gene pool.
Presentation transcript:

A Little Intro to Statistics What’s the chance of rolling a 6 on a dice? 1/6 What’s the chance of rolling a 3 on a dice? 1/6 Rolling 11 times and not getting a 6? (5/6) 11 ~ 13.4% Rolling 11 times and not getting a 3? (5/6) 11 ~ 13.4% In a pile of Organic Chemistry Exams, what’s the chance of seeing a score between 75 and 78%? What about between 0 and 3%? What’s the chance of cancer cells surviving after being exposed to radiation for a period of time? –Period of time is short. –Period of time is long.

Examples of Different Types of Probability Distribution Exponential (i.e. survivability) Uniform (i.e. dice role) Normal (i.e. exam scores)

Sampling from a Distribution Take exam scores that are normally distributed Take a pile of exams Randomly pick one Record the score Do this enough times, and you’ll see a normal distribution  Can simulate taking samples with any known distribution. 

Recombination 2 Locus Example gamete Locus b Locus a Recombination location

2-Locus Recombination Example 1.Determine amount of time (backwards, in generations) when an event occurs. Locus a Locus b gamete 1gamete 2 t1t1 t 1  sampled from exponential distribution w/ mean 2N/(1+R) where R=4Nr, N is effective population size and r per generation per offspring is recombination rate. This comes from doing the statistics of taking into account that two events occur: coalescence and recombination. present past

2-Locus Recombination Example 2. Determine what type of event occurred Here, the events possible are coalescence or recombination. Locus a Locus b gamete 1gamete 2 t1t1 P coalesc = 1/(2N) P recomb = 2r Let’s consider that a recombination event occurs on gamete R R. 1 + R where R=4Nr, N is effective population size and r per generation per offspring is recombination rate. present past

2-Locus Recombination Example 3. Repeat... Determine t n, then determine type of event. Locus a Locus b gamete 1gamete 2 t1t1 Say next a coalescence event occurs between gamete 1 and locus a of gamete 2. P coalesc = 3 choose 2 /(2N) P recomb = r t 2  sampled from exponential distribution t2t2 CA a present past

2-Locus Recombination Example Repeat until you find MRCA (most recent common ancestor) for all samples at all loci. Locus a Locus b gamete 1gamete 2 t1t1 t2t2 CA a CA b present t3t3 past  Note that the genealogies for locus a is different than locus b 

Introduction of Mutations Locus a Locus b gamete 1gamete 2 t1t1 t2t2 CA a CA b x x xx present t3t3

Infinite site Model Now recombinations can occur at any location  Can use same procedure as the two loci, just use appropriate probabilities  Slightly different model: 1 kb region, 999 locations of recombination