Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang.

Slides:



Advertisements
Similar presentations
MOMENT GENERATING FUNCTION AND STATISTICAL DISTRIBUTIONS
Advertisements

Discrete Uniform Distribution
Sampling: Final and Initial Sample Size Determination
Statistics review of basic probability and statistics.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
1 Bernoulli and Binomial Distributions. 2 Bernoulli Random Variables Setting: –finite population –each subject has a categorical response with one of.
STAT 270 What’s going to be on the quiz and/or the final exam?
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Estimation of parameters. Maximum likelihood What has happened was most likely.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.
= == Critical Value = 1.64 X = 177  = 170 S = 16 N = 25 Z =
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
8-3 Testing a Claim about a Proportion
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Statistical Inference Lab Three. Bernoulli to Normal Through Binomial One flip Fair coin Heads Tails Random Variable: k, # of heads p=0.5 1-p=0.5 For.
Class notes for ISE 201 San Jose State University
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
10-Opening Picture of Probability and statistics 9.
Standard Statistical Distributions Most elementary statistical books provide a survey of commonly used statistical distributions. The reason we study these.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.
Modeling and Simulation CS 313
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
Distributions Normal distribution Binomial distribution Poisson distribution Chi-square distribution Frequency distribution
HW adjustment Q70 solve only parts a, b, and d. Q76 is moved to the next homework and added-Q2 is moved to the next homework as well.
Statistical Review We will be working with two types of probability distributions: Discrete distributions –If the random variable of interest can take.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Random Sampling, Point Estimation and Maximum Likelihood.
The Negative Binomial Distribution An experiment is called a negative binomial experiment if it satisfies the following conditions: 1.The experiment of.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Introduction to DESeq and edgeR packages Peter A.C. ’t Hoen.
Estimating a Population Proportion
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Introduction to Statistical Inference Jianan Hui 10/22/2014.
4.2 Binomial Distributions
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
1 Probability and Statistical Inference (9th Edition) Chapter 5 (Part 2/2) Distributions of Functions of Random Variables November 25, 2015.
Statistics 300: Elementary Statistics Section 11-2.
Psychology 202a Advanced Psychological Statistics September 29, 2015.
MATH 4030 – 11A ANALYSIS OF R X C TABLES (GOODNESS-OF-FIT TEST) 1.
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Probability Generating Functions Suppose a RV X can take on values in the set of non-negative integers: {0, 1, 2, …}. Definition: The probability generating.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Sampling and Sampling Distributions
Ch3.5 Hypergeometric Distribution
Understanding Sampling Distributions: Statistics as Random Variables
ESTIMATION.
IEE 380 Review.
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Lecture Slides Essentials of Statistics 5th Edition
Chapter Six Normal Curves and Sampling Probability Distributions
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
CHAPTER 6 Statistical Inference & Hypothesis Testing
CHAPTER 15 SUMMARY Chapter Specifics
Chapter 7: Introduction to Sampling Distributions
Statistical Inference for the Mean: t-test
Each Distribution for Random Variables Has:
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang

Outline Why use statistical model Target ▫Gene expression Binomial distribution ▫Poisson distribution Over dispersion Negative binomial ▫Chi-square approximation Conclusion

Statistics model A statistical model is a probability distribution constructed to enable inferences to be drawn or decisions made from data. Population sample Inference Make a decision : Hypothesis testing designer consumer We have to choose a statistics model for sample (mean, variance) We (mean, variance) size

Target Gene expression ▫We like to use statistical model to test an observed difference in read counts is significant. Look like a significant region How about this Can we sure ? Noise or not

Count data A type of data in which the observations can take only the non-negative integer values {0, 1, 2, 3,...}, and where these integers arise from counting rather than ranking. An individual piece of count data is often termed a count variable. Binomial Poisson Negative binomial All of them are this type

Binomial distribution

33 goals 110 shots in this season Success : 0.3 Fail : 0.7 What is the probability if he scored 6 goals in 10 shots

Binomial distribution

Poisson distribution

e = …

Poisson Games goals Goals of game Poisson Raw data

The presence of greater variability (statistical dispersion) in a data set than would be expected based on a given simple statistical model. Overdispersion

Negative binomial

Parameter estimation

Approximate control limits Chi-square approximation

Example = 67.0

Conclusion Thanks for attention

Statistics model Suitable type ▫Which distribution should we use Parameters ▫Get some information from data Inference ▫What do we want to know ▫How could we make a decision  Hypothesis testing

Statistics model Suitable type ▫Binomial distribution Parameters ▫n = 10, p = 0.7 Inference ▫2 successes

Multinomial distribution The analog of the Bernoulli distribution is the categorical distribution, where each trial results in exactly one of some fixed finite number k of possible outcomes. ibutionhttp://en.wikipedia.org/wiki/Multinomial_distr ibution

Trinomial distribution

Count data A type of data in which the observations can take only the non-negative integer values {0, 1, 2, 3,...}, and where these integers arise from counting rather than ranking. We tend to use fixed fractions of genes. The probability that reads appeared in this region The number of read counts in this interval (Binomial distribution) (Poisson distribution)

Poisson example

Negative binomial