Download presentation
Presentation is loading. Please wait.
Published byImogen Flynn Modified over 9 years ago
1
Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang
2
Outline Why use statistical model Target ▫Gene expression Binomial distribution ▫Poisson distribution Over dispersion Negative binomial ▫Chi-square approximation Conclusion
3
Statistics model A statistical model is a probability distribution constructed to enable inferences to be drawn or decisions made from data. Population sample Inference Make a decision : Hypothesis testing designer consumer We have to choose a statistics model for sample (mean, variance) We (mean, variance) size
4
Target Gene expression ▫We like to use statistical model to test an observed difference in read counts is significant. Look like a significant region How about this Can we sure ? Noise or not
5
Count data A type of data in which the observations can take only the non-negative integer values {0, 1, 2, 3,...}, and where these integers arise from counting rather than ranking. An individual piece of count data is often termed a count variable. Binomial Poisson Negative binomial All of them are this type
6
Binomial distribution
7
33 goals 110 shots in this season Success : 0.3 Fail : 0.7 What is the probability if he scored 6 goals in 10 shots
8
Binomial distribution 0 1 2 3 4 5 6 7 8 9 10 6
9
Poisson distribution
11
e = 2.718281828…
12
Poisson Games goals Goals of game01234567 Poisson0.51.62.5 1.81.10.60.2 Raw data12222011
13
The presence of greater variability (statistical dispersion) in a data set than would be expected based on a given simple statistical model. Overdispersion
14
Negative binomial
16
Parameter estimation
17
Approximate control limits Chi-square approximation
18
Example = 67.0
21
Conclusion Thanks for attention
22
Statistics model Suitable type ▫Which distribution should we use Parameters ▫Get some information from data Inference ▫What do we want to know ▫How could we make a decision Hypothesis testing
23
Statistics model Suitable type ▫Binomial distribution Parameters ▫n = 10, p = 0.7 Inference ▫2 successes
24
Multinomial distribution The analog of the Bernoulli distribution is the categorical distribution, where each trial results in exactly one of some fixed finite number k of possible outcomes. http://en.wikipedia.org/wiki/Multinomial_distr ibutionhttp://en.wikipedia.org/wiki/Multinomial_distr ibution
25
Trinomial distribution
26
Count data A type of data in which the observations can take only the non-negative integer values {0, 1, 2, 3,...}, and where these integers arise from counting rather than ranking. We tend to use fixed fractions of genes. The probability that reads appeared in this region The number of read counts in this interval (Binomial distribution) (Poisson distribution)
28
Poisson example
29
Negative binomial
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.