Download presentation
Presentation is loading. Please wait.
1
Short Primer on Comparative Genomics Today: Special guest lecture 12pm, Alway M108 Comparative genomics of animals and plants Adam Siepel Assistant Professor of Biological Statistics and Computational Biology Cornell University
2
Evolution at the DNA level …ACGGTGCAGTTACCA… …AC----CAGTCCACCA… Mutation SEQUENCE EDITS REARRANGEMENTS Deletion Inversion Translocation Duplication
3
Orthology and Paralogy HB Human WB Worm HA1 Human HA2 Human Yeast WA Worm Orthologs: Derived by speciation Paralogs: Everything else Orthologs: Derived by speciation Paralogs: Everything else
4
Orthology, Paralogy, Inparalogs, Outparalogs
5
Synteny maps Comparison of human and mouse
6
Synteny maps
7
Building synteny maps Recommended local aligners BLASTZ Most accurate, especially for genes Chains local alignments WU-BLAST Good tradeoff of efficiency/sensitivity Best command-line options BLAT Fast, less sensitive Good for comparing very similar sequences finding rough homology map
8
Index-based local alignment Dictionary: All words of length k (~10) Alignment initiated between words of alignment score T (typically T = k) Alignment: Ungapped extensions until score below statistical threshold Output: All local alignments with score > statistical threshold …… query DB query scan Question: Using an idea from overlap detection, better way to find all local alignments between two genomes?
9
Local Alignments
10
After chaining
11
Chaining local alignments 1.Find local alignments 2.Chain -O(NlogN) L.I.S. 3.Restricted DP
12
Progressive Alignment When evolutionary tree is known: Align closest first, in the order of the tree In each step, align two sequences x, y, or profiles p x, p y, to generate a new alignment with associated profile p result Weighted version: Tree edges have weights, proportional to the divergence in that edge New profile is a weighted average of two old profiles x w y z
13
Threaded Blockset Aligner Human–Cow HMR – CD Restricted Area Profile Alignment
14
Reconstructing the Ancestral Mammalian Genome Human: C Baboon: C Cat: C Dog: G C C or G G
15
Neutral Substitution Rates
16
Finding Conserved Elements (1) Binomial method 25-bp window in the human genome Binomial distribution of k matches in N bases given the neutral probability of substitution
17
Finding Conserved Elements (2) Parsimony Method Count minimum # of mutations explaining each column Assign a probability to this parsimony score given neutral model Multiply probabilities across 25-bp window of human genome A C A A G
18
Finding Conserved Elements
19
Finding Conserved Elements (3) GERP
20
Phylo HMMs HMM Phylogenetic Tree Model Phylo HMM
21
Finding Conserved Elements (3)
22
How do the methods agree/disagree?
23
Statistical Power to Detect Constraint L N C: cutoff # mutations D: neutral mutation rate : constraint mutation rate relative to neutral
24
Statistical Power to Detect Constraint L N C: cutoff # mutations D: neutral mutation rate : constraint mutation rate relative to neutral
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.