Ultraconserved Elements in the Human Genome Bejerano, G., et.al. Katie Allen & Megan Mosher.

Slides:



Advertisements
Similar presentations
Genomics – The Language of DNA Honors Genetics 2006.
Advertisements

Introduction to genomes & genome browsers
Major insights from the HGP on Nature (2001) 15 th Feb Vol 409 special issue; pgs 814 & )Gene content 2)Proteome content 3)SNP identification.
20,000 GENES IN HUMAN GENOME; WHAT WOULD HAPPEN IF ALL THESE GENES WERE EXPRESSED IN EVERY CELL IN YOUR BODY? WHAT WOULD HAPPEN IF THEY WERE EXPRESSED.
Describe the structure of a nucleosome, the basic unit of DNA packaging in eukaryotic cells.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Section 8.6: Gene Expression and Regulation
Genome Evolution. Amos Tanay 2009 Genome evolution Lecture 10: Comparative genomics, non coding sequences.
[Bejerano Aut08/09] 1 MW 11:00-12:15 in Beckman B302 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
[Bejerano Aut07/08] 1 MW 11:00-12:15 in Redwood G19 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
[Bejerano Fall10/11] 1 Any Project reflections?
Profs: Serafim Batzoglou, Gill Bejerano TAs: Cory McLean, Aaron Wenger
[Bejerano Fall09/10] 1 Milestones due today. Anything to report?
Defining the Regulatory Potential of Highly Conserved Vertebrate Non-Exonic Elements Rachel Harte BME230.
Comparative Genomics and Evolution Pollard, K.S., et al., Forces Shaping the Fastest Evolving Regions in the Human Genome. PLoS Genetics 2(10), McLean,
From Gene to Protein. Genes code for... Proteins RNAs.
Lecture 12 Splicing and gene prediction in eukaryotes
[Bejerano Fall10/11] 1.
BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
What is genomics? Study of genomes. What is the genome? Entire genetic compliment of an organism.
BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG
Chapter 19: Eukaryotic Genomes Most gene expression regulated through transcription/chromatin structure Most gene expression regulated through transcription/chromatin.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Eukaryotic Gene Control. Developmental pathways of multicellular organisms: All cells of a multicellular organism start with the same complement of DNA.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Click to edit Master title style Click to edit Master subtitle style CLICKER QUESTIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry,
Comparative genomics analysis of NtcA regulons in cyanobacteria: Regulation of nitrogen assimilation and its coupling to photosynthesis Wen-Ting Huang.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Protein Synthesis. Transcription DNA  mRNA Occurs in the nucleus Translation mRNA  tRNA  AA Occurs at the ribosome.
Phylogenomics “The intersection of phylogenetics and genomics”
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Proposed redefinition of “gene” requires it to have a biological role Gerstein MB, …, Snyder M Genome Res 17: example of complexities observed.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Today Elements of complex genomes Protein domains and exon shuffling
Pre-mRNA secondary structures influence exon recognition Michael Hiller Bioinformatics Group University of Freiburg, Germany.
Comparative Genomics Methods for Alternative Splicing of Eukaryotic Genes Liliana Florea Department of Computer Science Department of Biochemistry GWU.
.1Sources of DNA and Sequencing Methods.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 2 Genome Assembly.
MPL The DNA Sequence of chimpanzee chromosome 22 and comparative analysis with its human ortholog, chromosome 21 Bioinformatics Dae-Soo Kim.
Chapter 3 The Interrupted Gene.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Gene Regulation In 1961, Francois Jacob and Jacques Monod proposed the operon model for the control of gene expression in bacteria. An operon consists.
Comparative Genomics I: Tools for comparative genomics
Exam #1 is T 2/17 in class (bring cheat sheet). Protein DNA is used to produce RNA and/or proteins, but not all genes are expressed at the same time or.
G ENE R EGULATION. H OW DOES A CELL KNOW ? which gene to EXPRESS & which gene should stay SILENT?
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used for.
Chapter 11 Review. Explain the difference between each of the following 1. Operator, promoter -Operator: DNA segment where an inhibitor protein binds.
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
KEY CONCEPT Gene expression is carefully regulated in both prokaryotic and eukaryotic cells. Chapter 11 – Gene Expression.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
RNA MODIFICATION Eukaryotic mRNA molecules are modified before they exit the nucleus.
Protein Synthesis. One Gene – One Enzyme Protein Synthesis.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used.
Evolution of gene function
Basics of Comparative Genomics
Exam #1 W 9/26 at 7-8:30pm in UTC 2.102A Review T 9/25 at 5pm in WRW 102 and in class 9/26.
Genomes and Their Evolution
Genomes and Their Evolution
SGN23 The Organization of the Human Genome
Recitation 7 2/4/09 PSSMs+Gene finding
What are the Patterns Of Nucleotide Substitution Within Coding and
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Ultraconserved Elements in the Human Genome
Evolution of eukaryote genomes
Gene Density and Noncoding DNA
Basics of Comparative Genomics
.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 3 Gene Prediction and Annotation 4 Genome Structure 5 Genome.
Presentation transcript:

Ultraconserved Elements in the Human Genome Bejerano, G., et.al. Katie Allen & Megan Mosher

Ultraconserved Elements  Segments longer than 200 base pairs that are absolutely conserved, showing 100% identity with no insertions or deletions, between orthologous regions of the human, mouse, and rat genomes  481 such segments

Purpose  To determine the longest segments of the human genome that are maximally conserved (considered ultraconserved based on the prior definition) with orthologous segments in rodents

Location of U.C.E.s  Generally located in genes involving RNA processing or near genes involved in the regulation of transcription or development  Widely distributed  Often found in clusters

 ~ 5% of the human genome is more conserved than would be expected based on neutral evolution since the split with rodents  These highly conserved segments contain a large number of non-coding elements  They exhibit almost no natural variation within the human population  The probability of finding one such element in 2.9 billion bases is less than under a neutral evolution model

Location of U.C.E.s on the Genes

 Nearly all of these ultraconserved elements have been under extreme negative selection for more than 300 million years  The low level of variation suggests that these elements are changing at a rate roughly 20 times slower than the average for the genome

Of the 481 Ultra Conserved Elements:  111 are exonic – overlap the mRNA of a known human protein coding gene  256 are non-exonic – show no evidence of transcription  114 are possibly exonic

Exonic Non-Exonic Exonic Non-Exonic  Randomly distributed around the genome  Specifically associated with RNA processing  Congregate in clusters near transcription factors and developmental genes  Regulate transcription at the DNA level  Often found in “gene deserts”

Genes that overlap with U.C.E.s  Type 1 – overlap with exonic u.c.e.s - show enrichment for RNA binding and regulation of splicing - show enrichment for RNA binding and regulation of splicing - abundant in RNA recognition motif - abundant in RNA recognition motif

 Type 2 – near non-exonic u.c.e.s - enriched for regulation of transcription and DNA binding - enriched for regulation of transcription and DNA binding * Genes that flank intergenic ultraconserved elements are enriched for developmental genes, suggests that many u.c.e.s may be distal enhancers of early developmental genes

PTPB2 (Type 1 Gene)  Mostly intronic u.c.e.  May form an RNA structure that participates in the regulation of splicing through interactions with the splicesome  When this u.c.e. was folded into a secondary structure its energy was lower than all but 1 of 10,000 randomized versions of this sequence

“Flip” and “Flop” Exons  Exonic ultraconserved elements  Exhibit RNA editing and alternative splicing  Regulates the editing of adenosine to inosine

 The longest 3 ultraconserved elements are 779, 720, and 731bp long  All lie in the last three introns in the POLA gene – the DNA polymerase alpha-catalytic subunit on X  May be associated with the ARX gene  A similar u.c.e. lies near the ARX homeobox gene – involved in CNS development, associated with epilepsy, mental retardation, and cerebral malformations

Evolution of Ultraconserved Elements

Only 5% of the orthologs of u.c.e.s could be partially traced back to C. intestinalis, Drosiphilia, and C. elegans  All overlapped with coding exons  17 of 24 were alternatively spliced in humans  No case where an intronic element in humans was coding in any other species, showing intron has a function other than protein coding

 In cases where it could be traced beyond vertebrates, the orthologous introns in the more distant species were either very small or non-existent  It is possible that processes that produced ultra conserved elements in vertebrates also existed in other species  i.e. yeast

Paralogous Sets  12 paralogous sets of genes were found in the u.c.e.s  All paralogs have a highly conserved match in the chicken  Shows that significant divergence between paralogs in each cluster must have occurred in the early part of their evolution

“Near-freezing”  Most u.c.e.s represent chordate innovations that evolved rapidly but then slowed down considerably, becoming “near-frozen”  A significant number of shorter elements are different in birds but conserved in mammals – suggested that evolution followed by “near-freezing” is ongoing

 The conservation among u.c.e.s must result from a highly negative selection rate, a highly reduced mutation rate, or a combination of both  The problem with maintenance selection is that it does not result in total conservation unless multiple functions are overlaid on the same DNA

 Reduced mutation seems like a novel reason because it means the existence of regions of a few hundred bases with a 20-fold mutation rate reduction  There is no evidence however for hypomutable or hyper-repaired regions

Conclusion  Ultraconserved elements are important for organism development and gene regulation  Ultraconserved elements evolved quickly and have become “near-frozen”  This evolution seems to be ongoing  Conservation seems to have arisen from increased negative selection or decreased mutation rate