Understanding the function of conserved non-coding regions in the human genome Sofie Salama – Haussler lab CS273A, November 17, 2008.

Slides:



Advertisements
Similar presentations
The Human Genome Project Main reference: Nature (2001) 409,
Advertisements

Methods to read out regulatory functions
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 18 LECTURE SLIDES.
Next lecture:techniques used to study the role of genes in develpoment Random genetics followed by screening Targeted mutagenesis (gene knockout) Transgenic.
1 Alternative Splicing. 2 Eukaryotic genes Splicing Mature mRNA.
[Bejerano Aut08/09] 1 MW 11:00-12:15 in Beckman B302 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
[Bejerano Fall10/11] 1 Thank you for the midterm feedback! Projects will be assigned shortly.
[Bejerano Fall10/11] 1 Any Project reflections?
Profs: Serafim Batzoglou, Gill Bejerano TAs: Cory McLean, Aaron Wenger
[Bejerano Fall09/10] 1 Milestones due today. Anything to report?
The Human Genome Project and 100 Million Years of Human Evolution
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
Chris Chander, Luke Adea BioSci D145 Feb. 12, 2015
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
[Bejerano Fall10/11] 1.
Positional cloning: the rest of the story a a a a a a a a X.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007.
Fine Structure and Analysis of Eukaryotic Genes
P300 Marks Active Enhancers Ruijuan LiChao HeRui Fu.
Ultraconserved Elements in the Human Genome Bejerano, G., et.al. Katie Allen & Megan Mosher.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
GenomesGenomes Chapter 21 Genomes Sequencing of DNA Human Genome Project countries 20 research centers.
“Recent next generation sequencing results” MACHADO LAB.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Chapter 21 Eukaryotic Genome Sequences
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
Pollard, KS et al. An RNA gene expressed during cortical development evolved rapidly in humans. Nature Aug Scanned the 2/3 portion of the genome.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
1Biol 466Toll-7 Project Determining the role of Toll-7 in Drosophila melanogaster through RNAi Biol466, Spring 2004 Cassandra Kleve.
The generalized transcription of the genome Víctor Gámez Visairas Genomics Course 2014/15.
LECTURE CONNECTIONS 19 | Molecular Genetic Analysis and © 2009 W. H. Freeman and Company Biotechnology.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MPL The DNA Sequence of chimpanzee chromosome 22 and comparative analysis with its human ortholog, chromosome 21 Bioinformatics Dae-Soo Kim.
Vertebrates Hair Mammary Glands Amniotic Egg Endothermy Four Limbs
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
The C3HC4-Type RING Zinc Finger and MYB Transcription Factor Families Matthew Taube June 5, 2008 HC70AL.
Biotechnology Techniques in Developmental Biology Ch. 5 - Gilbert pp
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Can genes help explain our evolution? - What type of changes (regulatory or structural mutations?) - How many genes are involved?
Accessing and visualizing genomics data
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
Katherine S. Pollard Gladstone Institutes, Institute for Human Genetics and Division of Biostatistics - UCSF What makes us human?
Who is smarter and does more tricks you or a bacteria? YouBacteria How does my DNA compare to a prokaryote? Show-off.
9/24/07BCB 444/544 F07 ISU Dobbs #14 - Review: Nucleus, Chromosomes, Genes, RNA, Protein1 BCB 444/544 Lecture 14 Review: Nucleus, Chromosomes, Genes, RNA,
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
Lhx2, a LIM homeobox gene, is required for eye, forebrain, and definitive erythrocyte development Introduction Lhx2 is a member of the LIM homeodomain.
BioForum - California Academy of Sciences
The Transcriptional Landscape of the Mammalian Genome
Genomes and Their Evolution
Alu insert, PV92 locus, chromosome 16
Genomes and Their Evolution
Structure of proximal and distant regulatory elements in the human genome Ivan Ovcharenko Computational Biology Branch National Center for Biotechnology.
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
Detection of the footprint of natural selection in the genome
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Analysis of the Human Ferrochelatase Promoter in Transgenic Mice
Gene Density and Noncoding DNA
Volume 21, Issue 3, Pages (October 2017)
Volume 21, Issue 3, Pages (October 2017)
Volume 16, Issue 8, Pages (August 2016)
Material for Quiz 5 from Chapter 8
Brain Evolution and Uniqueness in the Human Genome
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

Understanding the function of conserved non-coding regions in the human genome Sofie Salama – Haussler lab CS273A, November 17, 2008

Haussler Lab Dry lab – comparative genomics research Browser staff – UCSC genome browser, ENCODE data coordination center, 1000 genomes Wet lab - Experimental analysis of interesting human genomic regions

Origin of conserved non- coding regions and co- regulated gene networks Function of ultraconserved elements Discovery of novel non- coding RNA genes Detailed analysis of Human Accelerated Regions (HAR’s) Understanding the function of conserved non-coding regions in the human genome

How are we different from chimps? Brain anatomy –3X larger, especially cortex –More later developing neurons of the upper cortical layers projecting within the cortex –functional asymmetries What are the genotypic differences responsible for these phenotypic differences? Hill, R. S. & Walsh, C. A. Nature 437, 64–67 (2005)

Clues from comparative genomics Human vs. chimpanzee genome –Genomes are almost identical –BUT, almost 29 million differences –What are the important differences??? Multiple mammalian genomes sequenced –Conservation used to identify functional elements –only 1/3 of conserved regions are protein coding

The HAR screen Identify previously conserved regions –≥100 bp 96% identical between the chimpananzee, mouse and rat genomes –~35,000 mammalian conserved regions Compare to human sequence to identify Human Accelerated Regions –Look for orthologous segments with a large number of changes –Develop statistical methods to rank and evaluate each HAR Identified 49 regions with a significant increased substitution rate in humans (genome wide FDR<5%) Katie Pollard

Wet lab HAR projects HAR population resequencing Analysis of HAR1 Characterization of HAR2 knockout and knockin mice

Why resequence the HARs? Positive selection –Beneficial mutation enters population –Spreads. Nearby (neutral) alleles from mutated chromosome hitchhike towards fixation – a selective sweep –Skew DAF spectrum towards both ends Confounding factor: time –Neutral drift removes variation in 4N eff generations (~1 MYr in human) Human/chimp ancestor 5- 7 MYA Stringer Nature 2003 Noonan et al. Science 2006

Resequence HARs 1 to 49 40kb around each HAR (~2.5Mb total with 13 control regions) 24 samples (48 chromosomes) YRI hapmap samples (panel P2 Seattle SNPs) Enough to do population genetic analysis on a HAR-by-HAR basis (not like our paper on ultras in the average) High throughput sequencing technology enables cost effective investigation. Sol Katzman

“Next-Gen” Sequencing ABI SOLiD (fluoro seq by repeated ligation) –35bp reads (fragment, not mate-pair) –$3-4K per run –2 slides per run –multiple samples per slide barcoded samples Isolated drops on a slide –50 to 100 Million reads per slide Total 2.5Gb of reads 50% mapped? 50% enriched? 250X coverage of 2.5Mb target regions? Divide by number of samples in run for sample coverage –From 1000 Genomes project: Need 11X to get both 99% prob Need 27X average to get 99% prob

Project Overview (part 1 of 2) to Part 2 Sol Katzman

Project Overview (part 2 of 2) from Part 1 Sol Katzman

Wet lab HAR projects HAR population resequencing Analysis of HAR1 Characterization of HAR2 knockout and knockin mice

and the winner is….HAR1! 118 bp segment with 18 changes between the human and chimp sequences

HAR1 genomic landscape Browser gazing suggested the HAR1 element may be expressed in both orientations rt-PCR on human tissue RNA preps suggested brain specific expression of the HAR1 element Used RACE to clone both forward and reverse transcripts from cortical and cerebellar RNA

HAR1 is transcribed HAR1F expressed in brain (cerebellum, forebrain structures), ovary and testes (~1/10 of brain expression) HAR1R expressed in brain (1/10 of HAR1F) and testes Outside HAR1 element, little conservation beyond primates HAR1

RNA in situ hybridization Fix tissue (whole embryo or sections) Synthesize digoxygenin labelled probe anti- sense to desired target Hybridize, wash, visualize using enzyme linked anti-DIG anitbody superfly.ucsd.edu

HAR1F is expressed in the in the neocortex Nelle Lambert, Marie-Alexandra Lambot, Sandra Coppens, Pierre Vanderhaeghen 500µm 250µm

Reelin and cortical development Amadio, JP & Walsh, CA, Cell 126: (2006)

HAR1F is expressed in the marginal zone and the cortical plate Nelle Lambert, Marie-Alexandra Lambot, Sandra Coppens, Pierre Vanderhaeghen 125 µm

Expression of HAR1F in the neocortex continues though 19 GW Nelle Lambert, Marie-Alexandra Lambot, Sandra Coppens, Pierre Vanderhaeghen 250 µm 1000 µm

Co-expression of Reelin and HAR1F in Cajal-Retzius neurons Nelle Lambert, Marie-Alexandra Lambot, Sandra Coppens, Pierre Vanderhaeghen 250 µm

Expression of HAR1F elsewhere in the brain at later embryonic stages Nelle Lambert, Marie-Alexandra Lambot, Sandra Coppens, Pierre Vanderhaeghen

The HAR1F neocortical expression pattern is found in macaque Expression pattern conserved since the divergence of hominoids and old world monkeys 25 MYA Colette Dehay, Pierre Vanderhaeghen

HAR1F is predicted to form a stable RNA structure Jakob Pederson

Human Chimp HumanChimp UGCA UGCA DMS Haller Igel, Manny Ares Structure probing reveals differences in the human and chimp structures

Human HAR1F differs from the ancestral RNA stucture

Resequencing/population genetics Samples –24 member human diversity panel (HAR1 element) –70 Caucasian and African American (6.5 kb region) –Other primates (gorilla, orangutan, macaque) Findings –human-specific changes fixed in the populations (NO SNPs!) –Changes happened at least 1 MYA, no evidence of a recent selective sweep –Large number of human changes extends throughout HAR1F 1 st exon Sol Katzman, Bryan King, Andy Kern

Summary HAR1 is the most extreme of a set of genomic regions showing increased substitutions specifically in the human lineage HAR1 overlaps 2 divergent ncRNA genes, HAR1F and HAR1R HAR1F is expressed in the neocortex in reelin producing Cajal-Retzius neurons which are critical for creating the architecture of the human cortex and also in other structures patterned by the reelin pathway HAR1F forms a stable RNA structure and the human substitutions appear to alter this structure

What does HAR1 do??? What is the cellular role of HAR1 ncRNAs? Where are they localize? Who do they interact with? What is their role in neural development? How do human HAR1 ncRNAs differ from other mammalian HAR1 ncRNAs?

Wet lab HAR projects HAR population resequencing Analysis of HAR1 Characterization of HAR2 knockout and knockin mice

HAR2 12 human substitutions in a 119 bp segment highly conserved in amiotes, present in frog Not in a mature transcript, no RNA secondary structure

HAR2 Genomic Neighborhood HAR2 located in an intron of Centaurin-gamma 2 Closest neighbor is Gastrulation and brain-specific homeobox protein 2 CENTG2-HAR2-GBX2 relationship conserved back to frog-human ancestor

Transgenic assay for enhancer activity LacZ Minimal Promoter HAR2 Harvest at embryonic timepoints. Stain to visualize lacZ activity. How does LacZ expression compare with that of nearby genes (centg2 and gbx2)?

HAR2 is a neural-specific enhancer Bryan King and Armen Shamamian

HAR2 is a limb specific enhancer Human HAR2 shows significant activity in the limb buds Human HAR2 is stronger and shows a broader pattern of expression Making the human substitutions in the chimp construct is sufficient for increased limb bud staining Prabhakar et al. (2008) Science

HAR2 targeted mutants HAR2 knockout – marked allele is made, breeding with constitutive cre mouse to remove vector/marker sequences HAR2 knockin human HAR2 – Have ES cell line, no chimeras yet HAR2 knockin mouse HAR2 – Have construct Robert Sellers, Armen Shamamian

Acknowledgements Haussler Lab Jeff Long, Ting Wang, Danielle Gomez Manny Ares Haller Igel Harry Noller David Feldheim Jena Yamada Nader Pourmand UCSC Collaborators Funding HHMI, NIDA Pierre Vanderhaegen – Univ. of Brussels Katie Pollard – UCD  UCSF/Gladstone Andy Kern - Dartmouth