SS 2008lecture 3 Biological Sequence Analysis 1 V3 regulation of imprinted genes Review of lecture V2... What does the differential methylation of CpG.

Slides:



Advertisements
Similar presentations
Epigenetic phenomena Epigenetics refers to genetic inheritance that is not coded by the DNA sequence It includes changes in gene expression due to modification.
Advertisements

Methods to read out regulatory functions
LINEs and SINEs ….& towards cancer! Presenter: Manindra Singh Course: MCB 720 (Winter Qt.)
Genomics – The Language of DNA Honors Genetics 2006.
Functional Non-Coding DNA Part II DNA Regulatory Elements BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG.
Transcriptional regulation in Eukaryotes The regulatory elements of bacterial, yeast, and human genes.
Differential Gene Expression
Describe the structure of a nucleosome, the basic unit of DNA packaging in eukaryotic cells.
IDENTIFICATION OF THE MOLECULAR MECHANISMS IN RETT SYNDROME AND RELATED DISORDERS (RTT-GENET) X.
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Stability of the Genome Duplication, Deletion, Transposition.
CTCF maintains differential methylation at the Igf2/H19 locus Christopher J. Schoenherr, John M. Levorse & Shirley M. Tilghman Nat Genet Jan;33(1):66-9.
Gene Regulation results in differential Gene Expression, leading to cell Specialization Eukaryotic DNA.
Epigenetics: Genomic imprinting. Genomic Imprinting Preferential expression (or repression) of one parental allele Epigenetic modification mechanism (CpG.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
The Organization and Control of Eukaryotic Genomes Ch. 19 AP Biology Ms. Haut.
An Introduction to ENCODE Mark Reimers, VIPBG (borrowing heavily from John Stamatoyannopoulos and the ENCODE papers)
Selfish DNA Honors Genetics.
Control of gene expression Transcriptional Post-transcriptional Epigenetics and long range control.
“REWIRING STEM CELLS: NEW TECHNIQUE MAY REVOLUTIONIZE UNDERSTANDING OF HOW GENES FUNCTION” AND “IMPORTANT DISCOVERY FOR DIAGNOSIS OF GENETIC DISEASES”.
3 mécanismes différents pour déterminer le sexe. Développement mâle et femelle.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
Gene & Genome Evolution1 Chapter 9 You will not be responsible for: Read the How We Know section on Counting Genes, and be able to discuss methodologies.
Regulation of Gene Expression Chapter 18. Warm Up Explain the difference between a missense and a nonsense mutation. What is a silent mutation? QUIZ TOMORROW:
CTCF and Imprinting Disorder Preeti Misra Sang-Gook Han.
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
Epigenetics Heritable characteristics of the genome other than the DNA sequence Heritable during cell-division (mitosis) To a lesser extent also over generations.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
BACTERIAL TRANSPOSONS
Eukaryotic Genomes 15 November, 2002 Text Chapter 19.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Advantages of C. elegans: 1. rapid life cycle 2. hermaphrodite 3. prolific reproduction 4. transparent 5. only ~1000 cells 6. laser ablation 7. complete.
Control of Eukaryotic Genome
CS173 Lecture 9: Transcriptional regulation III
‘mobile’ DNA or ‘jumping’ DNA Transposable elements as drivers of evolution.
Epigenetics Abira Khan. What is Epigenetics?  Histone code: Modifications associated with transcriptional activation- primarily methylation and acetylation-would.
Content What is epigenetics?. The Mapping of the Human Genome Project 2000 A working draft but completed in 2003 Only 20,000–25,000 genes! Only 1.5% of.
How do eucaryotic gene activator proteins increase the rate of transcription initiation? 1.By activating directly on the transcription machinery. 2.By.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Alu Elements PCR Workshop Instruction manuals that come with new gadgets are notoriously frustrating…but at least they do not insert, just when.
1. What is the Central Dogma? 2. How does prokaryotic DNA compare to eukaryotic DNA? 3. How is DNA organized in eukaryotic cells?
 DNA- genetic material of eukaryotes.  Are highly variable in size and complexity.  About 3.3 billion bp in humans.  Complexity- due to non coding.
The Organization and Control of Eukaryotic Genomes Ch. 19 AP Biology Ms. Haut.
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
Distribution of CpG dinucleotide in the human genome and differences in methylation patterns between normal and tumor cells. In the majority of the mammalian.
Control of gene expression
PCB5065 Fall Exam 4 - Chase Name __________________________________
Regulation of Gene Expression
SGN23 The Organization of the Human Genome
Long Noncoding RNA in Prostate, Bladder, and Kidney Cancer
Introduction to Genetic Analysis
Nat. Rev. Endocrinol. doi: /nrendo
Molecular Mechanisms of Gene Regulation
Concept 18.2: Eukaryotic gene expression can be regulated at any stage
Rosalind M John, M.Azim Surani  Cell 
Volume 126, Issue 4, Pages (April 2004)
Figure 3 The Beckwith–Wiedemann syndrome locus at chromosome 11p15.5
Gene Density and Noncoding DNA
Addition of H19 ‘Loss of Methylation Testing’ for Beckwith-Wiedemann Syndrome (BWS) Increases the Diagnostic Yield  Jochen K. Lennerz, Robert J. Timmerman,
CTCF: Master Weaver of the Genome
M. Rumman Hossain BIOL 506 Fall 2011 Class Presentation
CTCF: Master Weaver of the Genome
Hannah K. Long, Sara L. Prescott, Joanna Wysocka  Cell 
Mechanisms and Consequences of Alternative Polyadenylation
Imprinted Chromatin around DIRAS3 Regulates Alternative Splicing of GNG12-AS1, a Long Noncoding RNA  Malwina Niemczyk, Yoko Ito, Joanna Huddleston, Anna.
Epigenetic Transitions in Germ Cell Development and Meiosis
Eukaryotic Gene Regulation
Genomic imprinting Current Biology
Gene Expression II Kim Foreman, PhD
Addition of H19 ‘Loss of Methylation Testing’ for Beckwith-Wiedemann Syndrome (BWS) Increases the Diagnostic Yield  Jochen K. Lennerz, Robert J. Timmerman,
Presentation transcript:

SS 2008lecture 3 Biological Sequence Analysis 1 V3 regulation of imprinted genes Review of lecture V2... What does the differential methylation of CpG islands mean? - What do the two models describe? - How did the authors arrive at the two models? - How could one distinguish between these two models?

SS 2008lecture 3 Biological Sequence Analysis 2 Outline what is genomic imprinting? networks of imprinted genes imprinting mechanisms protein-DNA interaction hypotheses detecting motifs in DNA sequences –evolutionary conserved regions –protein binding sites –gene regulation modules –"imprinting motifs" –... Alu sequences KCNQ1

SS 2008lecture 3 Biological Sequence Analysis 3 Genomic Imprinting monoallelic expression of a gene depending on its parental origin in mammals about 70 known imprinted genes (human, mouse), also found in insects and in flowering plants estimated: 1 - 2% of all genes = imprinted genes are often organized in clusters with "imprinting centers" Igf2 H19 Igf2 paternal gene copy maternal gene copy Igf2: coding insulin-like growth factor protein H19: untranslated RNA

SS 2008lecture 3 Biological Sequence Analysis 4 Imprinted genes Imprinted genes of the mouse are distributed unevenly throughout the genome. About half of the known ones are located on Chromosome 7, clustered into at least five distinct imprinted domains. red: maternally expressed genes blue: paternally expressed genes PLOS Genet. 2, e147 (2006)

SS 2008lecture 3 Biological Sequence Analysis 5 methylation of Cytosine in CpG: differentially methylated regions (DMRs) altered chromatin structure binding of proteins (transcription factors, silencers) depending on methylation status setting the imprint –hypothesis: male specific and female germ line specific proteins recognize different patterns and set different imprints in sperm and egg –how these imprint markers might find their targets: tandem repeats –sequence not (well) conserved – like many DMRs – –are enriched in the CpG islands of imprinted genes –special DNA structure sequence patterns (germ line specific protein/transcription factor binding sites): evolutionary conserved AGAACCGCGGCGAGAGGCC AGAACCGCGCCGAAGAACC ACAACCGCGCCGAAGAACC AGAACCGCGCCGAAAAGCC Imprinting Mechanisms

SS 2008lecture 3 Biological Sequence Analysis 6 Regulatory models at imprinted loci (A) The enhancer–blocker model (also known as the boundary model) is well studied at the Igf2/H19 locus and consists of an imprinting control region (ICR) located between a pair of reciprocally expressed genes that controls access to shared enhancer elements. On the paternal allele, the differentially methylated domain (DMD) acquires methylation (black circles) during spermatogenesis, which leads to repression of the H19 promoter. The hypomethylated maternal DMD acts as an insulator element, mediated through binding sites for the methylation-sensitive boundary factor CTCF (shaded ellipse). When CTCF is bound, Igf2 promoter access to the enhancers (E) distal to H19 is blocked. PLOS Genet. 2, e147 (2006) Blue boxes : paternally expressed alleles, red boxes : maternally expressed alleles, black boxes : silenced alleles, grey boxes : nonimprinted genes. Arrows on boxes indicate transcriptional orientation.

SS 2008lecture 3 Biological Sequence Analysis 7 Protein Interactions and Chromatin Loops Igf2 H19 Murrell et al. (2004) Nature Genet. 36: 889 maternal chromosome: DMR1 and DMR unmethylated, CTFC bound  H19 is expressed (interaction with the enhancers), Igf2 is silenced paternal chromosome: DMR and DMR2 methylated, no CTCF binding  Igf2 in contact with enhancers, active; H19 silenced reading the imprint: candidate "imprinting transcription factors" CTCF, YY1 chromatin loop model –DMRs interact via proteins –mediates interaction with the enhancers

SS 2008lecture 3 Biological Sequence Analysis 8 Regulatory models at imprinted loci (B) At the Igf2r locus on Chromosome 17, the paternally expressed, noncoding RNA Air acts to induce bidirectional cis- mediated silencing (black curved lines) on neighbouring protein-coding genes (maternally expressed Igf2r, Slc22a3, and Slc22a2). The grey ellipses are the intronic imprint control elements that are maternally methylated (black circles) and contain the promoter of the Air RNA. PLOS Genet. 2, e147 (2006)

SS 2008lecture 3 Biological Sequence Analysis 9 Regulatory models at imprinted loci (C) At microimprinted domains, oocyte-derived methylation in the promoter region of a protein- coding gene is likely to be the primary epigenetic mark leading to monoallelic silencing. With the exception of the U2af1-rs1 locus, the multiexonic genes within which the paternally expressed transcripts are embedded, escape imprinting. The paternally expressed Nap1l5 is situated within intron 22 of Herc3, which is expressed from both alleles. PLOS Genet. 2, e147 (2006)

SS 2008lecture 3 Biological Sequence Analysis 10 Evolution of imprinted loci Blue: paternally derived alleles, red: maternally derived alleles, Yellow: transposed sequence. Black lollipops: methylated CpGs, light blue dome: a trans-acting factor. Asterisk: gene duplicate. (A) Random molecular events or mutations in the germ-cell lineage generate alleles that undergo differential methylation when passing through the male and female germ line, which can confer either (B) negative or (C) positive fitness. PLOS Genet. 2, e147 (2006)

SS 2008lecture 3 Biological Sequence Analysis 11 Functions of Imprinted Genes imprinting disorders generally cause diseases –over- or underexpression of the corresponding gene products control cell proliferation –growth factors –tumor suppressors –embryonic development ("giant baby") important for brain development –(adult) behavior imprinted genes are often transcription factors –regulation of other genes

SS 2008lecture 3 Biological Sequence Analysis 12 Source: Unified Human Interactome ( Are the imprinted genes alone? Protein-Protein Interactions of Imprinted Genes

SS 2008lecture 3 Biological Sequence Analysis 13 Are the imprinted genes alone? Coexpression Network of Imprinted Genes Zac1 (Plagl1) is a transcription factor -> also regulatory networks! Arima et al. (2005) NAR 33: 2650 Varrault et al. (2006) Dev. Cell 11: 711

SS 2008lecture 3 Biological Sequence Analysis 14 Imprinted genes and repetitive elements Imprinted genes show depletion of short interspersed transposable elements (SINEs) and an enrichment of long interspersed nuclear element 1 (LINE-1) repeats.

SS 2008lecture 3 Biological Sequence Analysis 15 Alu sequence An Alu sequence is a short stretch 300 bp of DNA originally characterized by the action of the Alu restriction endonuclease that was isolated from Arthrobacter luteus. They are therefore classified as short interspersed nuclear elements (SINEs) and are the most abundant mobile elements in the human genome. There are over one million Alu sequences of different kinds interspersed throughout the human and other primate genomes, and probably make up about 10% of the whole genome. Less than 0.5% are polymorphic. Alu sequences are derived from the small cytoplasmic 7SL RNA, a component of the signal recognition particle. The recognition sequence of the Alu endonuclease is 5' AG/CT 3. Most human Alu sequence insertions can be found in the corresponding positions in the genomes of other primates. About 7,000 Alu insertions are unique to humans.

SS 2008lecture 3 Biological Sequence Analysis 16 variability of Alu sequences Nat. Rev. Gen. 3, 370 (2002)

SS 2008lecture 3 Biological Sequence Analysis 17 insertion of Alu sequence Nat. Rev. Gen. 3, 370 (2002) Alu elements are thought to „borrow“ factors such as a functional reverse transcriptase from nearby LINE elements.

SS 2008lecture 3 Biological Sequence Analysis 18 Nat. Rev. Gen. 3, 370 (2002) history of Alu sequences Most Alu repeats duplicated ca. 40 Mya. At that time ca. 1 new Alu insertion every primate birth. Currently, ca. 1 Alu insertion every 200 births. Possible reasons for decline: - altered transcription or reverse transcription activity - decreased availability of available insertion sites.

SS 2008lecture 3 Biological Sequence Analysis 19 Spread of an Alu insertion Nat. Rev. Gen. 3, 370 (2002)

SS 2008lecture 3 Biological Sequence Analysis 20 Example for imprinted gene: KCNQ1 – KvLQT1 KvLQT1 is a potassium channel protein coded for by the gene KCNQ1. KvLQT1 is present in the cell membranes of cardiac muscle tissue and in inner ear neurons among other tissues. In the cardiac cells, KvLQT1 mediates the IKs (or slow delayed rectifying K + ) current that contributes to the repolarization of the cell, terminating the cardiac action potential and thereby the heart's contraction. Mutations in the gene can lead to a defective protein and several forms of inherited arrhythmias as Long QT syndrome, Short QT syndrome, and Familial Atrial Fibrillation. The gene product can form heteromultimers with two other potassium channel proteins, KCNE1 and KCNE3. The gene is located in a region of chromosome 11 that contains a large number of contiguous genes that are abnormally imprinted in cancer and the Beckwith-Wiedemann syndrome. Two alternative transcripts encoding distinct isoforms have been described

SS 2008lecture 3 Biological Sequence Analysis 21 2D structure of KCNQ1 comment: S1 – S6 are six transmembrane helices the P-loop between S5 and S6 enters into the membrane and forms the selectivity pore. Smith et al. Biochemistry (2007) 46, 14141

SS 2008lecture 3 Biological Sequence Analysis 22 3D model for KCNQ1 based on Kv1.2 structure Ensembles of the 20 lowest energy models for open and closed state KCNQ1 monomers. This highlights the implicit flexibility and/or conformational uncertainty for the loop segments of the models. For the open state, blue regions were derived from the Kv1.2 crystal structure (2A79.pdb). Green regions were derived from the crystal structure backbone coordinates for the S1 and S3 regions. Orange regions were modeled de noVo using Rosetta. For the closed state, blue regions were derived from the KcsA crystal structure (1K4C.pdb). Yellow regions were derived from the Yarov-Yaravoy et al. Kv1.2 closed state model. Orange regions were modeled de noVo using Rosetta. Smith et al. Biochemistry (2007) 46, 14141

SS 2008lecture 3 Biological Sequence Analysis 23 open/closed structure of tetramer Smith et al. Biochemistry (2007) 46, 14141

SS 2008lecture 3 Biological Sequence Analysis 24 KCNQ1 – position of disease associated mutations Smith et al. Biochemistry (2007) 46, 14141

SS 2008lecture 3 Biological Sequence Analysis 25 CpG islands „rich“ in CG pairs more precisely: not so „poor“ in CG pairs as the rest of the genome recall that Cs in CpGs are often deamidated and converted into Ts. Why are CpG islands often found at the promoter region? Because this region is under high selective pressure.

SS 2008lecture 3 Biological Sequence Analysis 26 What are Tandem repeats? How does one find CpG islands? What are Gardiner-Frommer and Takai-Jones parameters? Why do we need t-tests? What are the findings of this paper?