Junk DNA and DNA editing Shai Carmi Bar-Ilan, BU מוצ " ש י " ג אייר 17/05/2008.

Slides:



Advertisements
Similar presentations
Site-specific recombination
Advertisements

Genomics – The Language of DNA Honors Genetics 2006.
Retroviruses And retroposons
Retroviruses and Retroposons Chapter Introduction Figure 22.1.
RNA and PROTEIN SYNTHESIS
Lecture 10 DNA Translation and Control
Transposons & Mechanisms of Transposition
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Stability of the Genome Duplication, Deletion, Transposition.
Lecture #8Date _________ n Chapter 19~ The Organization and Control of Eukaryotic Genomes.
GENE DUPLICATIONS A.Non-homologous recombination B.Transposition C.Non-disjunction in meiosis.
BIOLOGY EXAM II – CLARISSA’S PART 10/16: Pulse Chase, FRAP, Vesicle Transport, SNARES and other proteins involved 10/23: Vesicle Transport, Neurotransmission,
Genomic Repetitive Elements (Human Focus). TYPES OF ELEMENTS Tandem repeats: a) satellite DNA 1) centromeric and heterochromatic 2) minisatellite 3) microsatellite.
Retroviruses and Retroposons
Chapter 11 Gene Expression and Epigenetics
7 Mechanisms of Mutation and DNA Repair. Mutations Spontaneous mutation : occurs in absence of mutagenic agent Rate of mutation: probability of change.
Transcription: Synthesizing RNA from DNA
CHAPTER 3 GENE EXPRESSION IN EUKARYOTES (cont.) MISS NUR SHALENA SOFIAN.
Introduction to gene expression Seema Zargar. Lecture outline Introduction to all terms used in Gene expression.
Essentials of the Living World Second Edition George B. Johnson Jonathan B. Losos Chapter 13 How Genes Work Copyright © The McGraw-Hill Companies, Inc.
From DNA to Proteins Lesson 1. Lesson Objectives State the central dogma of molecular biology. Describe the structure of RNA, and identify the three main.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
The Organization and Control of Eukaryotic Genomes Ch. 19 AP Biology Ms. Haut.
Chapter 19: Eukaryotic Genomes Most gene expression regulated through transcription/chromatin structure Most gene expression regulated through transcription/chromatin.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
The Structure and Function of DNA CHAPTER 10 Transcription (DNA  RNA) RNA Polymerase Processing of Eukaryotic RNA Translation (mRNA  Protein) The Three.
Introns and Exons DNA is interrupted by short sequences that are not in the final mRNA Called introns Exons = RNA kept in the final sequence.
Gene Expression Chapter 13.
Human Genetics The Human Genome 1.
Regulating Gene Expression Proteins are not required by all cells at all times Eukaryotes – 4 ways – Transcriptional (as mRNA is being synthesized) – Post-
Transposition Evidence Mechanisms: DNA-mediated RNA-mediated.
RNA and Protein Synthesis
RNA AND PROTEIN SYNTHESIS RNA vs DNA RNADNA 1. 5 – Carbon sugar (ribose) 5 – Carbon sugar (deoxyribose) 2. Phosphate group Phosphate group 3. Nitrogenous.
Selfish DNA Honors Genetics.
Eukaryotic Gene Regulation. Chromatin Structure  DNA & protein  1) Nucleosomes  DNA & histones (proteins)  DNA wrapped around 8-piece histone bead.
Central Dogma of Molecular Biology From Wikipedia Edited by Jungho Kim.
RNA and Protein Synthesis
Fig Genome = Genic + Intergenic (or non-genic) Eukaryotic genomes: composition of human genome.
Plate 36 Retroviruses.
BACTERIA AND VIRUSES. DNA core Protein coat (capsid) Characteristics: Parasitic Replicate only inside phenomenal rate.
Lecture 9 Site Specific Recombination and Transposition Quiz 5 due today at 4 PM.
What’s So Special About DNA? DNA is one of the most boring macromolecules imaginable - its made of only four building blocks and has a perfectly monotonous.
HIV molecular biology BTY328: Virology
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Introduction to Gene Expression
Ch. 17 From Gene to Protein. Genes specify proteins via transcription and translation DNA controls metabolism by directing cells to make specific enzymes.
Control of Eukaryotic Genome
Transposable Elements DNA Sequences That Change Positions in the Genome.
CHAPTER 13 RNA and Protein Synthesis. Differences between DNA and RNA  Sugar = Deoxyribose  Double stranded  Bases  Cytosine  Guanine  Adenine 
Ch Gene  Protein A gene is a sequence of nucleotides that code for a polypeptide (protein) Hundreds-thousands of genes are on a typical chromosome.
HIV/AIDS.
Functions of RNA mRNA (messenger)- instructions protein
Chap 18 The Genetics of Viruses and Bacteria. Structure of Virus Approximately 20 nm in diameter Their genome can contain DNA or RNA. Enclosed by a.
Johnson - The Living World: 3rd Ed. - All Rights Reserved - McGraw Hill Companies How Genes Work Chapter 8 Copyright © McGraw-Hill Companies Permission.
Shai Carmi, Erez Levanon Bar-Ilan University
Protein Synthesis RNA, Transcription, and Translation.
Source: A DULTS AND C HILDREN L IVING WITH HIV/AIDS (Est. Dec 2007) deaths: 2,900,000 in ,100,000 in 2007 new cases: 2,500,000 in.
Retroviruses and Trans(retro)posons
Protein Synthesis Chapter 9 p Protein Synthesis Gene- segment of DNA that codes for a protein. Gene Expression/ Protein Synthesis is the process.
Retroviruses - Retroviridae
Homologous Recombination
RNA & Protein Synthesis
Last Class DNA replication Chromosome replication DNA repair General Recombination.
The Organization and Control of Eukaryotic Genomes Ch. 19 AP Biology Ms. Haut.
Organization of prokaryotic, eukaryotic and viral genomes
Fig Prokaryotes and Eukaryotes
Context Cell nucleus chromosome gene double helix.
Transposable Elements
SGN23 The Organization of the Human Genome
9.11 Viruses That Use Reverse Transcriptase
Evolution of eukaryote genomes
Presentation transcript:

Junk DNA and DNA editing Shai Carmi Bar-Ilan, BU מוצ " ש י " ג אייר 17/05/2008

Genome structure DNA has mostly evolved to store the code of the proteins its host cell is using. Thus, the main functional units of any genome are protein coding genes. The central dogma of molecular biology: DNA→RNA → Protein 5’3’ A,C,G,T A,C,G,U 20 amino acids Final product : Proteins are the cellular machinery

מותר האדם מן הבהמה אין? In human, protein coding sequences are only 2% of the genome. All animals have the same order of magnitude of genes (few tens of thousands). Does non-coding DNA determines complexity? Is everything else junk?

Non-coding DNA The rest codes for introns, promoters and enhancers (regulation of expression), structural sequences (e.g. telomeres), non-coding RNAs such as rRNA and tRNA (translation), micro-RNA (silencing), snRNA (splicing). But this is not all! Almost HALF of the human genome is made of mobile elements. Pieces of ~100-10k base pairs moving around the genome in a cut&paste or copy&paste mechanisms.

DNA transposons DNA transposons: cut&paste using the enzyme transposase (3% of the genome). Sometimes transfers also host sequences. Increases the genome volume only through repeats at the edges or if happens during S-phase.

Retrotransposons Retrotransposons: copy&paste mechanism through RNA intermediate. Main classes:  LTR (retrovirus like, 8.7% of the genome).  LINE (Long interspersed nuclear elements, 21.3%).  SINE (Short interspersed nuclear elements, 13.6%). Retrotransposons behave like retroviruses. What are retroviruses?

Retroviruses Retroviruses are pieces of (ss) RNA (DNA in other viruses) wrapped in a capsid and envelope. They penetrate into the cell, and use the cell machinery to replicate, assemble a new virus, and infect another cell. Example: HIV. Few thousand bases

Retroviral infection

Retroviral proteins (advanced) Pol: Encodes a polyprotein with-  protease (cleavage of the retrovirus proteins).  Reverse transcriptase (copy the RNA to DNA).  RnaseH (degradation of RNA after reverse transcription).  Integrase (integration of the DNA into the genome). Gag: Codes for core and structural proteins of the virus. Env: Glycoprotein that recognizes membrane receptors of the host cell and initiate the process of infection. Complex splicing pattern, with partial overlap and frameshifting

Retrotransposons It is commonly believed that ancient retroviral infection in the germ line is the origin of nowadays retrotransposons. How did they occupy 40% of the genome? 1.Transcription: genomic DNA→RNA. 2.Translation of viral proteins (if possible). 3.Reverse transcription: RNA → DNA by reverse transcriptase. 4.Insertion into new genomic locations, increasing the number of genomic copies of the sequence. Mobile elements are like double edge sword. RETRO: violating the central dogma!

Why are retrotransposons good? Serve as reservoir of sequences for genetic innovation. Retroviral proteins have DNA binding capabilities which can be exploited by the host cell. Regulate expression levels of existing genes. Change gene regulation networks: By copying a promoter, two sequences are controlled by the same transcription factors (or in other cases by RNA binding proteins or miRNA).

Why are retrotransposons bad? Retroelements generate mutations, through direct insertion into genes, or unequal homologous recombination. Responsible to % of all genetic disorders (e.g. hemophilia). Change the normal transcription of the gene (alter promoter activity, anti-sense transcription, silencing via methylation or miRNA binding). Alternative splicing and protein isoforms.

Examples How can we stop them ???

Inhibition of retroelements Few mechanisms exist: Accumulation of mutations results in non-autonomous elements. Methylation and heterochromatin formation attenuates transcription (LINE). RNA interference. DNA editing (more to come). Did we succeed? Probably we did: (1)Here we are, more complex than any other organism. (2)Most elements are inactive– only Alu and L1 are active with insertion once in 100 births.

Basics of DNA editing The APOBEC3 family of proteins was found to restrict retroviral replication. One of its mechanisms of operation is by “Cytosine Deamination of the (-) strand DNA strand after reverse transcription”. Meaning… APOBEC catalyzes some chemical modification of the DNA just before it is integrated into the genome, eventually generating G→A mutation (editing). (localization varies nucleus/cytoplasm). Inducing tens/hundreds of mutations (uracil excision?). Editing itself is not sufficient to stop replication- other mechanisms are also used. Useful in the immune system to generate new antibodies!

Basics of DNA editing

Evolution of APOBEC APOBEC3G is one of the most positively selected genes (=changes the fastest). Ongoing arms race with HIV. In response to APOBEC, HIV developed the Vif protein that can ubiquitinate APOBEC (=send it to “recycle” (proteasome)). Different APOBECs restrict retroviruses/transposons in different mechanisms (e.g., binding to RNA and blocking reverse transcription).

DNA editing in the genome Some retrotransposons were edited by APOBEC, but yet integrated into the genome. New mechanism of mutagenesis. So far, almost neglected by geneticists. Together with Erez Levanon, HMS. Analyzed retroelements in mouse, human and chimp, applying new statistical approach.

Main results Editing has fingerprints in thousands of mouse IAP/MusD retroelements, with distinguished motifs. Predicting hundreds of thousands editing sites. Edited IAPs are transcribed more than non-edited. Some edited IAPs overlap with introns and exons. Phylogenetic tree can be changed if considering editing information. Editing also in non-LTR, LINE mouse elements. Editing in human and chimp HERV retroelements.

DNA editing demonstration Comparing two mouse IAPs. chr9:  chr8: One cluster of 68 consecutive G→A! Total 176/202 G→A mismatches. Easily available raw material for the generation of new functions! (for example: any editing in TGG creates premature stop codon). Can editing accelerate evolution?

DNA editing phylogenetics If two sequences are the same except for G→A mutation, the sequence with ‘G’ must precede the one with ‘A’. Thus we can build the tree of elements. Automatically generated genetic tree. Same tree, masking the editing. Editing affects phylogenetics!

Summary Significant fraction of the DNA originates from infection by ancient RNA viruses, spreading through the genome by reverse transcription and replication. Some of them ‘domesticated’ to benefit the host cell (not really junk!), but some induce deleterious mutations. One of the mechanisms to restrict retrotransposition is editing them before integration into the genome. Many genomic sites are ‘edited’ due to this restriction activity. New mechanism of mutagenesis, potentially leading to evolution of new molecules or function (for example, HIV drug resistance).