Completed genomes: viruses

Slides:



Advertisements
Similar presentations
Table of Contents Section 1 Viral Structure and Replication
Advertisements

Viruses Small but deadly!. The Black Death o Also known as the Black Plague, was a devastating pandemic that first struck Europe in the mid-late-14th.
VIRUS PROPERTIES Infectious – must be transmissible horizontally Intracellular – require living cells RNA or DNA genome, not both* Most all have protein.
Viral Genomics Friday, October 28, 2011 Genomics J. Pevsner
Viruses: a kind of “borrowed life” HIV infected T-cell.
Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter menu Viruses Chapter 24 Table of Contents Section 1 Viral Structure and.
VIRUSES Chapter 24 Video.
Viruses Chapter Nature of Viruses All viruses have same basic structure -Nucleic acid core surrounded by capsid Nucleic acid can be DNA or RNA;
E 1.3 Describe the difficulties in the classification of viruses
Chapter 19~Viruses.
Using Comparative Genomics to Explore the Genetic Code of Influenza Sangeeta Venkatachalam.
Branches of Microbiology Bacteriology Virology Mycology Parasitology Immunology Recombinant DNA technology.
Viral Genomics Wednesday, October 27, 2010 Genomics J. Pevsner
Viruses Gene Regulation results in differential Gene Expression, leading to cell Specialization.
An Introduction to the Viruses Chapter 6 Copyright © The McGraw-Hill Companies, Inc) Permission required for reproduction or display.
Virus Virus, infectious agent found in virtually all life forms, including humans, animals, plants, fungi, and bacteria. Viruses consist of genetic material—either.
Viruses. Relative sizes  Viruses are one of the smallest biological structures known  Between 20 and 50 nanometers in size.  The average animal cell.
Unit 6 – Organismal Biology Part 1: Bacteria and Viruses
Add how bacteria make you sick (toxins) Add how virus makes you sick Add vaccines.
Biology Sylvia S. Mader Michael Windelspecht
Viruses Chapter What is a virus? Viruses- microscopic particles that invade cells and destroy them. A virus is NOT a cell. Has genetic material.
VIRUSES SB13U Unit: Diversity of Living Things “The single biggest threat to man’s continued dominance on the planet is a virus.” —Joshua Lederberg, Nobel.
Chapter 27 Viruses The Nature of Viruses Viruses possess only a portion of the properties of organisms. Parasitic chemicals (segments of DNA of.
Viruses Lecture 16 Fall Viruses What is a virus? Are viruses alive? Read Discovery of Viruses pgs and Fig
Direct evidence of extensive diversity of HIV-1 in Kinshasa by 1960
Basics of HIV Virus Vijay Kandula, MD MPH AAHIVS
Chapter 19~Viruses.
An Introduction to the Viruses Non-Living Etiologies
Introduction to Viruses
The Genetics of Viruses and Prokaryotes
Viruses Page 328.
Ch. 19 Warm-up Why do many scientists classify viruses as non-living?
How to Use This Presentation
The Genetics of Viruses
Virus Basics - part I Viruses are genetic parasites that are smaller than living cells. They are much more complex than molecules, but clearly not alive,
Introduction to Virology
VIRUSES What are they & Where do they come from?.
VIRUSES CHAPTER 10.
Influenza Virus: Evolution in real time
Viruses.
PHARMACEUTICAL MICROBIOLOGY -1 PHT 226
Chapter 19~Viruses.
Viruses.
Viruses Rotavirus (causes stomach flu) HIV virus.
Chapter 24 Table of Contents Section 1 Viral Structure and Replication
Virology Introduction Viral Structure Bacteriophage Replication
Viruses Chapter 19 Page 478.
Ch. 17 Warm-up Why do many scientists classify viruses as non-living?
The Mimivirus Giant double stranded DNA virus Discovered in amoebas
Viruses Small but deadly!.
Chapter 19- Viruses.
Viruses & Bacteria 1.
Chapter 19 Viruses.
Chapter 19. Viruses.
General Animal Biology
Viral Genetics 12/7/2018.
Ch. 19 Warm-up Why do many scientists classify viruses as non-living?
Origins of Human Virus Diversity
Viruses.
Chapter 19 Viruses VIRUS Entry and uncoating DNA Capsid Transcription
Virus Classification: The World of Viruses
Chapter 18. Viral Genetics
Overview: A Borrowed Life
Fig Chapter 19: VIRUS Figure 19.1 Are the tiny viruses infecting this E. coli cell alive? 0.5 µm.
Gene Regulation results in differential Gene Expression, leading to cell Specialization Viruses
Viruses Chapter 26.
Viruses.
Viruses Page 328.
Viruses Page 328.
Presentation transcript:

Completed genomes: viruses Chapter 16: Completed genomes: viruses Jonathan Pevsner, Ph.D. pevsner@kennedykrieger.org Bioinformatics and Functional Genomics (Wiley-Liss, 3rd edition, 2015) You may use this PowerPoint for teaching purposes

Learning objectives After studying this chapter you should be able to: define viruses; explain the basis of the classification of viruses; describe the genomes of HiV, influenza, measles, Ebola, and herpesviruses; describe bioinformatics approaches to determining the function of viral genes and proteins; describe key bioinformatics resources for studying viruses; and compare and contrast DNA and RNA viruses.

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Introduction to viruses Viruses are small, infectious, obligate intracellular parasites. They depend on host cells to replicate. Because they lack the resources for independent existence, they exist on the borderline of the definition of life. The virion (virus particle) consists of a nucleic acid genome surrounded by coat proteins (capsid) that may be enveloped in a host-derived lipid bilayer. Viral genomes consist of either RNA or DNA. They may be single-, double, or partially double stranded. The genomes may be circular, linear, or segmented.

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Introduction to viruses Viruses have been classified by several criteria: -- based on morphology (e.g. by electron microscopy) -- by type of nucleic acid in the genome -- by size (rubella is about 2 kb; HIV-1 about 9 kb; poxviruses are several hundred kb). Mimivirus (for Mimicking microbe) has a double-stranded circular genome of 1.2 megabases (Mb). -- based on human disease

The International Committee on Taxonomy of Viruses (ICTV) offers a resource for classification https://talk.ictvonline.org

Virus taxonomy from the ICTV website B&FG 3e Fig. 16-1 Page 757 Year

Classification of viruses B&FG 3e Fig. 16-2 Page 759

Classification of viruses B&FG 3e Fig. 16-2 Page 759

Classification of viruses based on nucleic acid composition B&FG 3e Table 16.1 Page 760

Mutation rate as a function of genome size hammerhead viroid CChMVd tobacco mosaic virus, human rhinovirus, poliovirus, vesicular stomatitis virus, bacteriophage Φ6, measles virus C. elegans, Mus, Drosophila, human E. coli bacteriophages λ, T2, T4; herpes simplex virus B&FG 3e Fig. 16-3 Page 761

Human disease relevance of viruses Vaccine-preventable viral diseases include: Hepatitis A Hepatitis B Influenza Measles Mumps Poliomyelitis Rubella Smallpox Source: Centers for Disease Control website

Human disease relevance of viruses Disease Virus Hepatitis A Hepatitis A virus Hepatitis B Hepatitis B virus Influenza Influenza type A or B Measles Measles virus Mumps Rubulavirus Poliomyelitis Poliovirus (three serotypes) Rotavirus Rotavirus Rubella Genus Rubivirus Smallpox Variola virus Varicella Varicella-zoster virus Source: Centers for Disease Control website

Vaccine-preventable viral diseases B&FG 3e Table 16.2 Page 762 From www.cdc.gov

Seven viruses that cause cancer in humans B&FG 3e Table 16.3 Page 762

Viral genomes page (NCBI) B&FG 3e Fig. 16.4 Page 763

Viral metagenomics Viral metagenomics refers to the sampling of representative viral genomes from the environment. A typical viral genome is ~50 kilobases (in comparison, a typical microbial genome is ~2.5 megabases). A sample is collected (e.g. seawater, fecal material, or soil). Cellular material is excluded. Viral DNA is extracted, cloned, and sequenced. B&FG 3e Page 764

Henle-Koch postulates: criteria needed to establish a causal relationship between a microbe and a disease [1] The parasite occurs in every case of the disease in question and under circumstances which can account for the pathological changes and clinical course of the disease. [2] It occurs in no other disease as a fortuitous and nonpathogenic parasite. [3] After being fully isolated from the body and repeatedly grown in pure culture, it can induce the disease anew. These postulates (especially [3]) are very hard to prove with viruses. Huebner suggested alternate postulates. B&FG 3e Page 765

Huebner’s postulates: criteria needed to establish a causal relationship between a virus and a disease [1] The virus must be a “real” entity established by passage in culture. [2] The virus must originate from human specimens. [3] Active infection should produce an antibody response. [4] A new virus should be fully characterized and compared with other agents. [5] The virus must be constantly associated with a specific illness. [6] Human volunteers inoculated with the newly recognized agent in double‐blind studies should reproduce the clinical syndrome. [7] Epidemiological studies should identify patterns of infection and disease. [8] A specific vaccine should prevent the disease, establishing an agent as the cause. B&FG 3e Page 765

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Bioinformatic approaches to viruses Some of the outstanding problems in virology include: -- Why does a virus such as HIV-1 infect one species (human) selectively? -- Why do some viruses change their natural host? In 1997 a chicken influenza virus killed six people. -- Why are some viral strains particularly deadly? -- What are the mechanisms of viral evasion of the host immune system? -- Where did viruses originate?

Diversity and evolution of viruses The unique nature of viruses presents special challenges to studies of their evolution. viruses tend not to survive in historical samples viral polymerases of RNA genomes typically lack proofreading activity viruses undergo an extremely high rate of replication many viral genomes are segmented; shuffling may occur viruses may be subjected to intense selective pressures (host immune respones, antiviral therapy) viruses invade diverse species the diversity of viral genomes precludes us from making comprehensive phylogenetic trees of viruses

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Bioinformatic approaches to HIV Human Immunodeficiency Virus (HIV) is the cause of AIDS. Some have estimated that 70 million people were infected with HIV (2017) and 35 million died. HIV-1 and HIV-2 are primate lentiviruses. The HIV-1 genome is 9181 bases in length. Note that as of 2017 there are 700,000 Entrez nucleotide records for this genome (but only one RefSeq entry). Phylogenetic analyses suggest that HIV-2 appeared as a cross-species contamination from a simian virus, SIVsm (sooty mangebey). Similarly, HIV-1 appeared from simian immunodeficiency virus of the chimpanzee (SIVcpz).

HIV phylogeny based on pol suggests five clades 1. Simian immunodeficiency virus from the chimpanzee Pan troglodytes (SIVcpz) with HIV-1 Hahn et al., 2000

HIV phylogeny based on pol suggests five clades 2. SIV from the sooty mangabeys Cerecocebus atys (SIVsm), with HIV-2 and SIV from the macaque (genus Macaca; SIVmac) Hahn et al., 2000

HIV phylogeny based on pol suggests five clades 3. SIV from African green monkeys (genus Chlorocebus)(SIVagm) Hahn et al., 2000

HIV phylogeny based on pol suggests five clades 4. SIV from Sykes’ monkeys, Cercopithecus albogularis (SIVsyk) Hahn et al., 2000

HIV phylogeny based on pol suggests five clades 5. SIV from l’Hoest monkeys (Cercopithecus lhoesti); from suntailed monkeys (Cercopithecus solatus); and from mandrill (Mandrillus sphinx) Hahn et al., 2000

Evolutionary relationships of primate lentiviruses Full-length Pol protein sequences were aligned and a tree was created using the maximum‐likelihood method. There are five major lineages (arrows 1–5). B&FG 3e Fig. 16.5 Page 767

Evolutionary relationships of primate lentiviruses The HIV‐1/SIVcpz lineage is displayed based on a maximum‐likelihood tree using Env protein sequences. Three major HIV‐1 groups (M, N, O; arrows 6–8) are distinguished. B&FG 3e Fig. 16.5 Page 767

Bioinformatic approaches to HIV: NCBI NCBI offers a retrovirus resource with reference genomes and protein sets, and several tools (alignment, genotyping).

Retrovirus resources (NCBI) B&FG 3e Fig. 16.6 Page 769

Bioinformatic approaches to HIV: LANL Los Alamos National Laboratory (LANL) databases provide a major HIV resource. See http://hiv-web.lanl.gov/ LANL offers -- an HIV BLAST server -- Synonymous/non-synonymous analysis program -- a multiple alignment program -- a PCA-like tool -- a geography tool

Map of HIV infection subtypes (worldwide) LANL geography tool B&FG 3e Fig. 16.7 Page 770 Map of HIV infection subtypes (worldwide)

LANL map of HIV infection subtypes (Europe) LANL geography tool B&FG 3e Fig. 16.7 Page 770 LANL map of HIV infection subtypes (Europe)

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Influenza virus Influenza virus leads to 200,000 hospitalizations and ~36,000 deaths in the U.S. each year. Influenza viruses belong to the family Orthomyxoviridae. The viral particles are about 80-120 nm in diameter and can be spherical or pleiomorphic. They have a lipid membrane envelope that contains the two glycoproteins: hemagglutinin (H) and neuraminidase (N). These two proteins determine the subtypes of Influenza A virus. Influenza A

Influenza virus Since 1976, the H5N1 avian influenza virus has infected at least 232 people (mostly in Asia), of whom 134 have died. A major concern is that a human influenza virus and the H5N1 avian influenza strain were to combine, a new lethal virus could emerge causing a human pandemic. In a pandemic, 20% to 40% of the population is infected per year. ►The 1918 Spanish influenza virus killed tens of millions of people (H1N1 subtype). ►1957 (H2N2) ► 1968 (H3N2) ► Asia 2003-2005 (H5N1) ► 2009 (H1N1, “swine flu”)

Influenza virus There are three types: A, B, C ► A and B cause flu epidemics ► Influenza A: 20 subtypes; occurs in humans, other animals. For example, in birds there are nine subtypes based on the type of neuraminidase expressed (group 1: N1, N4, N5, N8; group 2: N2, N3, N6, N7, N9). The structure of H5N1 avian influenza neuraminidase has been reported (Russell RJ et al., Nature 443:45, 2006). ► Influenza A genome consists of eight, single negative-strand RNAs (from 890 to 2340 nucleotides). Each RNA segment encodes one to two proteins.

examples of family Orthomyxoviridae complete genomes Influenza viruses: examples of family Orthomyxoviridae complete genomes B&FG 3e Table 16.4 Page 771

Genes in a representative Influenza A virus complete genome (A/Puerto Rico/8/34(h1N1)) B&FG 3e Table 16.5 Page 772

Schematic of the eight segments from a typical Influenza A virus B&FG 3e Fig. 16.7 Page 770

Chronological summary of influenza A strains B&FG 3e Fig. 16.9 Page 773

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Measles virus Measles virus is one of the deadliest viruses in human history Leading cause of death in children in many countries Morbillivirus of the Paramyxoviridae family (includes mumps and respiratory syncytial virus) Reference genome (see NCBI): accession NC_001498.1 Nonsegmented, negative‐sense RNA genome protected by nucleocapsids and an envelope. Genome has 15,894 bases and six genes that encode eight proteins. Six genes are designated N (nucleocapsid), P (phosphoprotein), M (matrix), F (fusion), H (hemagglutinin), and L (large polymerase). B&FG 3e Page 774

Eight proteins encoded by six genes of the measles virus genome B&FG 3e Fig. 16.7 Page 770

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Ebola virus Ebola is a filovirus that is transmitted between people by contact with body fluids. First reported outbreak occurred in 1976. Largest outbreak began in 2014, centered initially in West Africa. The virus causes hemorragic fever that is often fatal. Ebola virus is an enveloped, single‐stranded RNA negative‐strand virus of the family Filoviridae. The Zaire Ebola virus reference genome is 18,959 bases in length (accession NC_002549.1), with seven genes encoding nine proteins. Bioinformatics resource include a UCSC Ebola virus browser. B&FG 3e Page 775

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Herpesvirus Herpesviruses are double-stranded DNA viruses that include herpes simplex, cytomegalovirus, and Epstein-Barr. The genomic DNA is packed inside an icosahedral capsid; with a lipid bilayer the diameter is ~200 nanometers.

Herpesvirus Phylogenetic analysis suggests three major groups that originated about 180-220 MYA. Mammalian herpesviruses are in all three subfamilies. Avian and reptilian herpesviruses are all in the Alphaherpesvirinae.

Herpesvirus: three main groups Millions of years before present

Herpesvirus taxonomy McGeoch et al. (Virus Res. 117:90-104, 2006) describe a new herpesvirus taxonomy. Family Herpesviridae Subfamilies Alpha-, Beta-, Gammaherpesvirinae New family Alloherpesviridae (piscine, amphibian herpesviruses)

Phylogeny of the herpesviruses: comparison to the evolution of host genomes B&FG 3e Fig. 16.11 Page 777

Phylogeny of the herpesviruses: comparison to the evolution of host genomes B&FG 3e Fig. 16.11 Page 777

Out‐of‐Africa hypothesis: herpesviruses infected marine invertebrates such as oyster ~500 MYA, and various members of the Herpesviridae (listed to the right) established host species specificity. B&FG 3e Fig. 16.12 Page 778

Herpesvirus taxonomy Genome sizes range from 124 kb (simian varicella virus from Alphaherpesvirinae) to 241 kb (chimpanzee cytomegalovirus from Betaherpesvirinae). ► GC content ranges from 32% to 75%. ► Protein-coding regions occur at a density of one gene per 1.5 to 2 kb of herpesvirus DNA. ► There are immediate-early genes, early genes (nucleotide metabolism, DNA replication), and late genes (encoding proteins comprising the virion). ► Introns occur in some herpesvirus genes. ► Noncoding RNAs have been described (e.g. latency-associated transcripts in HSV-1).

Bioinformatic approaches to herpesvirus Consider human herpesvirus 8 (HHV-8)(family Herpesviridae; subfamily Gammaherpesvirinae). Its genome is ~140,000 base pairs and encodes ~80 proteins. Its RefSeq accession number is NC_003409. We can explore this virus at the NCBI website. Try NCBI  Genomes  viruses  dsDNA

HHV8 genome overview (NCBI) B&FG 3e Fig. 16.13 Page 779

HHV8 proteins (graphic and tabular summaries) B&FG 3e Fig. 16.13 Page 779

Viruses can acquire host genes HHV-8 proteins include structural and metabolic proteins. There are also viral homologs of human host proteins such as the apoptosis inhibitor Bcl-2, an interleukin receptor, and a neural cell adhesion-related adhesin. Mechanisms by which viruses may acquire host proteins include recombination, transposition, splicing. A DELTA BLAST search using HHV-8 interleukin IL-8 receptor as a query reveals several other viral IL-8 receptor molecules.

A viral protein (HHV‐8 ORF74) is a G‐protein coupled receptor that is homologous to a superfamily of mammalian G‐protein coupled receptors, including a high‐affinity interleukin 8 (IL‐8) receptor. B&FG 3e Fig. 16.14 Page 780 Results of a DELTA BLAST search (query YP_001129433.1) are shown.

Bioinformatic approaches to herpesvirus Functional genomics approaches have been applied to human herpesvirus 8 (HHV-8). For example, changes in viral gene expression have been measured at different stages of infection. Conversely, gene expression changes have been measured in human cells following viral infection.

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Mimivirus: mimicking microbe Mimivirus is a member of the Mimiviridae family of nucleocytoplasmic large DNA viruses (NCLDVs). It was isolated from amoebae growing in England. The mature particle has a diameter of ~400 nanometers, comparable to a small bacterium (e.g. a mycoplasma). Thus, mimivirus is by far the largest virus identified to date.

Mimivirus: mimicking microbe The mimivirus genome is 1.2 Mb (1,181,404 base pairs). It is a double-stranded DNA virus. ► Two inverted repeats of 900 base pairs at the ends (thus it may circularize) ► 72% AT content (~28% GC content) ► 1262 putative open-reading frames (ORFs) of length >100 amino acids. 911 of these are predicted to be protein-coding genes ► Unique features include genes predicted to encode proteins that function in protein translation. The inability to perform protein synthesis has been considered a prime feature of viruses, in contrast to most life forms. See Raoult D et al. (2004) Science 306:1344.

Largest virus genomes All are double-stranded DNA (no RNA stage). the order Megavirales has been proposed to reflect the genome size of at least 1 megabase. B&FG 3e Table 16.6 Page 782

MUMmer: alignment of two genomes Use MUMmer on the command line. Obtain viral genome sequences in the FASTA format (e.g. from NCBI). B&FG 3e Page 784

Comparison of megabase‐size viral genome sequences using MUMmer software The Acanthamoeba polyphaga Mimivirus (reference, x axis) and Acanthamoeba castellanii mamavirus (y axis) genomes are largely collinear. B&FG 3e Fig. 16.16 Page 785

Comparison of megabase‐size viral genome sequences using MUMmer software Comparison of Acanthamoeba polyphaga Mimivirus versus Acanthamoeba polyphaga moumouvirus. Forward MUMs are indicated in red, while reverse MUMs are colored blue. A prominent inversion is evident and a translocation (arrow 1). B&FG 3e Fig. 16.16 Page 785

Virus resources available on web B&FG 3e Table 16.7 Page 787

Outline Introduction Classification of viruses Bioinformatics approaches to virology Human immunodeficiency virus (HIV) Influenza virus Measles virus Ebola virus Herpesvirus: from phylogeny to gene expression Giant viruses Perspectives

Perspectives There are relatively few species of viruses because of their specialized requirements for replication in host cells. Thousands of viral genomes have been sequenced, and metagenomics projects are helping to define the diversity of viral genomes. It remains a challenge to use bioinformatics knowledge to successfully create vaccines. Many viruses (such as measles) remain deadly to humans, and some such as influenza pose potentially catastrophic threats. B&FG 3e Page 785