Viral Genomics Friday, October 28, 2011 Genomics 260.605.01 J. Pevsner

Slides:



Advertisements
Similar presentations
Virus Classification And Description. Classification Parameters Several Parameters Are Used for Classification –Viral classification study is referred.
Advertisements

Table of Contents Section 1 Viral Structure and Replication
Max Sanam.  Understand stages in animal virus replication  Compare and contrast the multiplication cycle of DNA and RNA-containing animal viruses 
Viruses AP Biology Unit 2 Images taken without permission from and
 Obligate intracellular parasite  Small: nm  Nucleic acid genome  DNA or RNA  single- or double-stranded  Protein capsid  Lipid envelope.
General properties of viruses 1-They are very small in size, from nm 2-They contain one kind of nucleic acid (RNA or DNA) as their genome 3-They.
Viruses.  What is a virus? Defined by their inability to replicate/multiply without utilizing a host cells reproductive mechanisms. Only contain ONE.
Viruses Small but deadly!. The Black Death o Also known as the Black Plague, was a devastating pandemic that first struck Europe in the mid-late-14th.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
VIRUS PROPERTIES Infectious – must be transmissible horizontally Intracellular – require living cells RNA or DNA genome, not both* Most all have protein.
An Introduction to the Viruses
Viruses: a kind of “borrowed life” HIV infected T-cell.
Unit 3: Viruses!.
Origins of HIV Dr. Matthew Marsden, Ph.D. UCLA School of Medicine
Herpesviruses. 100 nm Herpesvirus structures are unique, with tegument layer present and genomic DNA wrapped around core.
VIRUSES Chapter 24 Video.
Lecture 9 Viruses, Viroids, Prions
Viruses Chapter Nature of Viruses All viruses have same basic structure -Nucleic acid core surrounded by capsid Nucleic acid can be DNA or RNA;
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
E 1.3 Describe the difficulties in the classification of viruses
Chapter 24 Video.  Computer Viruses?  Not in the scope of this class. They behave similarly, but are not at all related.
What do you know about Viruses? 1. What are the 5 most common viral infections? 2. Name 2 similarities between a virus and a bacteria? 3. Name 2 differences.
Chapter 19~Viruses.
Diversity of Living Things
Using Comparative Genomics to Explore the Genetic Code of Influenza Sangeeta Venkatachalam.
Viral disease Learning objective: To be able to describe the structural features of a virus.
Branches of Microbiology Bacteriology Virology Mycology Parasitology Immunology Recombinant DNA technology.
Viral Genomics Wednesday, October 27, 2010 Genomics J. Pevsner
Morphology large complex virion,ovoid in shape,with rounded ends and characteristic ball of wool appearance.somewhat larger, by electron microscopy.
Viruses Gene Regulation results in differential Gene Expression, leading to cell Specialization.
Chapter 1 Introduction to virus
BTY328: Virology Dr William Stafford Viral characteristics and isolation-Lecture 1&2 Origin and diversity of viruses?-Tutorial Viral.
An Introduction to the Viruses Chapter 6 Copyright © The McGraw-Hill Companies, Inc) Permission required for reproduction or display.
An Introduction to the Viruses Chapter 6 Copyright © The McGraw-Hill Companies, Inc) Permission required for reproduction or display.
Biology Sylvia S. Mader Michael Windelspecht Chapter 20 Viruses Modified by D. Herder Copyright © The McGraw-Hill Companies, Inc. Permission required for.
Herpesviridae and You Adrienne Manuel I400. THE Immune system: a brief overview For Humans and animals to have maximum health, their bodies needs defense.
REASSORTMENT OF INFLUENZA VIRUS
Add how bacteria make you sick (toxins) Add how virus makes you sick Add vaccines.
Biology Sylvia S. Mader Michael Windelspecht
Copyright OpenHelix. No use or reproduction without express written consent1.
VIRUSES SB13U Unit: Diversity of Living Things “The single biggest threat to man’s continued dominance on the planet is a virus.” —Joshua Lederberg, Nobel.
1 Zoology 145 course General Animal Biology For Premedical Student H Zoology Department Lecture 3 : Viruses.
VIRAL STRUCTURE Image source: healthoma.com. Sources: raritanval.edu; slavirusportfolio.wikispaces.com, virology.wisc.edu.
Chapter 27 Viruses The Nature of Viruses Viruses possess only a portion of the properties of organisms. Parasitic chemicals (segments of DNA of.
Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings PowerPoint ® Lecture Presentations for Biology Eighth Edition Neil Campbell.
INTRODUCTION TO VIRUSES. Viruses They are the non-cellular form of life. A virus is an obligate intracellular parasite containing genetic material surrounded.
Viruses Lecture 16 Fall Viruses What is a virus? Are viruses alive? Read Discovery of Viruses pgs and Fig
The Genetics of Viruses & Bacteria Chapter 18. Overview Viruses and bacteria –are the simplest biological systems –provided evidence that genes are made.
Chapter 19~Viruses.
An Introduction to the Viruses Non-Living Etiologies
Completed genomes: viruses
Good teaching is more a giving of right questions than a giving of right answers. – Josef Albers Viruses Chapter 19.
Viruses Page 328.
VIRUSES What are they & Where do they come from?.
Pipelines for Computational Analysis (Bioinformatics)
Influenza Virus: Evolution in real time
PHARMACEUTICAL MICROBIOLOGY -1 PHT 226
Chapter 19~Viruses.
Virology Introduction Viral Structure Bacteriophage Replication
Viruses.
The Mimivirus Giant double stranded DNA virus Discovered in amoebas
General Animal Biology
Origins of Human Virus Diversity
Chapter 15 Viruses.
Gene Regulation results in differential Gene Expression, leading to cell Specialization Viruses
Good teaching is more a giving of right questions than a giving of right answers. – Josef Albers Viruses Chapter 19.
Viruses Chapter 26.
Viruses Page 328.
Viruses Page 328.
Presentation transcript:

Viral Genomics Friday, October 28, 2011 Genomics J. Pevsner

Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics (2 nd edition) by J Pevsner (ISBN ). Copyright © 2009 by Wiley. These images and materials may not be used without permission from the publisher. Visit Copyright notice

Outline of today’s lecture Introduction Classification of Viruses Diversity and Evolution of Viruses Metagenomics and Virus Diversity Bioinformatics Approaches to Problems in Virology Influenza Virus Herpesvirus: From Phylogeny to Gene Expression Human Immunodeficiency Virus Measles Virus

Learning objectives for today’s lecture Describe how viruses are classified Explain bioinformatics approaches to virology Describe the influenza virus genome including the new H1N1 virus Provide a description of the Herpesviruses Use NCBI and LANL resources to identify the function and evolution of Human Immunodeficiency Virus (HIV-1)

Viruses are small, infectious, obligate intracellular parasites. They depend on host cells to replicate. Because they lack the resources for independent existence, they exist on the borderline of the definition of life. The virion (virus particle) consists of a nucleic acid genome surrounded by coat proteins (capsid) that may be enveloped in a host-derived lipid bilayer. Viral genomes consist of either RNA or DNA. They may be single-, double, or partially double stranded. The genomes may be circular, linear, or segmented. Introduction to viruses Page 567

Viruses have been classified by several criteria: -- based on morphology (e.g. by electron microscopy) -- by type of nucleic acid in the genome -- by size (rubella is about 2 kb; HIV-1 about 9 kb; poxviruses are several hundred kb). Mimivirus (for Mimicking microbe) has a double-stranded circular genome of 1.2 megabases (Mb). -- based on human disease Page 568 Introduction to viruses

Fig Page 569

Fig Page 570 The International Committee on Taxonomy of Viruses (ICTV) offers a website, accessible via NCBI’s Entrez site

Mimivirus is the first member of the Mimiviridae family of nucleocytoplasmic large DNA viruses (NCLDVs). Recently (10/11) this group has been named megavirus for viruses having a genome size of at least one megabase. It was isolated from amoebae growing in England. The mature particle has a diameter of ~400 nanometers, comparable to a small bacterium (e.g. a mycoplasma). Thus, mimivirus is by far the largest virus identified to date. Mimivirus: mimicking microbe Page 569

The mimivirus genome is 1.2 Mb (1,181,404 base pairs). It is a double-stranded DNA virus. ► Two inverted repeats of 900 base pairs at the ends (thus it may circularize) ► 72% AT content (~28% GC content) ► 1262 putative open-reading frames (ORFs) of length >100 amino acids. 911 of these are predicted to be protein-coding genes ► Unique features include genes predicted to encode proteins that function in protein translation. The inability to perform protein synthesis has been considered a prime feature of viruses, in contrast to most life forms. See Raoult D et al. (2004) Science 306:1344. Mimivirus: mimicking microbe Page 569

Viral metagenomics refers to the sampling of representative viral genomes from the environment. A typical viral genome is ~50 kilobases (in comparison, a typical microbial genome is ~2.5 megabases). A sample is collected (e.g. seawater, fecal material, or soil). Cellular material is excluded. Viral DNA is extracted, cloned, and sequenced. Viral metagenomics Page 573

Edwards RA, Rohwer F. Nature Reviews Microbiology 3, (2005) “The Phage Proteomic Tree is a whole-genome-based taxonomy system that can be used to identify similarities between complete phage genomes and metagenomic sequences. This new version of the tree contains 167 phage genomes. Phages in black cannot be classified into any clade. In the key, each phage is defined in a clockwise direction.”

Vaccine-preventable viral diseases include: Hepatitis A Hepatitis B Influenza Measles Mumps Poliomyelitis Rubella Smallpox Page 571 Human disease relevance of viruses Source: Centers for Disease Control website

DiseaseVirus Hepatitis A Hepatitis A virus Hepatitis BHepatitis B virus InfluenzaInfluenza type A or B MeaslesMeasles virus MumpsRubulavirus PoliomyelitisPoliovirus (three serotypes)Rotavirus RubellaGenus Rubivirus SmallpoxVariola virus VaricellaVaricella-zoster virus Page 571 Source: Centers for Disease Control website Human disease relevance of viruses

Outline of today’s lecture Introduction Classification of Viruses Diversity and Evolution of Viruses Metagenomics and Virus Diversity Bioinformatics Approaches to Problems in Virology Influenza Virus Herpesvirus: From Phylogeny to Gene Expression Human Immunodeficiency Virus Measles Virus

Some of the outstanding problems in virology include: -- Why does a virus such as HIV-1 infect one species (human) selectively? -- Why do some viruses change their natural host? In 1997 a chicken influenza virus killed six people. -- Why are some viral strains particularly deadly? -- What are the mechanisms of viral evasion of the host immune system? -- Where did viruses originate? Bioinformatic approaches to viruses Page 574

The unique nature of viruses presents special challenges to studies of their evolution. viruses tend not to survive in historical samples viral polymerases of RNA genomes typically lack proofreading activity viruses undergo an extremely high rate of replication many viral genomes are segmented; shuffling may occur viruses may be subjected to intense selective pressures (host immune respones, antiviral therapy) viruses invade diverse species the diversity of viral genomes precludes us from making comprehensive phylogenetic trees of viruses Diversity and evolution of viruses Page 574

Find viruses at the NCBI Genomes site influenza SARS viruses

Overview of viral complete genomes PASC ► All ►

PASC (PAirwise Sequence Comparison) is a web tool for analysis of pairwise identity distribution within viral families. The identities are pre-computed for every pair within the families and with distribution plotted in a form of histogram where each bar corresponds to an interval of identities.

Example of PASC output for Herpesviridae Click the plots to obtain alignments of viral genomes having varying degrees of relatedness

Overview of viral complete genomes

Outline of today’s lecture Introduction Classification of Viruses Diversity and Evolution of Viruses Metagenomics and Virus Diversity Bioinformatics Approaches to Problems in Virology Influenza Virus Herpesvirus: From Phylogeny to Gene Expression Human Immunodeficiency Virus Measles Virus

Influenza viruses belong to the family Orthomyxoviridae. The viral particles are about nm in diameter and can be spherical or pleiomorphic. They have a lipid membrane envelope that contains the two glycoproteins: hemagglutinin (H) and neuraminidase (N). These two proteins determine the subtypes of Influenza A virus. Influenza virus Influenza A Influenza virus leads to 200,000 hospitalizations and ~36,000 deaths in the U.S. each year. Page 574

Since 1976, the H5N1 avian influenza virus has infected at least 232 people (mostly in Asia), of whom 134 have died. A major concern is that a human influenza virus and the H5N1 avian influenza strain were to combine, a new lethal virus could emerge causing a human pandemic. In a pandemic, 20% to 40% of the population is infected per year. ►The 1918 Spanish influenza virus killed tens of millions of people (H1N1 subtype). ►1957 (H2N2) ► 1968 (H3N2) ► Asia (H5N1) ► 2009 (H1N1, “swine flu”) Influenza virus Page 575

There are three types: A, B, C ► A and B cause flu epidemics ► Influenza A: 20 subtypes; occurs in humans, other animals. For example, in birds there are nine subtypes based on the type of neuraminidase expressed (group 1: N1, N4, N5, N8; group 2: N2, N3, N6, N7, N9). The structure of H5N1 avian influenza neuraminidase has been reported (Russell RJ et al., Nature 443:45, 2006). ► Influenza A genome consists of eight, single negative- strand RNAs (from 890 to 2340 nucleotides). Each RNA segment encodes one to two proteins. Influenza virus Page 575

Page 576

NCBI offers an Influenza Virus Resource (

Growth of Influenza Virus Sequences in GenBank (updated 10/11 )

Holmes et al. (2005) performed phylogenetic analyses of 156 complete genomes of human H3N2 influenza A viruses collected over time ( ) in one location (New York State). Phylogenetic analysis revealed multiple reassortment events. One clade of H3N2 virus, present since 2002, is the source for the HA gene in all subsequently sampled viruses. Large-scale influenza virus genome analysis Holmes EC, et al. Whole-genome analysis of human influenza A virus reveals multiple persistent lineages and reassortment among recent H3N2 viruses. PLoS Biol Sep;3(9):e300. Page 576

Evolutionary Relationships of Concatenated Major Coding Regions of Influenza A Viruses Sampled in New York State during 1999– The maximum likelihood phylogenetic tree is mid-point rooted for purposes of clarity, and all horizontal branch lengths are drawn to scale. Bootstrap values are shown for key nodes. Isolates assigned to clade A (light blue), clade B (yellow), and clade C (red) are indicated, as are those isolates involved in other reassortment events: A/New York/11/2003 (orange), A/New York/182/2000 (dark blue), and A/New York/137/1999 and A/New York/138/1999 (green). Holmes EC, et al. Whole-genome analysis of human influenza A virus reveals multiple persistent lineages and reassortment among recent H3N2 viruses. PLoS Biol Sep;3(9):e300.

Ghedin et al. (2005) sequenced 209 complete genomes of human influenza A virus (sequencing 2,821,103 nucleotides). See Nature 437:1162. Large-scale influenza virus genome analysis

Each row represents a single amino acid position in one protein. Amino acids (single-letter abbreviations are used) are colour-coded as shown in the key, so that mutations can be seen as changes in colour when scanning from left to right along a row. For simplicity, only amino acids that showed changes in at least three isolates are shown. Each column represents a single isolate, and columns are only a few pixels wide in order to display all 207 H3N2 isolates in this figure. Isolates are ordered along the columns chronologically according to the date of collection; boundaries between influenza seasons are indicated by gaps between columns. A more detailed version of this figure, showing positions that experienced any amino acid change and showing identifiers for the isolates in each column, is available as Supplementary Fig. 1. Ghedin E, et al. Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution. Nature Oct 20;437(7062):

207 H3N2 isolates amino acid positions in influenza proteins

Outline of today’s lecture Introduction Classification of Viruses Diversity and Evolution of Viruses Metagenomics and Virus Diversity Bioinformatics Approaches to Problems in Virology Influenza Virus Herpesvirus: From Phylogeny to Gene Expression Human Immunodeficiency Virus Measles Virus

Herpesviruses are double-stranded DNA viruses that include herpes simplex, cytomegalovirus, and Epstein-Barr. The genomic DNA is packed inside an icosahedral capsid; with a lipid bilayer the diameter is ~200 nanometers. Herpesvirus Page 578

Phylogenetic analysis suggests three major groups that originated about MYA. Mammalian herpesviruses are in all three subfamilies. Avian and reptilian herpesviruses are all in the Alphaherpesvirinae. Page 578 Herpesvirus

Fig Page 578 Millions of years before present Herpesvirus: three main groups

McGeoch et al. (Virus Res. 117:90-104, 2006) describe a new herpesvirus taxonomy. Family Herpesviridae Subfamilies Alpha-, Beta-, Gammaherpesvirinae New family Alloherpesviridae (piscine, amphibian herpesviruses) Herpesvirus taxonomy Page 578

Alphaherpesvirinae Gammaherpesvirinae Betaherpesvirinae Alloherpesviridae (piscine, amphibian) Malacoherpesviridae (invertebrate HV) protein-coding regions Blocks of core genes (I–VII) putative ATPase subunit of the terminase McGeoch DJ et al. (Virus Res. 117:90-104, 2006)

Genome sizes range from 124 kb (simian varicella virus from Alphaherpesvirinae) to 241 kb (chimpanzee cytomegalovirus from Betaherpesvirinae). ► GC content ranges from 32% to 75%. ► Protein-coding regions occur at a density of one gene per 1.5 to 2 kb of herpesvirus DNA. ► There are immediate-early genes, early genes (nucleotide metabolism, DNA replication), and late genes (encoding proteins comprising the virion). ► Introns occur in some herpesvirus genes. ► Noncoding RNAs have been described (e.g. latency- associated transcripts in HSV-1). Herpesvirus taxonomy

Consider human herpesvirus 8 (HHV-8)(family Herpesviridae; subfamily Gammaherpesvirinae). Its genome is ~140,000 base pairs and encodes ~80 proteins. Its RefSeq accession number is NC_ We can explore this virus at the NCBI website. Try NCBI  Entrez  Genomes  viruses  dsDNA viruses, no RNA stage  Herpesvirales Bioinformatic approaches to herpesvirus Page 579

HHV8 taxonomy link HHV8 genome link HHV8 genome summary

Page 579 clusters► NCBI virus site includes tools (e.g. “Protein clusters”) to analyze herpesviruses

Fig Page 579 NCBI virus site includes tools (e.g. “Protein clusters”) to analyze herpesviruses

HHV-8 proteins include structural and metabolic proteins. There are also viral homologs of human host proteins such as the apoptosis inhibitor Bcl-2, an interleukin receptor, and a neural cell adhesion-related adhesin. Mechanisms by which viruses may acquire host proteins include recombination, transposition, splicing. A blastp search using HHV-8 interleukin IL-8 receptor as a query reveals several other viral IL-8 receptor molecules. Viruses can acquire host genes Page 579

Fig Page 581

Functional genomics approaches have been applied to human herpesvirus 8 (HHV-8). For example, microarrays have been used to define changes in viral gene expression at different stages of infection (Paulose-Murphy et al., 2001). Conversely, gene expression changes have been measured in human cells following viral infection. Bioinformatic approaches to herpesvirus Page 582

Fig Page 582 Paulose-Murphy et al. (2001) described HHV-8 viral genes that are expressed at different times post infection

Paulose-Murphy et al. (2001)

Outline of today’s lecture Introduction Classification of Viruses Diversity and Evolution of Viruses Metagenomics and Virus Diversity Bioinformatics Approaches to Problems in Virology Influenza Virus Herpesvirus: From Phylogeny to Gene Expression Human Immunodeficiency Virus Measles Virus

Human Immunodeficiency Virus (HIV) is the cause of AIDS. Some have estimated that 33 million people were infected with HIV (2006). HIV-1 and HIV-2 are primate lentiviruses. The HIV-1 genome is 9181 bases in length. Note that there are >300,000 Entrez nucleotide records for this genome (but only one RefSeq entry). Phylogenetic analyses suggest that HIV-2 appeared as a cross-species contamination from a simian virus, SIVsm (sooty mangebey). Similarly, HIV-1 appeared from simian immunodeficiency virus of the chimpanzee (SIVcpz). Bioinformatic approaches to HIV Page 583

Fig Page 584 HIV phylogeny based on pol suggests five clades Hahn et al., Simian immunodeficiency virus from the chimpanzee Pan troglodytes (SIVcpz) with HIV-1

HIV phylogeny based on pol suggests five clades Hahn et al., SIV from the sooty mangabeys Cerecocebus atys (SIVsm), with HIV-2 and SIV from the macaque (genus Macaca; SIVmac) Fig Page 584

HIV phylogeny based on pol suggests five clades Hahn et al., SIV from African green monkeys (genus Chlorocebus)(SIVagm) Fig Page 584

HIV phylogeny based on pol suggests five clades Hahn et al., SIV from Sykes’ monkeys, Cercopithecus albogularis (SIVsyk) Fig Page 584

HIV phylogeny based on pol suggests five clades Hahn et al., SIV from l’Hoest monkeys (Cercopithecus lhoesti); from suntailed monkeys (Cercopithecus solatus); and from mandrill (Mandrillus sphinx)

NCBI offers a retrovirus resource with reference genomes and protein sets, and several tools (alignment, genotyping). Bioinformatic approaches to HIV: NCBI Page 585

10/11

Example of genotyping tool from NCBI retrovirus resource reference sequence with the highest score

Los Alamos National Laboratory (LANL) databases provide a major HIV resource. See LANL offers -- an HIV BLAST server -- Synonymous/non-synonymous analysis program -- a multiple alignment program -- a PCA-like tool -- a geography tool Bioinformatic approaches to HIV: LANL Page 586

LANL offers many HIV tools including analysis algorithms

Fig Page 588

Monday we will have computer lab on viruses and bacteria. Check the site to find a word document describing the exercises, as well as web links. Wednesday 11/2 we will cover chapter 15 (bacteria and archaea). On Friday 11/4 Egbert Hoiczyk will give a lecture on bacteria. After that we move on to the eukaryotes. Next…