3. genome analysis.

Slides:



Advertisements
Similar presentations
© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
Advertisements

1 3. genome analysis. 2 The first DNA-based genome to be sequenced in its entirety was that of bacteriophage Φ-X174; (5,368 bp), sequenced by Frederick.
Genomic Repeat Visualisation Using Suffix Arrays Nava Whiteford Department of Chemistry University of Southampton
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Environmental Genome Shotgun Sequencing of the Sargasso Sea
Using Artemis to generate the genome Map: Demo 1.
Figure S1_Yao Qin et al. Figure S1 Occurrence and distribution of trihelix family in different plant species. Red branches in the cladogram indicate that.
Mutation And Natural Selection how genomes record a history of mutations and their effects on survival Tina Hubler, Ph.D., University of North Alabama,
Brückner et al., Fig. 1b Brückner et al., Fig. 1B a c b 6 Fig. 1. Circular representation of Streptococcus pneumoniae genome comparisons.
 What is different between these 2 sequences? GGAATTCCTAGCAAT CCTTAAGGATCGTTA CTACGTGAGGAATTC GATGCACTCCTTAAG.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Visualizing Biosciences Genomics & Proteomics. “Scientists Complete Rough Draft of Human Genome” - New York Times, June 26, 2000 The problem: –3 billion.
Transcription/Translation foldable
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
“Coding” for the building blocks of our bodies Edited
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Volume 10, Issue 3, Pages (September 2011)
A Corkscrew Model for Dynamin Constriction
Identification and Characterization of pre-miRNA Candidates in the C
Volume 2, Issue 1, Pages (July 2012)
Volume 4, Issue 3, Pages (March 1996)
Michael Cullen, Stephen P
Bacterial genomics: The controlled chaos of shifty pathogens
Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets  Benjamin P. Lewis, Christopher B. Burge,
Atomic Model of CPV Reveals the Mechanism Used by This Single-Shelled Virus to Economically Carry Out Functions Conserved in Multishelled Reoviruses 
Essential Question: How cells make proteins
Debanu Das, Millie M Georgiadis  Structure 
Genomic rearrangements of E
Volume 20, Issue 12, Pages (June 2010)
Luisa De Sordi, Varun Khanna, Laurent Debarbieux  Cell Host & Microbe 
The Release 5.1 Annotation of Drosophila melanogaster Heterochromatin
Volume 6, Issue 11, Pages (November 1998)
Volume 154, Issue 1, Pages (July 2013)
Size Polymorphisms in the Human Ultrahigh Sulfur Hair Keratin-Associated Protein 4, KAP4, Gene Family  Naoyuki Kariya, Yutaka Shimomura, Masaaki Ito 
Circular map of the chromosome of M. tuberculosis H37Rv.
Venn diagram of blaKPC-containing plasmids detected in the environment and patients. Venn diagram of blaKPC-containing plasmids detected in the environment.
Volume 25, Issue 12, Pages e3 (December 2017)
Volume 16, Issue 5, Pages (December 2004)
The Complete Genome Sequence of Escherichia coli K-12
Volume 27, Issue 24, Pages e6 (December 2017)
Volume 16, Issue 8, Pages (August 2008)
Architecture of the chloroplast 70S ribosome
Volume 128, Issue 6, Pages (March 2007)
The mosaic genome structure and phylogeny of Shiga toxin-producing Escherichia coli O104:H4 is driven by short-term adaptation  K. Zhou, M. Ferdous, R.F.
Volume 17, Issue 6, Pages (June 2009)
Volume 16, Issue 5, Pages (December 2004)
Phylogenetic tree of 170 Lactobacillus species.
Human Promoters Are Intrinsically Directional
Sequence alignment of PHCCEx domains with secondary structure elements of the Tiam2 PHCCEx domain at the top. Sequence alignment of PHCCEx domains with.
A B C Supplementary Figure S1 Fluorescence microscopy of cultures enriched with TAO100. Cultures were stained with DAPI (a), FITC-labeled DNA probe for.
ADAGE weights reflect a gene’s common regulatory and process features.
Volume 6, Issue 6, Pages (December 2000)
Anne Dallas, Harry F Noller  Molecular Cell 
Replica Exchange Molecular Dynamics Simulations Provide Insight into Substrate Recognition by Small Heat Shock Proteins  Sunita Patel, Elizabeth Vierling,
Structural Flexibility of CaV1. 2 and CaV2
Characterization of New Members of the Human Type II Keratin Gene Family and a General Evaluation of the Keratin Gene Domain on Chromosome 12q13.13  Michael.
Volume 108, Issue 1, Pages (January 2015)
Whole-genome comparison of E
A multitiered approach to characterize transcriptome structure.
Maximum likelihood phylogeny of USA500 and other CC8 strains.
Fig. 2 Overview of transcriptional mutagenesis in yeast.
Volume 41, Issue 2, Pages (January 2011)
Structural insights based on chimeric Alp4-GCP2 analysis.
Volume 11, Issue 7, Pages (May 2015)
Adam T. McGeoch, Stephen D. Bell  Cell 
by Pan Tao, Xiaorong Wu, and Venigalla Rao
Long noncoding RNAs are distributed throughout the genome and are characteristically distinct from annotated mRNAs. Long noncoding RNAs are distributed.
Intrastrand biases (nucleotide skew) in the T4 genome.
Debanu Das, Millie M Georgiadis  Structure 
Integrated analysis of gene expression and copy number alterations.
Presentation transcript:

3. genome analysis

 The first DNA-based genome to be sequenced in its entirety was that of bacteriophage Φ-X174; (5,368 bp), sequenced by Frederick Sanger in 1976 3. genome analysis

 The first DNA-based genome to be sequenced in its entirety was that of bacteriophage Φ-X174; (5,368 bp), sequenced by Frederick Sanger in 1976 There are several things to notice in this plot. First, the genome is circular. The density of the four nucleotides are plotted in the four outer-most circles. This density is not evenly distributed; although all four of the scales range from 0% (min., no colour) to 40% (max colour intensity), it can be easily seen that the sequence is dominated by T's (red circle), and that there are relatively few G's (outermost turquoise circle) and C's (pink circle), and a few A-rich regions (green 2nd circle). There are many genes which overlap (the genes are indicated in the "annotation circle", which is the fifth circle from the outside - with the blue bands representing genes in the forward direction). GC Skew = (G - C)/(G + C) AT Skew = (A - T)/(A + T) http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/P/PhiX.html 3. genome analysis

exploring genome (and protein) sequences one language one basic tool 3. genome analysis

exploring genomes 3. genome analysis

exploring genomes 3. genome analysis

exploring genomes 3. genome analysis Broad Institute of MIT and Harvard Department of Energy Joint Genome Institute (Berkeley, USA) Institute for Genome Sciences (University of Mariland, Baltimore USA) 3. genome analysis

3. genome analysis

3. genome analysis

In whole genome shotgun sequencing (top), the entire genome is sheared randomly into small fragments (appropriately sized for sequencing) and then reassembled. In hierarchical shotgun sequencing (bottom), the genome is first broken into larger segments. After the order of these segments is deduced, they are further sheared into fragments appropriately sized for sequencing. 3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

Circular maps of the chromosome and plasmids of enteropathogenic E Circular maps of the chromosome and plasmids of enteropathogenic E. coli (da Iguchi A et al. J. Bacteriol. 2009) CDS = CoDing Sequence, region of nucleotides that corresponds to the sequence of amino acids in the predicted protein PP = prophage: a phage (viral) genome inserted and integrated into the circular bacterial DNA chromosome Circular maps of the chromosome and plasmids of EPEC strain E2348/69. (A) EPEC strain E2348/69 chromosome. From the outside in, the first circle shows the locations of PPs and IEs (purple, lambda-like PPs; light blue, other PPs; green, IEs and the LEE element), the second circle shows the nucleotide sequence positions (in Mbp), the third and fourth circles show CDSs transcribed clockwise and anticlockwise, respectively (gray, conserved in all eight other sequenced E. coli strains; red, conserved only in the B2 phylogroup; yellow, variable distribution; blue, E2348/69 specific), the fifth circle shows the tRNA genes (red), the sixth circle shows the rRNA operons (blue), the seventh circle shows the G+C content, and the eighth circle shows the GC skew. (B) EPEC strain E2348/69 plasmids. The boxes in the outer and inner circles represent CDSs transcribed clockwise and anticlockwise, respectively. Pseudogenes are indicated by black boxes, and other CDSs are indicated by the colors described above for panel A. IE = integrative elements Circular maps of the chromosome and plasmids of EPEC strain E2348/69. (A) EPEC strain E2348/69 chromosome. From the outside in, the first circle shows the locations of PPs and IEs (purple, lambda-like PPs; light blue, other PPs; green, IEs and the LEE element), the second circle shows the nucleotide sequence positions (in Mbp), the third and fourth circles show CDSs transcribed clockwise and anticlockwise, respectively (gray, conserved in all eight other sequenced E. coli strains; red, conserved only in the B2 phylogroup; yellow, variable distribution; blue, E2348/69 specific), the fifth circle shows the tRNA genes (red), the sixth circle shows the rRNA operons (blue), the seventh circle shows the G+C content, and the eighth circle shows the GC skew. (B) EPEC strain E2348/69 plasmids. The boxes in the outer and inner circles represent CDSs transcribed clockwise and anticlockwise, respectively. Pseudogenes are indicated by black boxes, and other CDSs are indicated by the colors described above for panel A. 3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

3. genome analysis

Structural genomics NMR DNA Protein Structure Algorithm X Ray diffractometry NMR Residue    THR 0.0 147.7 172.9 107.2 -125.3 187.4 CYS 123.4 63.6 103.7 PRO 60.3 83.9 -116.7 Protein Structure cryo-electron tomography DNA 0101#01001010#10111010# 01010001#10010#1001#101 10010#100100100101011#0 Algorithm 3. genome analysis