Figure 1a. Insertion of sequence into Claudi capsid gene

Slides:



Advertisements
Similar presentations
Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.
Advertisements

EVOLUTIONARY CHANGE IN DNA SEQUENCES - usually too slow to monitor directly… … so use comparative analysis of 2 sequences which share a common ancestor.
Expect value Expect value (E-value) Expected number of hits, of equivalent or better score, found by random chance in a database of the size.
BLAST Tutorial 3 What is BLAST? Basic Local Alignment Search Tool Is a set of similarity search programs designed to explore sequence databases. What are.
Computational Biology, Part 2 Sequence Comparison with Dot Matrices Robert F. Murphy Copyright  1996, All rights reserved.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
Inferring function by homology The fact that functionally important aspects of sequences are conserved across evolutionary time allows us to find, by homology.
Locating genes in Plasmodium falciparum You have seen how artemis is used to view, analyse and annotate bacterial genomes, but now we are going to move.
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Bikash Shakya Emma Lang Jorge Diaz.  BLASTx entire sequence against 9 plant genomes. RepeatMasker  55.47% repetitive sequences  82.5% retroelements.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Computational Biology, Part 3 Sequence Alignment Robert F. Murphy Copyright  1996, All rights reserved.
BIOINFORMATICS IN BIOCHEMISTRY Bioinformatics– a field at the interface of molecular biology, computer science, and mathematics Bioinformatics focuses.
Copyright OpenHelix. No use or reproduction without express written consent1.
LOC_Os02g08480 Supplementary Figure S1. Exons shorter than a read length have few or no reads aligned. The gene at LOC_Os02g08040 contains exons shorter.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Sequence-based Similarity Module (BLAST & CDD only ) & Horizontal Gene Transfer Module (Ortholog Neighborhood & GC content only)
1 P6a Extra Discussion Slides Part 1. 2 Section A.
A Tutorial of Sequence Matching in Oracle Haifeng Ji* and Gang Qian** * Oklahoma City Community College ** University of Central Oklahoma.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Point Specific Alignment Methods PSI – BLAST & PHI – BLAST.
Doug Raiford Phage class: introduction to sequence databases.
Bacillus Phage Annotation Phage Lab II, Spring 2015.
Summer Bioinformatics Workshop 2008 BLAST Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State University – Rochester Center
 Series of enzyme catalyzed reactions  Glycolysis - citrate cycle – oxidative phosphorylation  Sugar -> energy.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
Bacterial infection by lytic virus
Bacterial infection by lytic virus
C acaatataATGGAGCGTGAGACTTCGTCATCTTCAACTCCTCCGGAGGATCTTGTTACATCGATGATCGGAAAGTTCGTCGCTGTCATGTCTA b acaatataATGGAGCGTGAGACTTCGTCATCTTCAACTCCTCCGGAGGATCTTGTTACATCGATGATCGGAAAGTTCGTCGCTGTCTTGTCTA.
Scoring Sequence Alignments Calculating E
22 kDa a-coixin gene cluster
Genome Center of Wisconsin, UW-Madison
Mark M Metzstein, H.Robert Horvitz  Molecular Cell 
Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211  Michael A. Rogers, Hermelita Winter,
BLAST.
Marrying structure and genomics
Bacterial genomics: The controlled chaos of shifty pathogens
Comparative Genomics.
What do you with a whole genome sequence?
Basic Local Alignment Search Tool
Characterization of Human KAP24
Volume 117, Issue 3, Pages (September 1999)
Size Polymorphisms in the Human Ultrahigh Sulfur Hair Keratin-Associated Protein 4, KAP4, Gene Family  Naoyuki Kariya, Yutaka Shimomura, Masaaki Ito 
Volume 26, Issue 2, Pages e3 (February 2018)
A Novel Family of Candidate Pheromone Receptors in Mammals
Supplemental Figure 3 A B C T-DNA 1 2 RGLG1 2329bp 3 T-DNA 1 2 RGLG2
Genomic context, sequence identity, and structural homology modeling of the aadA31 gene detected in P. multocida PM13 and H. somni HS31. Genomic context,
Anton Arkhipov, Wouter H. Roos, Gijs J.L. Wuite, Klaus Schulten 
Scot A Wolfe, Elizabeth I Ramm, Carl O Pabo  Structure 
Volume 91, Issue 7, Pages (December 1997)
Jun Xu, Roger W. Hendrix, Robert L. Duda  Molecular Cell 
Gautam Dey, Tobias Meyer  Cell Systems 
Michael A. Rogers, Hermelita Winter, Christian Wolf, Jürgen Schweizer 
Complete Haplotype Sequence of the Human Immunoglobulin Heavy-Chain Variable, Diversity, and Joining Genes and Characterization of Allelic and Copy-Number.
Volume 108, Issue 1, Pages (January 2015)
Basic Local Alignment Search Tool
Tagging of the endogenous Py03652 gene with the gfp gene in Plasmodium yoelii. Tagging of the endogenous Py03652 gene with the gfp gene in Plasmodium yoelii.
Volume 94, Issue 2, Pages (July 1998)
GOMASHI ANNOTATION GENES 1-13
Sequence Analysis Alan Christoffels
Figure 1. An outline of the novel ∼3
Fig. 2 Overview of transcriptional mutagenesis in yeast.
Figure 1. The overlap between Ensembl/GENCODE, RefSeq and UniProtKB genes. The number of genes classified as coding in ... Figure 1. The overlap between.
Rachelle Gaudet, Andrew Bohm, Paul B Sigler  Cell 
by Pan Tao, Xiaorong Wu, and Venigalla Rao
Splice isoforms of the JNK1, JNK2, and JNK3 proteins.
Knock-in of the rpl42-P56Q mutation using the split-ura4 system.
Presentation transcript:

Figure 1a. Insertion of sequence into Claudi capsid gene Domain 1, HK97-Like Capsid Fold Domain 2, Probability: 69.8, E-value: 9.2E-4 Bacterial Ig-like domain, group 2 KongoTrouble Phamerator map image of Claudi capsid protein (gene 31) region compared to KongoTrouble (gene 27). Claudi capsid is missing its Big2 domain (dotted line) due to insertion of sequence into the capsid gene. Domain labels are from HHPred. Claudi

Fig 1b. Insertion of sequence into Claudi capsid gene KT: 14053 KT: 14054 232 bp INSERTION TTAATATAA C: 14362 C: 14594 Insertion in Claudi capsid gene. Alignment of KongoTrouble and Claudi capsid gene sequences shows a 6-nucleotide repeat (AAAAGG) is present on either side of the inserted nucleotide sequence. The insertion includes a stop codon (in red) that allows translation to stop after completion of the HK97-like domain. Coding potential from Genemark is shown below each domain, showing the Big2 domain has high coding potential but no start codon.

Fig 1c. Alignment of insertion sequence 97% identity to Bacillus Prophage Insertion, 232 bp Claudi gp 31 ATTGAAATGACAAATGTTTACAATCCAAAAGGTTAATATAA gp 32 AAAAGGT- - ATACTGGAATTACTTCTA KongoTrouble ATTGAAATGACAAATGTTTACAATCCAAAAGGTCTATACTGGAACTACTTCTA gp 27 Potential mechanism of recombination for Claudi capsid gene. Sequence alignment includes alignment of the 6-nucleotide repeat sequence into a structure that might be recognized by a recombination enzyme.

Fig 2a. Insertion of gene into SerPounce genome Gene 44 has been inserted into SerPounce’s genome Blasting SerPounce against itself has shown a 77 bp repeat on either side of the gene Possible insertion mechanism Purple shading is due to the first insertion in SerPounce being identical to the a sequence in KongoTrouble Orange shading shows a near to identical match of the second repeat with the same sequence in KongoTrouble KongoTrouble SerPounce

Fig 2b. Insertion of gene into SerPounce genome Gene 44, 30% Identity with Listeria Phage Insertion SerPounce 77 bp repeat Gene 43 ATCCCCTTTATGATTGTTTTAGTTGTTTTCTTAACCTATAATTATTATATCATTTATGTGTTAAATCGTCAATAGAT KongoTrouble ATCCCCTTT-TG-TTGTTTTAGTTGTTTT-TTAACCTATAATTATTATAACATTTATGTGTTAAATCGTCAATAGAT Gene 40 ATCCCCTTTATGATTGTTTTAGTTGTTTTCTTAACCTATAATTATTATATCATTTATGTGTTAAATCGTCAATAGAT Visualization of SerPounce Gene 44 Insertion This protein isn’t in any other phage in the database. SerPounce gp44 doesn’t have Bacillus or Bacillus phage homologs. Best matches are to Listeria phage protein.

Missing second (incomplete) domain Has complete domain Sequence used, Claudi gene 31 and space till gene 32 (query) compared to ViolettaMad gene 16 (subject) Claudi is on top in blue and Violettamad is on the bottom in red Red bar show the similar sequence between the two genomes Above we see a that there was an insertion in the Claudi gene 31, creating a break in the red bar. The rest of the original gene is downstream show again in red.

Note, difference in protein sequence highlighted in orange. Claudi capsid protein has a missing domain, as seen when compared to the VIolettaMad above.

Panel B Insertion Mechanism Draft Gap sequence completely found in Claudi and SerPounce Genome Rest of hits are in Bacillus family How is this a mechanism? Ask Allison Claudi Gap results in blastn with repeats has a lot of hits look at pdf on wiki

Panel C Draft Sequence used, Claudi gene 31 and space till gene 32 (query) compared to KongoTrouble gene 27 (subject) Claudi is on top in blue and KongoTrouble is on the bottom in red Red bar show the similar sequence between the two genomes Above we see a that there was an insertion in the Claudi gene 31, creating a break in the red bar. The rest of the original gene is downstream show again in red. Results for blasting Claudi gap against KongoTrouble gene Complete gene domain is only found in the podoviruses (phamerator)

Panel D Someone is making an image or something here.