Sequence-Specific DNA Uptake


Genetic exchange & bacterial evolution
DNA uptake is a primitive form of genetic exchange
Some important human pathogens have DNA uptake systems
–Haemophilus influenzae, Neisseria meningitidis, N. gonorrhoeae, etc.
H. influenzae and N. meningitidis preferentially take up homologous DNA by recognizing an uptake signal sequence (USS)

USS in H. influenzae
The 1.86 Mbp H. influenzae genome has 1471 copies of the 29-base USS
The USS contains a 9-base core, AAGTGCGGT, that is 100% conserved
1471 copies is about 100 times the statistical average
–Occupies 2.4% of the whole genome
Questions:
–Why so many?
–How did this evolve?
–What is the cost?
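The "100 times statistical average" figure can be reproduced with a back-of-the-envelope calculation. The sketch below is not from the slides: it assumes all four bases are equally likely and independent, and that the 9-base core is counted on both strands.

```python
# Assumption (not from the slides): uniform, independent base composition.
GENOME_BP = 1.86e6   # genome size quoted on the slide
CORE_LEN = 9         # length of the conserved core AAGTGCGGT
OBSERVED = 1471      # USS copies quoted on the slide


def expected_kmer_count(genome_bp, k, both_strands=True):
    """Expected number of occurrences of one specific k-mer in a random genome."""
    strands = 2 if both_strands else 1
    return strands * genome_bp / 4 ** k


expected = expected_kmer_count(GENOME_BP, CORE_LEN)
enrichment = OBSERVED / expected
print(f"expected copies: {expected:.1f}")
print(f"enrichment: {enrichment:.0f}x")
```

Under these assumptions a random genome of this size would carry about 14 copies of the core, so 1471 copies is roughly a 100-fold excess.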

Distribution of USS in H. influenzae
68% (975/1471) of USS lie in 38% (656/1378) of genes.
433 genes have one USS, 152 have two, 56 have three, 8 have four, 6 have five, and one (HI1685) has eight.
–Focus on genes with a single USS
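As a quick cross-check, the per-gene counts can be tallied against the totals quoted above. This is only an arithmetic sketch; the dictionary name is illustrative.

```python
# Number of genes (value) that carry a given number of USS copies (key),
# taken from the slide.
genes_by_uss_count = {1: 433, 2: 152, 3: 56, 4: 8, 5: 6, 8: 1}

genes_with_uss = sum(genes_by_uss_count.values())
uss_in_genes = sum(n * genes for n, genes in genes_by_uss_count.items())

print(genes_with_uss)  # 656
print(uss_in_genes)    # 975
```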

USS and UEP
USS: 9-base oligo embedded in a gene (DNA)
UEP: when the gene is expressed as protein, the USS is translated into a 3- or 4-residue USS-encoded peptide (amino acids)
–3-residue UEPs: 60% (39/618) are TAL
–4-residue UEPs: 63% (269/426) contain SAV
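To see where 3- and 4-residue peptides such as TAL and SAV can come from, the 9-base core can be translated in all six reading frames. The sketch below assumes the standard genetic code and translates only codons that lie entirely inside the core; the slides do not say which strand or frame each UEP uses, so the frame labels are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

USS_CORE = "AAGTGCGGT"


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_complete_codons(seq, offset):
    """Translate the codons that fit entirely inside seq, starting at offset."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


frames = {}
for label, strand in (("+", USS_CORE), ("-", reverse_complement(USS_CORE))):
    for offset in range(3):
        frames[f"{label}{offset}"] = translate_complete_codons(strand, offset)

for name, peptide in frames.items():
    print(name, peptide)

# In frame +1 the core ends with the partial codon "GT"; whatever base
# follows, GTN encodes valine, so this frame reads x-S-A-V (4 residues).
trailing = {CODON_TABLE["GT" + n] for n in "ACGT"}
print(trailing)
```

In this sketch the reverse strand in frame 0 gives TAL (a 3-residue peptide), while the forward strand in frame +1 gives S-A followed by a valine fixed by the trailing GT, which is consistent with 4-residue UEPs that contain SAV.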

Methods for studying cost of embedding USS
1. Conservation of UEP sites in homologs
–Study conservation of UEP sites in the host sequence and of the corresponding sites in homologs
2. Conservation of the segment containing the UEP
–Study conservation of the segment containing the UEP relative to other segments within the same protein

Conservation of UEP sites in homologs
Compute matching scores between:
–Query & matches (qm); matches & matches (mm)
–UEP & mi; mi & mj (mi, mj: matches i and j)
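A minimal sketch of the query-vs-match (qm) and match-vs-match (mm) comparison follows. The slides do not state the scoring scheme, so plain fractional identity over pre-aligned, equal-length sequences is assumed here; the toy sequences, the function names, and the UEP column range are all hypothetical.

```python
from itertools import combinations


def identity(a, b, columns):
    """Fraction of the given alignment columns where a and b share a residue."""
    same = sum(1 for c in columns if a[c] == b[c] and a[c] != "-")
    return same / len(columns)


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def site_scores(query, matches, uep_columns):
    """Return (qm, mm, QM, MM): lowercase = UEP columns, uppercase = all columns."""
    all_columns = range(len(query))
    qm = mean(identity(query, m, uep_columns) for m in matches)
    mm = mean(identity(a, b, uep_columns) for a, b in combinations(matches, 2))
    QM = mean(identity(query, m, all_columns) for m in matches)
    MM = mean(identity(a, b, all_columns) for a, b in combinations(matches, 2))
    return qm, mm, QM, MM


# Toy alignment (hypothetical): the query carries "SAV" at columns 7-9.
query = "MKTALLVSAVGH"
matches = ["MKSALLVSAIGH", "MRTALIVTAVGH"]
qm, mm, QM, MM = site_scores(query, matches, range(7, 10))
print(f"qm={qm:.2f} mm={mm:.2f} qm/QM={qm / QM:.2f} mm/MM={mm / MM:.2f}")
```

Dividing the site scores by the whole-sequence scores (qm/QM, mm/MM) normalizes for how similar the sequences are overall, so that UEP sites in closely and distantly related homologs can be compared on one plot.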

BLAST search for homologs. First red line is query sequence, rest are matches. Present case has two high-similarity matches.

Each point: one UEP in one protein.
(a) Ave. qm score (y) vs. mm score (x)
(b) Ave. qm/QM score (y) vs. mm/MM score (x)
UEP sites in the protein are not less conserved than the corresponding sites in homologs.

Conservation of segment containing UEP
Segmentation of protein sequence containing UEP. XXX is the position of the UEP (USS-encoded peptide).

Relative segment similarity scores in gene HI0027. Yellow bar is segment containing UEP.

Summary of relative sector scores for 473 proteins.
Each point: one protein. Y: score of UEP-containing sector. X: (a) ave. of all sectors; (b) lowest quartile; (c) 3rd quartile; (d) 1st quartile.
UEP almost never in the most conserved sites.
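The segment comparison can be sketched as follows. The slides do not specify the segment length or the similarity score, so this example assumes fixed-length segments and uses the share of the most common residue per column as a stand-in conservation measure; the toy alignment and all names are hypothetical.

```python
def column_conservation(alignment, col):
    """Share of sequences carrying the most common residue in one column."""
    residues = [seq[col] for seq in alignment]
    return max(residues.count(r) for r in set(residues)) / len(residues)


def segment_scores(alignment, seg_len):
    """Mean column conservation for consecutive segments of seg_len columns."""
    n_cols = len(alignment[0])
    scores = []
    for start in range(0, n_cols, seg_len):
        cols = range(start, min(start + seg_len, n_cols))
        total = sum(column_conservation(alignment, c) for c in cols)
        scores.append(total / len(cols))
    return scores


def rank_fraction(scores, index):
    """Fraction of the other segments that score lower than segment `index`."""
    others = scores[:index] + scores[index + 1:]
    return sum(s < scores[index] for s in others) / len(others)


# Toy alignment (hypothetical): query first, UEP "SAV" at columns 6-8.
alignment = [
    "MKTAYLSAVGHW",
    "MKTAYLTAIGHW",
    "MKTSYLSGVGHW",
]
SEG_LEN = 3
uep_segment = 6 // SEG_LEN  # index of the segment holding the UEP

scores = segment_scores(alignment, SEG_LEN)
print([round(s, 2) for s in scores])
print(rank_fraction(scores, uep_segment))
```

A rank fraction near 0 means the UEP-containing segment is among the least conserved in its protein, which is the pattern the slide reports for most of the 473 proteins.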

Summary
UEP is neither more nor less conserved in the protein than the corresponding sites in homologs of the protein.
–If it were less conserved, that would imply some disruption of the protein, at some cost.
–Result implies the cost is not detectable by this method.
The segment containing the UEP is almost always in or close to the least conserved sections of the protein.
–Suggests that UEPs embedded in highly conserved sections of proteins were eliminated by evolution
–Explains the result of the first test
Multiplication of USS did interact with evolution.
