Mysterious Sequence Repeats in Phage Genomes AKHIL GARG BNFO 301 APRIL 30, 2015.

Slides:



Advertisements
Similar presentations
Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome ECS289A.
Advertisements

Diversity generating reverse transcriptase Reverse transcriptase an enzyme.
Determining the Future of A3 phages using Average Nucleotide Identity (ANI) and Immunity Testing Emily S., Emily N., Kris C., and Casey N. Abstract We.
Introduction to Bioinformatics Tuesday, 12 March.
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Short Dispersed Repeats KALEIGH, MARIAM, MICHAEL AND NICHOLAS.
Psi-BLAST, Prosite, UCSC Genome Browser Lecture 3.
Bi-correlation clustering algorithm for determining a set of co- regulated genes BIOINFORMATICS vol. 25 no Anindya Bhattacharya and Rajat K. De.
Finding approximate palindromes in genomic sequences.
MOPAC: Motif-finding by Preprocessing and Agglomerative Clustering from Microarrays Thomas R. Ioerger 1 Ganesh Rajagopalan 1 Debby Siegele 2 1 Department.
An analysis of “Alignments anchored on genomic landmarks can aid in the identification of regulatory elements” by Kannan Tharakaraman et al. Sarah Aerni.
The Arkin model of the lysis- lysogeny decision of phage lambda Tom Evans.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
Computational Biology, Part 2 Sequence Comparison with Dot Matrices Robert F. Murphy Copyright  1996, All rights reserved.
Identifying recombination events in phage Giles through presence of repeat sequences MEGAN MAIR.
Microbes in Medicine and Research Microbes and Society Fall 2007.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
The Search for Small Regulatory RNA Central Dogma: DNA to RNA to Protein Replication Processing / Translocation hnRNA rRNAtRNA mRNA.
Introduction to Bioinformatics Dot Plots. One of the simplest and oldest methods for sequence alignment Visualization of regions of similarity –Assign.
Regulatory factors 1) Gene copy number 2) Transcriptional control 2-1) Promoters 2-2) Terminators, attenuators and anti-terminators 2-3) Induction and.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Clustered Repeats and Regulatory Sites Abdulrahman Alazemi, Shahroze Abbas, Liam Lewis, Donald Ta, Ann Vo.
DNA Technology and Genomics Chapter 20 A. P. Biology Mr. Knowles Liberty Senior High School.
Computational Biology, Part 3 Sequence Alignment Robert F. Murphy Copyright  1996, All rights reserved.
Yersinia Palindromic Sequences Introduction to Bioinformatics 301 April 30th, 2015 Jordan Davis.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Mutation And Natural Selection how genomes record a history of mutations and their effects on survival Tina Hubler, Ph.D., University of North Alabama,
Used for detection of genetic diseases, forensics, paternity, evolutionary links Based on the characteristics of mammalian DNA Eukaryotic genome 1000x.
Jonathan Kindberg BNFO /24/2013 Is there a correlation between similar cluster phages’ tandem repeats within the tape measure protein and its location.
Gene Regulation, Part 1 Lecture 15 Fall Metabolic Control in Bacteria Regulate enzymes already present –Feedback Inhibition –Fast response Control.
Search for CRISPRs in mycobacteriophages
Identification of Tandem Repeats in Geobacillus and Heamophilious Phages Ramey Elsarrag.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Consensus SDR Sequence in E. coli near Important Genes Nav Saini BNFO 301 4/29/15.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Bioinformatics Lecture to accompany BLAST/ORF finder activity
Lysis or Lysogeny? Gene Regulation by CI (and CI like) Repressor Proteins By Firras Garada.
Inference with Gene Expression and Sequence Data BMI/CS 776 Mark Craven April 2002.
Doug Raiford Phage class: introduction to sequence databases.
Analysis: Tools for directly examining sequence What follows is a simulation of the proposed sequence interface. A PC-based prototype exists, but the interface.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. The sequence.
Chapter 19 The Organization & Control of Eukaryotic Genomes.
Page: Genial - Restriction Enzymes and Site-Specific DNA Cleavage Procedures for chemical isolation of DNA usually lead to random breakage of double-stranded.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
SC.912.L.16.3 DNA Replication. – During DNA replication, a double-stranded DNA molecule divides into two single strands. New nucleotides bond to each.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
Characterization of Transition Metal-Sensing Riboswitches
22 kDa a-coixin gene cluster
[Rz/Rz1, LysB/LysC, gp u/v] proteins of Lytic Cassette
Structure of proximal and distant regulatory elements in the human genome Ivan Ovcharenko Computational Biology Branch National Center for Biotechnology.
Ahnert, S. E., & Fink, T. M. A. (2016). Form and function in gene regulatory networks: the structure of network motifs determines fundamental properties.
CRISPR + CAS = Defensive or Immune System
attP region and attB regions
Inferred dynamic model of transcriptional regulatory network.
Behaviorally dependent allele-specific expression.
X-linked oogenic transcripts are expressed late in the germline
Transcriptional response to mating in whole A. gambiae females.
CRISPRs and Tandem Repeats
Mapping Global Histone Acetylation Patterns to Gene Expression
Nonspecific Protein-DNA Binding Is Widespread in the Yeast Genome
Volume 126, Issue 6, Pages (September 2006)
ncRNAs are developmentally regulated as well as mRNAs.
Human Promoters Are Intrinsically Directional
Structure of the Genome
Volume 110, Issue 4, Pages (August 2002)
Identification of TSIX, Encoding an RNA Antisense to Human XIST, Reveals Differences from its Murine Counterpart: Implications for X Inactivation  Barbara.
Extended analysis of differential expression datasets.
Expression of selected intergenic regions in response to increasing concentrations of XIP. qRT-PCR analysis of gene expression was performed with strain.
Integrated analysis of gene expression and copy number alterations.
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

Mysterious Sequence Repeats in Phage Genomes AKHIL GARG BNFO 301 APRIL 30, 2015

Phages are useful! Therapy for antibiotic-resistant bacteria Way of delivering genes to other organisms We need to understand how phage genes are regulated in order to use them

Phages – near the edge of life ◦Can’t replicate on their own ◦Yet have genomes endotoxin-test.com

Phage genomes Short: usually less than nucleotides (~1/10 or ~1/100 size of bacteria) Jam-packed with genes Color: genes Black: intergenic regions BioBIKE

And yet… There are patterns in the intergenic regions! On the left: a previously found 17-nucleotide intergenic palindromic repeat in the genomes of cluster O mycobacteriophages Could not have arisen by chance What is its function? Cresawn et al. (2015)

Other phages Do repeats occur in other phages? Cluster (group of closely related phages) A3 has a repeat: [TC]GT[GT][GC][GT][AC]TGTCAAGT (Also occurs in cluster A4 and A10)

Phage# forward# backward Bxz211 Jhc11711 Microwolf12 Rockstar102 Vix11 Heldan92 Methuselah12 Jobu0812 Phantastic91 Anubis12 Marquardt11 Phoxy12 Spike509*12 Farber11 QuinnKiro82 Tiffany11 Cluster A3: matches of [TC]GT[GT][GC][GT][AC]TGTCAAGT Very unlikely to arise by chance: p=1/10 11 ! Repeat does not exist in bacteria

These repeats Importance? Function?

Clues to function Where in the genome? Within closely-related phages, similar genes tend to be close to each other Take start coordinate of each repeat Divide coordinate by the length of the genome Plot results in a bar graph

Genome location - method Take start coordinate of each repeat Divide coordinate by the length of the genome

Location within genome Like the cluster O repeats, these repeats are concentrated in the right end of the genome Why? ◦Are certain genes nearby? Plot results in a bar graph

Limitations of genome location % Human created – genome location % has no biological meaning ◦only a way of searching for anomalies Need to look at specific genes at the right of genomes ◦How is the genome organized near its right? ◦Are the genes structural? Lysis-related? Replicative?

Conclusions Function? Not yet known. Need to research this repeat more ◦find more context (which genes are around the repeats) ◦characterize how it occurs in other clusters Could lead to insight on phage gene regulation ◦does the sequence regulate a specific subset of genes? ◦does the sequence control the amount of transcription or translation?