The Artemis Comparison Tool

Slides:



Advertisements
Similar presentations
Blast outputoutput. How to measure the similarity between two sequences Q: which one is a better match to the query ? Query: M A T W L Seq_A: M A T P.
Advertisements

          Sequence Analysis with Artemis and.
-1- Module 3: Comparative Genomics Module 3 Comparative Genomics Introduction The Artemis Comparison Tool (ACT), also written by Kim Rutherford, was designed.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Types of homology BLAST
PSI (position-specific iterated) BLAST The NCBI page described PSI blast as follows: “Position-Specific Iterated BLAST (PSI-BLAST) provides an automated,
Protein Modules An Introduction to Bioinformatics.
Genomic Rearrangements CS 374 – Algorithms in Biology Fall 2006 Nandhini N S.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Working with Pathogen Genomes
BLAST: Basic Local Alignment Search Tool Urmila Kulkarni-Kale Bioinformatics Centre University of Pune.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
Figure 1. P. Knowlesi top, six frame translation showing snap generated gene models (blue), contigs depicted alternate brown and orange. P falciparum (bottom)
Pairwise Alignment How do we tell whether two sequences are similar? BIO520 BioinformaticsJim Lund Assigned reading: Ch , Ch 5.1, get what you can.
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
An Introduction to Bioinformatics
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution Comparing whole genomes enhances – Our ability to understand.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
Module 3 Sequence and Protein Analysis (Using web-based tools) Working with Pathogen Genomes - Uruguay 2008.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
BLAST Basic Local Alignment Search Tool (Altschul et al. 1990)
NCBI resources II: web-based tools and ftp resources Yanbin Yin Fall 2014 Most materials are downloaded from ftp://ftp.ncbi.nih.gov/pub/education/ 1.
Chapter 21 Eukaryotic Genome Sequences
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. What do you.
Assignment feedback Everyone is doing very well!
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui.
Sequence Analysis with Artemis and Artemis Comparison Tool (ACT) Carribean Bioinformatics Workshop 18 th -29 th January, 2010.
Bioinformatic Tools for Comparative Genomics of Vectors Comparative Genomics.
Identification of Ortholog Groups by OrthoMCL Protein sequences from organisms of interest All-against-all BLASTP Between Species: Reciprocal best similarity.
Automated sequencing machines,
Comparative genomics Haixu Tang School of Informatics.
A Genomics View of Unix. General Unix Tips To use the command line start X11 and type commands into the “xterm” window A few things about unix commands:
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Database search. Overview : 1. FastA : is suitable for protein sequence searching 2. BLAST : is suitable for DNA, RNA, protein sequence searching.
Tweaking BLAST Although you normally see BLAST as a web page with boxes to place data in and tick boxes, etc., it is actually a command line program that.
David Wishart February 18th, 2004 Lecture 3 BLAST (c) 2004 CGDN.
Sequence Search Abhishek Niroula Department of Experimental Medical Science Lund University
MGM workshop. 19 Oct 2010 Some frequently-used Bioinformatics Tools Konstantinos Mavrommatis Prokaryotic Superprogram.
Annotation of eukaryotic genomes
What is BLAST? Basic BLAST search What is BLAST?
Practice -- BLAST search in your own computer 1.Download data file from the course web page, or Ensemble. Save in the blast\dbs folder. 2.Start a CMD window,
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
What is BLAST? Basic BLAST search What is BLAST?
Genetics and Evolutionary Biology
Basics of BLAST Basic BLAST Search - What is BLAST?
Basics of Comparative Genomics
Comparative Genomics.
How to use a bioinformatics website!
Genomes and their evolution
Genome Center of Wisconsin, UW-Madison
Bioinformatics and BLAST
By Chunfang Zheng and David Sankoff, 2014
BLAST.
Bio Do Now Get out natural selection lab
Identify D. melanogaster ortholog
Sequence alignment, Part 2
Comparative Genomics.
Basic Local Alignment Search Tool
Basic Local Alignment Search Tool (BLAST)
Pairwise Sequence Alignment
Bioinformatics Lecture 2 By: Dr. Mehdi Mansouri
Basics of Comparative Genomics
Basic Local Alignment Search Tool
Presentation transcript:

The Artemis Comparison Tool ACT

Comparative Genomics It might be expected that if two organisms diverged from a common ancestor, they should share: - similar gene content - similar gene organization However, genome evolution is a highly dynamic process: - gene acquisition and gene loss will change genome content - gene rearrangements will change the order of genes. Comparative genomics tell us about common and unique features between different organisms at the genome level. Identification of similarities and differences between genomes may allow us to understand: - How two organisms evolved - Why certain organisms cause diseases while others do not

Region sharing similarity Representing sequence similarity/identity MQALL… …RHP Region sharing similarity (BlastP) MAVV… …RHP

ACT- Artemis Comparison Tool

Orthologue of E. coli b0170 Orthologue of E. coli b0169 Orthologue of E. coli b0171

Genome 1 Genome 2 Region of Synteny

Genome 1 Genome 2 Rearrangements

Genome 1 Genome 2 Insertion or deletion

Whole genomes Chromosomal rearrangements Pathogenicity islands/prophages Gene acquisition and loss Gene rearrangements Pseudogenes Phase variation Single nucleotides

Prokaryotic examples

Whole genomes: S. typhi vs. E. coli SPI-2 SPI-9 SPI-1 SPI-10 SPI-7 Vi S. typhi DNA matches E. coli

Multiple genomes B. pertussis IS elements B. bronchiseptica prophage B. parapertussis IS elements

Whole genomes: S. aureus Mu50 vs. N315

Multiple genomes Mu50 N315 MRSA252 MW2 MSSA476

Pathogenicity islands/prophages tRNA-Phe type IV pilus sopE Vi antigen S. typhi E. coli

Gene acquisition and loss Insect toxin Y. pestis Y. enterocolitica

Small rearrangements

Pseudogenes secreted effector protein (type-III) S. typhi S. typhimurium

ACT Eukaryotic examples

Synteny:T.annulata and T. parva (Chr2) BlastN & TBlastX T. annulata T. parva

Synteny: TA & TP ACT comparison

T. annulata & T. parva comparison (Chr 3) Synteny Genus-specific expansion Species-specific expansion Species-specific genes

P. falciparum vs P. knowlesi: break in synteny Chrl3 P. knowlesi

Improving gene prediction with ACT P. falciparum chr13 P. knowlesi Comparison between orthologous genes in P. falciparum and P. knowlesi

ACT comparison: 3 malaria species P. yoelii P. falciparum P. knowlesi

“species-specific” genes at interruptuions in synteny Plasmodium falciparum Plasmodium knowlesi Plasmodium yoelii

ACT comparison of qut gene cluster

Generating ACT comparison files ACT supports three different comparison file formats: 1) BLAST version 2.2.2 output: The blastall command must be run with the -m 8 flag which generates one line of information per HSP. 2) MEGABLAST output: ACT can also read the output of MEGABLAST, which is part of the NCBI blast distribution. 3) MSPcrunch output: MSPcrunch is program for UNIX and GNU/Linux systems which can post-process BLAST version 1 output into an easier to read format. ACT can only read MSPcrunch output with the -d flag. Here is an example of an ACT readable comparison file generated by MSPcrunch -d. 1399 97.00 940 2539 sequence1.dna 1 1596 AF140550.seq 1033 93.00 9041 10501 sequence1.dna 9420 10880 AF140550.seq 828 95.00 6823 7890 sequence1.dna 7211 8276 AF140550.seq 773 94.00 2837 3841 sequence1.dna 2338 3342 AF140550.seq The columns have the following meanings (in order): score, percent identity, match start in the query sequence, match end in the query sequence, query sequence name, subject sequence start, subject sequence end, subject sequence name. The columns should be separated by single spaces.