Download presentation
Presentation is loading. Please wait.
Published byNelson Nelson Modified over 9 years ago
1
Sequence Analysis with Artemis & Artemis Comparison Tool (ACT) South East Asian Training Course on Bioinformatics Applied to Tropical Diseases - 2005 (Sponsored by UNDP/World Bank/WHO/TDR) International Centre For Genetic Engineering And Biotechnology, New Delhi, INDIA
2
Overview of the genome sequencing and sequence analysis. Demonstration of Artemis. Hands on guided exercise in Artemis. Demonstration of ACT. Hands on guided exercise in ACT Generating ACT comparison files Workshop Overview
3
Wellcome Trust Photo Library The Wellcome Trust Sanger Institute Funded by The Wellcome Trust, a registered charity. Established in 1993 to begin the Human genome project. First Draft (2000) complete (2003-4) Data release policy: All sequence data is released immediately and is freely available via the internet in order to maximise its benefit for research. http://www.sanger.ac.uk ftp://ftp.sanger.ac.uk/ Wellcome Trust Photo Library
4
Generating the complete genome sequence
5
Infrastructure
6
Levels of automation Colony picking robots Plasmid preps robots TOTAL:140 ABI3700 ABI3730
7
Automated sequencing Each ABI reads 96 DNA sequences at once. The machines are run 10 times a day, 7 days a week. Throughput of 1,200 to 1,300 96-well plates per day ± 120,000 DNA samples read each day. Each day, the Sanger Institute reads 60 million base pairs. That’s equal to one of the smaller human chromosomes and many times that of an average bacterial genome.
8
Pathogen Sequencing Unit http://www.sanger.ac.uk/Projects/Microbes Bacteria: M. tuberculosis M. leprae Y. pestis S. typhi C. Diphtheriae Bordetella spp. x3 B. pseudomallei S. aureus MRSA S. aureus MSSA E. carrotovora Yeasts and Fungi: Saccharomyces cerevisiae Schizosaccharomyces pombe Aspergillus fumigatus Candida dubliniensis Candida parapsilosis Protozoa: Plasmodium falciparum X3 Plasmodium spp. X5 Leishmania spp. Trypanosoma spp. Eimeria Theileria Babesia The Pathogen Group is funded by the Beowulf Genomics Initiative to sequence the genomes of a wide range of small Eukaryotes and microbes.
9
Sequencing strategy and assembly
10
Contiguous sequence DNA pUC clone end sequence physical gap sequence gap Shotgun sequencing – strategy ‘Draft sequence’ Order of contigs? 95% coverage, 4-5x depth.
11
‘A genome in a day’ ‘15 in a month’ ‘High-quality draft sequence’
12
Contiguous sequence DNA pUC clone end sequence large clone end sequence physical gap sequence gap Shotgun sequencing – strategy Finished sequence: 100% coverage, 10x depth.
13
Repeats!!!
14
Shotgun assembly - Yersinia pestis
15
Primary DNA sequence Dotter BlastN BlastX Gene finders tRNA scan RepeatsPseudo-genesrRNA Genes tRNA Manual curation
16
Primary DNA sequence Dotter BlastN BlastX Gene finders tRNA scan RepeatsPseudo-genesrRNA Genes tRNA FastaBlastPPfamPrositePsortSignalPTMHMM Manual curation Manual curation Annotated sequence
17
PSU Projects Organism Annotated genome Finished genome Database entry Artemis
18
Sequence viewer and analysis tool –Visualization of sequence features DNA Six frame translation –Perform and view analysis Basic analysis Launch more complex analysis and searches Import and view the results of other searches
19
Outline of Artemis demonstration Artemis window features Open a genome sequence Changing the view Getting around –Goto Menu –Navigator –Feature Selector Basic analysis –Edit a feature –Fasta search –Show feature plots
20
Artemis Sliders Drop Down Menus Entry Button Line Main Sequence View Panel Magnified Sequence View Panel Feature Menu Drop Down Menus Entry Button Line Main Sequence View Panel Magnified Sequence View Panel Feature Menu
21
Artemis
22
Curating gene models in Artemis Use of multiple lines of evidence
23
Curating gene models in Artemis Use of FASTA evidence
29
EST sequencing & mapping AAAAAAAAAA CAP AAAAAAAAAA CAP TTTTTTTTT intron exon 5’UTR M stop 3’UTR EST cDNA mRNA
30
ESTs Curating gene models in Artemis Use of EST evidence
32
Curation of gene models in Artemis Mapping proteome fragments to genome
33
Curation and annotation in Artemis Mapping InterPro domain hits to genome
34
Finished sequence Gene Finder PHAT Glimmer Orpheus FASTA BLAST EST Primary gene model Annotation of pathogen genomes at the PSU (using ARTEMIS) Complete Annotation Organism-specific gene familiesFunctional classification (GO / Riley) Comparative genomics (using ACT) Refined gene model InterPro scan HMMPfam HMMSMART PRINTS PROSITE ProDom TIGRFAMs Manual curation SignalP TMHMM t-RNA scan
35
Gene model annotation Gene function
36
Top tips! Manual annotation. Use a several lines of evidence: - Run several available gene finding programs - Search programs: local (BLAST) and global (FASTA) alignments -Protein domains and motifs: Interpro (Pfam, prosite, SMART etc.) -Transmembrane / signal peptide prediction (TMHMM, SignalP) - Base your annotation on characterised proteins where possible (e.g. UNIPROT entry) - Read the literature (Pubmed entry)
37
Sanger Front page
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.