Day Two. DAY TWO 9:00 – 9:10Recap of day one 9:10 – 9:55TOPAAS demo (Sander) 9:55 – 10:15Coffee break 10:30 – 11:30New Technology Data 11:30 – 12:30High.

Slides:



Advertisements
Similar presentations
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Advertisements

Lecture 14 Genome sequencing projects
9 Genomics and Beyond Brief Chapter Outline
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
CSE182-L12 Gene Finding.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Workshop in Bioinformatics 2010 Class # Class 8 March 2010.
Cross-curricular Assignment Using your case study…
Expanding the Tool Kit for BAC Extension Summary of completion criteria developed for NSF Tomato Sequencing Workshop January 14, 2007.
International Tomato Finishing Workshop Wellcome Trust Sanger Institute April 2007 Wellcome Trust Medical Photographic Library.
Novel multi-platform next generation assembly methods for mammalian genomes The Baylor College of Medicine, Australian Government and University of Connecticut.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
CS273a Lecture 4, Autumn 08, Batzoglou Hierarchical Sequencing.
Genome sequencing and assembling
INDIAN INITIATIVE FOR TOMATO GENOME SEQUENCING Tomato Finishing Workshop T. R. Sharma National Research Centre on Plant Biotechnology Indian Agricultural.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
Genome Sequencing. Bacteriophage fX174, the first genome to be sequenced, is a viral genome with only 5,368 base pairs (bp). Fred Sanger invented "shotgun"
Genome Sequencing and Assembly High throughput Sequencing Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Federated Searching Pre-Conference Workshop - The federated searching cookbook Qin Zhu HP Labs Research Library February 18, 2007.
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Mouse Genome Sequencing
Large-scale genome projects
Chromosome 8 Sequencing: Current Status and Future Prospects toward Finishing Shusei Sato, Erika Asamizu, Takakazu Kaneko, Hiroyuki Fukuoka, Satoshi Tabata.
Solanum lycopersicum Chromosome 4 Sequencing Update SOL Germany– October 2008 Wellcome Trust Medical Photographic Library.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
SOL 2008 October 12-16, Cologne, Germany CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966.
Implementing ERMs: Opportunities and Challenges Jeff Campbell, Systems Librarian, UNC Chapel Hill Rebecca Kemp, Serials Supervisor, UNC Wilmington 2007.
Sequence assembly using paired- end short tags Pramila Ariyaratne Genome Institute of Singapore SOC-FOS-SICS Joint Workshop on Computational Analysis of.
Bioinformatics Overview, NCBI & GenBank JanPlan 2012.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
The Changing Face of Sequencing
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
FINISHING WORKSHOP APRIL 2008 CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15.
Theobroma cacao Integrated Physical and Genetic Map 2 BAC Libraries 250 Genetic Markers.
Finishing tomato chromosomes #6 and #12 using a Next Generation whole genome shotgun approach Roeland van Ham, CBSG, NL René Klein Lankhorst, EUSOL Giovanni.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
中国农业科学院蔬菜花卉研究所 Institute of Vegetables and Flowers Chinese Academy of Agricultural Sciences Zhonghua Zhang Institute of Vegetables and Flowers, Chinese.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, C. Perla 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
HeterochromatinEuchromatin Relative chromosome length Relative bivalent diameter X 1.23 X 1.00 Relative area Relative optical density.
Applied Bioinformatics Week 5. Topics Cleaning of Nucleotide Sequences Assembly of Nucleotide Reads.
1.Data production 2.General outline of assembly strategy.
Human Genome.
2nd TOMATO FINISHING WORKSHOP chromosome 9 Wageningen, April 24-25, 2008.
System Test Planning SYSTTPLAN 1 Location of Test Planning Responsibilities for Test Planning Results of Test Planning Structure of a Test Plan Test Definitions.
Mojavensis: Issues of Polymorphisms Chris Shaffer GEP 2009 Washington University.
13 th January 2008 Plant & Animal Genome Conference Progress with Sequencing Tomato Chromosome 4 Clare Riddle Tomato Project Group Wellcome Trust Sanger.
Genome representation and variant identification Deanna M. Church, NCBI.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
Sequencing Chromosome 12. runs db (blast) SOL dbrelational db Choice of suitable seed BACs Running 96 samples For each BAC check db update db update dbcheck.
26 th July 2006 Christine Nicholson, Mapping Core Group Karen McLaren, Finishing Group Leader Wellcome Trust Sanger Institute Sequencing the Gene Space.
Chapter 5 Sequence Assembly: Assembling the Human Genome.
Web-CAT: Automatic grading using student-written tests Web-CAT: Grade it your way Decide when and how students can submit, including.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
MICROBIOLOGIA GENERALE Prokaryotic genomes. The prokaryotic genome.
MICROBIOLOGIA GENERALE Prokaryotic genomes. The Escherichia coli nucleoid.
Management Information System (MIS) MIS is short for management information system or management information services. Management information system,
Virginia Commonwealth University
Presented By: Chinua Umoja
COMPUTATIONAL GENOMICS GENOME ASSEMBLY
Databases BI420 – Introduction to Bioinformatics Gabor T. Marth
Identification and Characterization of pre-miRNA Candidates in the C
Introduction to Sequencing
Databases BI420 – Introduction to Bioinformatics Gabor T. Marth
SOL Jeju Island, Korea 14th September 2007
Presentation transcript:

Day Two

DAY TWO 9:00 – 9:10Recap of day one 9:10 – 9:55TOPAAS demo (Sander) 9:55 – 10:15Coffee break 10:30 – 11:30New Technology Data 11:30 – 12:30High Repeat Content Clones 12:30 – 13:30Lunch 13:30 – 14:30Alternative chemistry SIL and TIL library utilisation 14:30 – 15:00Available Tools and Databases on SGN (Lukas) END

Issues Arising  Criteria for `unmapped` BAC procedure added to standards document – include IL mapping centrally  Trace file submission and naming convention – needs further discussion  SBM Blast server available for data mining the Selected BAC Mixture Shotgun - add consensus values to the contigs to increase utility  Some BACs remain at Phase 2 due to complex repeats  Cases of potential deletion/ duplication in Korea Chrm 2 set  Overlap criteria and checks added to the standards document

Use of restriction digest data  Not all groups currently make use of restriction digest data  Lukas to investigate central Restriction Digest Resource  Mail to PIs for agreement for making a requirement or not Use of misc_feature tags  Not all groups currently use this system  Reduce list down to essential misc_feature tags  Mail to PIs for agreement

TOPAAS – Sander Peters

New Technology Data Types 454 runs have been done not submitted Both Illumina and SOLiD planned How do we submit this data to HTGS? Do we need to make the data type distinction clear? Small read storage will be an issue.

International Finishing Standards Discussion Attempting to capture information about the sequence type etc There are no agreed quality scores for the New Tech Data

Categories and Current Assigned Codes 1. Technology – order is ~chronological in terms of WTSI usage A – Slab Gels (AB 373, AB 377) B - Megabase C – Capillary (AB 3100, AB 3700, AB 3730) D – 454 E – Solexa F – ABI Solid 2. Data Coverage This will be represented as a numeric value in accordance with how much coverage there is over the target sequence. 3. Assembler – order assigned is arbitrary A – Newbler (454) B – Arachne C – Phrap D – Phusion-phrap E – Cemos F – Mosaic (combined assembly) G – Sakke (solexa) etc 4. Post-shotgun Intervention – currently divided into broad tasks – the finer definition will see more division between Eukaryotes and Prokaryotes. A – No post-shotgun intervention at all B – Targeted finishing (contig or specific loci) C – Contiguate D – Not contiguous but contigs ordered and orientated E – Automated Sequence Improvement F – Manual Intervention with no chemistry G – Finished for prokaryotes – Sequence Improved for eukaryotes H – Finished clone or chromosome

Life after the workshop……. Brief Workshop report and minutes to be written up Lukas to re-circulate standards documents to group Add section to the next newsletter Await decision on digests and misc_feature tags Have a safe journey home and thank you all for your effort and input

Consortium have been set up since 2005 Promoting a schema for minimum information about a genome sequence and metagenome samples/sequences MIGs and MIMs May prove useful for comparative sequencing – very applicable to pathogen genomes