GNPAnnot Community Annotation System applied to sugarcane BAC clone sequences. Valentin GUIGNON, PAG Sugarcane Genome Sequencing Initiative, Sunday, 16 January 2011.

Presentation transcript:

Slide 1: GNPAnnot Community Annotation System applied to sugarcane BAC clone sequences
Valentin GUIGNON
PAG Sugarcane Genome Sequencing Initiative
Sunday, 16 January 2011

Slide 2: What is GNPAnnot
- 9 partners
- BIVI, Spo, GDEC
- >10 studied species

Slide 3: What is GNPAnnot
- 3 bioinformatics platforms

Slide 4: Goals
- Automatic annotation pipeline for genes and repeats
- Complete manual annotation framework with
  - Data confidentiality
  - Inspection of manual annotation
  - Annotation history
- Comparative genomics
- Data query and report system

Slide 5: GNPAnnot Concept

Slide 6: In House Annotation Pipeline
Automatic gene structural & functional prediction
- Input: DNA sequence (BAC)
- STRUCTURAL: Eugene, FGenesH, Blastx, Genome Threader, SpliceMachine, Eugene HMM
- FUNCTIONAL: blastp, tblastn, Interproscan, BBMH, Greenphyl

Slide 7: Repeats Automatic Annotation
- Dawg Paws
- Repet

Slide 8: About our Annotation Pipelines
- Species-specific parameters
- Sugarcane trained on rice
- Already in use for full genomes
- We can process your sequences

Slides 9 to 15: Portal

Slides 16 to 18: GBrowse

Slides 19 to 21: Artemis

Slide 22: Artemis
Validations:
# Start/Stop codon validation:
-Sh253G12_g190:
  Start Codon: OK
  Stop Codon: OK
# Sequence validation:
-Sh253G12_g190:
  Length: ERROR: coding sequence length ( 883 bp) is not a multiple of 3!
# Introns validation:
-Sh253G12_g190
  Intron AG Site: ERROR: unrecognized acceptor site (*CA*GAAG at position from contig sequence begining) between exons 7 and 8!
# Mandatory properties management:
-Sh253G12_g190:
  Mandatory properties management: ERROR: missing /functional_completeness qualifier!
  Mandatory Properties Management: ERROR: missing /inference qualifier!
# Gene structure validation:
-Sh253G12_g190 (non-obsolete mRNA): OK
# Evidence code coherence management:
-Sh253G12_g190:
  Evidence Code Management: WARNING: /evidence_code value should be set for gene Sh253G12_g190!
Your changes will be committed to the database and the errors notified above will be reported as qualifiers (when available).
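
The kind of checks reported on this slide can be illustrated with a minimal Python sketch. This is not the GNPAnnot or Artemis implementation: the function names (`validate_cds`, `validate_acceptor`) and the report format are hypothetical. It only assumes the usual conventions of an ATG start codon, a TAA/TAG/TGA stop codon, a coding length that is a multiple of 3, and introns ending with the canonical AG acceptor dinucleotide.

```python
# Hypothetical illustration of structural checks on a gene model.
# Not the GNPAnnot/Artemis code; names and messages are assumptions.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def validate_cds(cds):
    """Check start codon, stop codon and length of a spliced coding sequence.

    Returns a list of (check name, status) tuples, where status is
    "OK" or an "ERROR: ..." message.
    """
    cds = cds.upper()
    report = []
    report.append(("Start Codon", "OK" if cds.startswith("ATG") else "ERROR: no ATG start codon"))
    report.append(("Stop Codon", "OK" if cds[-3:] in STOP_CODONS else "ERROR: no stop codon"))
    if len(cds) % 3 == 0:
        report.append(("Length", "OK"))
    else:
        report.append(("Length", f"ERROR: coding sequence length ({len(cds)} bp) is not a multiple of 3"))
    return report


def validate_acceptor(intron):
    """Check that an intron ends with the canonical AG acceptor site."""
    last_two = intron.upper()[-2:]
    if last_two == "AG":
        return "OK"
    return f"ERROR: unrecognized acceptor site ({last_two})"


if __name__ == "__main__":
    # A made-up 883 bp sequence with valid start and stop codons,
    # mirroring the length reported on the slide.
    example_cds = "ATG" + "A" * 877 + "TAA"
    for check, status in validate_cds(example_cds):
        print(f"{check}: {status}")
    # A made-up intron ending in CA instead of AG.
    print("Intron AG Site:", validate_acceptor("GTAAGTTTCA"))
```

On the made-up 883 bp sequence, the start and stop codon checks pass while the length check fails, which is the same combination shown for Sh253G12_g190 above.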

Slides 23 to 24: Artemis

Slide 25: Annotation History

Slide 26: Data Confidentiality
- GBrowse Access Restriction

Slide 27: Data Confidentiality
- Access Restriction Administration

Slide 28: Sugarcane BAC Analysis Results
Some statistics…
- 17 scaffolds representing bp
- 196 predicted genes
  - Currently 284 genes with an average length of 2420 bp (36% of scaffolds)
- 8 predicted TE (transposable elements)
  - Currently 132 TE with an average length of 3943 bp (28% of scaffolds)
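
As a rough back-of-envelope check of these figures, the short Python sketch below multiplies each feature count by its average length. It then assumes, and this is only an assumption, that "% of scaffolds" means the fraction of scaffold bases covered by non-overlapping features, which gives an implied order of magnitude for the total scaffold length. The helper names are hypothetical and the result is an estimate, not a figure taken from the slide.

```python
# Back-of-envelope arithmetic on the slide's statistics.
# Assumption: "% of scaffolds" is the fraction of scaffold bases covered.


def total_bp(count, mean_length_bp):
    """Total bases covered by `count` features of a given mean length."""
    return count * mean_length_bp


def implied_total_bp(annotated_bp, fraction):
    """Scaffold length implied by a covered length and a coverage fraction."""
    return annotated_bp / fraction


gene_bp = total_bp(284, 2420)  # 687280 bp covered by genes
te_bp = total_bp(132, 3943)    # 520476 bp covered by transposable elements

if __name__ == "__main__":
    print("Gene bases:", gene_bp)
    print("TE bases:", te_bp)
    print("Implied scaffold length from genes:", round(implied_total_bp(gene_bp, 0.36)))
    print("Implied scaffold length from TE:", round(implied_total_bp(te_bp, 0.28)))
```

Both estimates land a little under 2 Mbp. They differ slightly from each other, as expected when the percentages are rounded.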

Slide 29: Other Sequence Analysis Results
- Synteny Banana BAC / Rice

Slide 30: Other Sequence Analysis Results
- Quick Search: « Hibernate Search » based
- Advanced Search

Slide 31: Other Sequence Analysis Results
- Genome Report System

Slide 32: Other Sequence Analysis Results
- Metabolic Pathway

Slide 33: Sum up
- Many annotation tools
- High quality manual annotations
- SouthGreen platform can help you
See also…
- Presentations: W315, W107, W069, W152, W511, W327 and W585
- Posters: P050, P800, P805 and P820

Thanks for your attention!