Genome Sequence Annotation Server

Slides:



Advertisements
Similar presentations
Model Organism Databases and Community Annotation
Advertisements

Submitting a Genome to RAST. Uploading Your Job 1.Login to your RAST account. You will need to register if this is your first time using SEED technologies.
2 Unité de Biométrie et d’Intelligence Artificielle (UBIA) INRA
Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.
Annotating a Scarlet Runner Bean genome fragment put together by shotgun sequencing Scarlet Runner ean Max Bachour.
Genome Annotation BCB 660 October 20, From Carson Holt.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
GenSAS: Genome Sequence Annotation Server, a Tool for Online Annotation and Curation Dorrie Main, Taein Lee, Ping Zheng, Sook Jung, Stephen P. Ficklin,
Dorrie Main, Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy and Don Jones.
Jodi Humann, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Jill Wegrzyn, David Neale and Dorrie Main A web-based platform for genome annotation GenSAS Poster.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy, Don Jones, Dorrie Main.
MAKER Annotation Process Example of Glossina VectorBase Karyn Mégy Dan Hughes.
UMR ASP UMR ASP Structural & Comparative Genomics in Bread Wheat TriAnnotPipeline A LifeGrid Project based on AUVERGRID F. Giacomoni, M.
11 Overview Paracel GeneMatcher2. 22 GeneMatcher2 The GeneMatcher system comprises of hardware and software components that significantly accelerate a.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Jodi Humann, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Sook Jung, Jill Wegrzyn, David Neale and Dorrie Main An easy to use, web-based solution for specialty.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Developed by James Estill, Dept. of Plant Biology, University of Georgia.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Annotating genomes using MAKER-P and iPlant. What Are Annotations? Annotations are descriptions of features of the genome –Structural: exons, introns,
Building WormBase database(s). SAB 2008 Wellcome Trust Sanger Insitute Cold Spring Harbor Laboratory California Institute of Technology ● RNAi ● Microarray.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.
Copyright OpenHelix. No use or reproduction without express written consent1.
TrypDB Analysis Workflow Common Analysis T Cruzi Analysis T Brucei Analysis L Braziliensis Analysis L Infantum Analysis L Major Analysis Mercator.
Supplementary Figure S1. Supplementary Figure S2.
Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng, Jodi L. Humann, Deah McGaughey, Heidi Hough, Stephen P. Ficklin, B. Todd Campbell,
TrypDB Analysis Workflow Common Analysis T Cruzi Analysis T Brucei Analysis L Braziliensis Analysis L Infantum Analysis L Major Analysis Mercator.
The Bovine Genome Database Abstract The Bovine Genome Database (BGD, facilitates the integration of bovine genomic data. BGD is.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
Web Apollo/JBrowse • JBrowse is a web based genome browser
Annotating The data.
NGS File formats Raw data from various vendors => various formats
Resources Available for Fragaria Research through the Genome Database for Rosaceae Dorrie Main, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Taein Lee,
CyVerse Discovery Environment
Basics of BLAST Basic BLAST Search - What is BLAST?
CottonGen: An Up-to-Date Resource Enabling Genetics, Genomics and Breeding Research for Crop Improvement Plant and Animal Genome Conference XXV Jing Yu1,
The Cool Season Food Legume Database: An Integrated Resource for Basic, Translational and Applied Research Dorrie Main, Chun-Huai Cheng, Stephen Ficklin,
Genome Sequence Annotation Server
the Genome Database for Rosaceae: New Data and Functionality
CottonGen An Online Resource for the Cotton Community
Introduction to G-OnRamp
Plant and Animal Genome Conference XXIV
Genome Annotation w/ MAKER
Cuong Nguyen, Deng Xin, Dongmei, Zheng Wang
Strategies for annotation of a genome
Ensembl Genome Repository.
Updates to the CSFL Genome Database:
for the Cotton Community
Updates and Future Direction
Explore Evolution: Instrument for Analysis
CottonGen BIMS for Effective and Efficient Management of Breeding Data
Using CottonGen for Crop Improvement
Genome Database for Rosaceae:
A web-based platform for structural and functional annotation of model and non-model organisms Jodi Humann, Taein Lee, Stephen Ficklin,
Yating Liu July 2018 G-OnRamp workshop
2 Unité de Biométrie et d’Intelligence Artificielle (UBIA) INRA
Follow-up from last night: XSEDE credits
How to Effectively Search and Download Data in CottonGen
CottonGen: Enabling Cotton Research through Big-Data Analysis and Integration Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng,
New Data and Functionality in NRSP10 Databases
2016 Beltwide Cotton Conference
Welcome - webinar instructions
Resources for HLB and Citrus Genomics, Genetics and Breeding Research
Germplasm Overview Page Trait Descriptor Standard Images
Presentation transcript:

Genome Sequence Annotation Server Structural and functional annotation of model and non-model organisms with GenSAS v5.0, a web-based annotation platform Jodi L. Humann, Stephen P. Ficklin, Taein Lee, Chun-Huai Cheng, Heidi Hough, Sook Jung, Jill Wegrzyn, David Neale, Dorrie Main jhumann@wsu.edu

What is DNA annotation and why do it? Getting the DNA sequence is only the first step Need to know the biological relevance of the DNA sequence Annotated sequence can be used to find putative genes of interest for study

What scientists want Current annotation tools: Many tools available, but run independently of each other Most of the tools are run via the command line and require server access Scientists want a platform that: Is a single location for DNA annotation Does not require management of computing equipment and software tools Is easy to use and can be adapted to a variety of DNA sequences

What is GenSAS? A single website that combines numerous annotation tools into one interface User accounts keep data private and secure as well as allow users to share data for collaborative annotation Easy-to-use interfaces, with integrated instructions allow researchers at all skill levels to annotate DNA

GenSAS annotation process Upload Sequences Create Project Upload Evidence Identify Repeats RepeatMasker, RepeatModeler Mask Sequences Align Transcripts BLAST, BLAT, PASA TopHat Structural Annotation Augustus, GeneMarkES, Genscan, GlimmerM, SNAP Choose Official Gene Set EvidenceModeler Refine Gene Models PASA Functional Annotation BLAST, InterProScan, Pfam, SignalP, TargetP Manual Curation Apollo, JBrowse Generate Files for Publication

www.gensas.org

GenSAS welcome tab provides users with a quick overview of what each of the three screen sections do.

Sequence Tab: Single sequence or multi- sequence FASTA file Sequence subset based on sequence names or minimum size can be made Project Tab: Open existing project or shared project Create new project

GFF3 & Evidence Tabs (optional): EST, mRNA sequences Repeat motifs Protein sequences NCBI gene structures Pre-processed Illumina RNA- Seq reads All tabs have an Instructions section that can be opened and collapsed The more organism specific data you have, the better the annotation will be

Repeats Tab: Evidence based repeat finder De novo repeat finder Masking Tab: Check results in JBrowse and choose which set(s) to use to make masked consensus

Job status can be monitored through Job Queue Progress through GenSAS is automatically saved Users can log off GenSAS and jobs will continue running While jobs are running, users can look at the completed results in Apollo/JBrowse Once the project has results, users can share the project with other GenSAS users for collaborative annotation

Look at the results of jobs, before moving to next step!

Look at the results of jobs, before moving to next step!

Align Tab: Align RNA-Seq data for training the gene prediction programs Align species-specific transcripts and proteins Structural Tab: Gene prediction programs SSR Finder, tRNAScanSE, RNammer, getorf

OGS (Official Gene Set) Tab: Sets gene model for manual annotation process and final publication Refine Tab: Use PASA and RNA evidence to refine OGS gene models

Functional Tab: Gene models are functionally annotated

GenSAS exports data in GFF3 and FASTA formats Manual annotation from Apollo are automatically merged into OGS at Publish Step GenSAS exports data in GFF3 and FASTA formats

Future development Integrate Apollo 2 and newest JBrowse Add option to create single merged GFF3 of all annotation data under Publish step Improve how BLAST jobs are submitted to cluster to reduce run time

Supported by