Published by Robyn Blair. Modified over 9 years ago.
1
New data and tools at TAIR (The Arabidopsis Information Resource)
2
Overview of TAIR
Genome release: RNA-seq, Proteomic, Corrections
Gene function: Published papers, Journal collaborations, Direct submission
Other data: Markers, Ecotypes, Gene symbols
New genomes
New tools
Researchers: directly (TAIR pages) AND via other databases
3
TAIR10 Genome Release
Genome release: RNA-seq, Proteomic, Corrections
No assembly updates
Will incorporate:
– 200M Ecker and Mockler RNA-seq reads
– Additional proteomics data
– Individual gene structure corrections sent to us
4
Mapping and Assembly
1. Mapping
– RNA-seq sequences (TopHat (C. Trapnell), Supersplat (T.C. Mockler))
– Peptides (6-frame translation, spliced exon graph)
2. Assembly approaches
– Augustus (M. Stanke)
o Uses spliced RNA-seq reads, peptides
o Aim: Identify additional splice-variants, update existing genes
– TAU (T.C. Mockler)
o Uses spliced RNA-seq reads
o Aim: Identify additional splice-variants
– Cufflinks (C. Trapnell)
o Uses spliced and unspliced RNA-seq data
o Aim: Identify novel genes
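The 6-frame translation named in the mapping step can be sketched as below. This is a minimal illustration, assuming the standard genetic code and a plain DNA string as input; the function names are hypothetical, and it is not the pipeline TAIR actually ran.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons become 'X'."""
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def six_frame_translation(dna):
    """Translate all three offsets on both strands (hypothetical helper)."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


frames = six_frame_translation("ATGGCCTAA")
for name, protein in frames.items():
    print(name, protein)
```

A peptide identified by mass spectrometry could then be searched for in each translated frame; the frame and offset of a hit map back to a genomic position. Peptides that span a splice junction would not be found this way, which is presumably why the slide also lists a spliced exon graph.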
5
Preliminary Results
Augustus/TAU/Cufflinks predicted models are classified into categories:
Novel genes | 21
Updated genes | 812
Splice-variants | 2134
B-list | 1586
Rejects | 2318
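To put the category counts in proportion, the share of each category can be computed from the numbers on the slide. This is only arithmetic on the reported figures; the variable names are illustrative.

```python
# Counts copied from the Preliminary Results slide.
counts = {
    "Novel genes": 21,
    "Updated genes": 812,
    "Splice-variants": 2134,
    "B-list": 1586,
    "Rejects": 2318,
}

total = sum(counts.values())
shares = {category: 100 * n / total for category, n in counts.items()}

for category, n in counts.items():
    print(f"{category:16s}{n:5d}{shares[category]:6.1f}%")
print(f"{'Total':16s}{total:5d}")
```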
6
TAIR10 Genome Release
Genome release: RNA-seq, Proteomic, Corrections
No assembly updates
Will incorporate:
– 200M Ecker and Mockler RNA-seq reads
– Additional proteomics data
– Individual gene structure corrections sent to us
Release expected in August 2010
7
Experimentally Verified Gene Function
Where does it come from?
– From research articles read by TAIR curators
– From TAIR’s collaboration with journals
– From direct submissions by researchers to TAIR
Gene function: Published papers, Journal collaborations, Direct submission
8
Literature Curation
Gene function: Published papers
How?
– Papers are prioritized according to novelty of gene function results
– Highest priority papers are read and gene function is extracted
Why?
– A lot of high quality experimental gene function information is only available in the form of articles
How many?
– About 1/3 of all new articles containing gene function data are curated at TAIR each year
11
Journal Collaboration
Gene function: Journal collaborations
How?
– Author instructions, Excel sheet or online form
Why?
– To capture a larger fraction of gene function data
– Because publication is the right time to get the data into TAIR
What journals?
– 2010:
o Journal of Integrative Plant Biology
o Journal of Experimental Botany
o Plant Science
o Environmental Botany
o Plant Physiology and Biochemistry
o Plant, Cell and Environment
– Plant Physiology (2008)
– The Plant Journal (2009)
12
Direct Submission of Gene Function
Gene function: Direct submission
How?
– Excel sheet or online form
Why?
– To capture more data with a small curation team
– Because researchers are the experts on the genes they study
13
New online submission form 17986450
15
Why Gene Ontology?
– Standardization allows comparison across experiments and species
– Hierarchical structure allows high level categorization
– Well structured ontology framework facilitates computational analysis
– Attached to data source (peer reviewed published research)
– Experimental evidence can be distinguished from predictions
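How a hierarchical structure enables high level categorization can be shown with a small sketch. The hierarchy below is a toy "is_a" graph written for illustration, not the actual Gene Ontology graph, and the gene name geneX and the helper functions are hypothetical.

```python
# Toy "is_a" hierarchy: child term -> parent terms (illustrative only).
PARENTS = {
    "phototropism": ["tropism", "response to light stimulus"],
    "gravitropism": ["tropism", "response to gravity"],
    "tropism": ["response to external stimulus"],
    "response to light stimulus": ["response to external stimulus"],
    "response to gravity": ["response to external stimulus"],
    "response to external stimulus": [],
}


def ancestors(term, parents=PARENTS):
    """Collect every term reachable by following parent links upward."""
    seen = set()
    stack = list(parents.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen


def genes_under(term, annotations, parents=PARENTS):
    """Genes annotated to the term itself or to any of its descendants."""
    return {
        gene
        for gene, annotated in annotations
        if annotated == term or term in ancestors(annotated, parents)
    }


# (gene, term) pairs; geneX is a made-up example gene.
annotations = [("Phot1", "phototropism"), ("geneX", "gravitropism")]

print(sorted(genes_under("tropism", annotations)))
print(sorted(genes_under("response to light stimulus", annotations)))
```

Because each annotation is made to the most specific term, a query at a broader term picks up all genes annotated beneath it without any extra curation.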
16
Example Gene Ontology annotations
Gene | GO term | Evidence | Reference
Phot1 | Phototropism | Mutant phenotype | Huala et al 1997
Phot1 | Cytoplasm | Direct assay | Sakamoto et al 2002
Phot1 | Serine / threonine kinase activity | Direct assay | Christie et al 1998
3 GO flavors: Biological process, Cellular component, Molecular function
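The annotation rows above can be represented as simple records, which makes it easy to separate experimental evidence from predictions. This is a minimal sketch: the class and field names are hypothetical, the mapping of "Mutant phenotype" to IMP and "Direct assay" to IDA is an assumption about which GO evidence codes the slide refers to, and the last record is invented purely to show the filter.

```python
from dataclasses import dataclass

# GO evidence codes that denote experimental support.
EXPERIMENTAL_CODES = {"EXP", "IDA", "IPI", "IMP", "IGI", "IEP"}


@dataclass(frozen=True)
class Annotation:
    gene: str
    go_term: str
    aspect: str          # one of the three GO "flavors"
    evidence_code: str   # e.g. IMP, IDA, IEA
    reference: str

    @property
    def is_experimental(self):
        return self.evidence_code in EXPERIMENTAL_CODES


annotations = [
    Annotation("Phot1", "phototropism", "biological process",
               "IMP", "Huala et al 1997"),
    Annotation("Phot1", "cytoplasm", "cellular component",
               "IDA", "Sakamoto et al 2002"),
    Annotation("Phot1", "serine/threonine kinase activity", "molecular function",
               "IDA", "Christie et al 1998"),
    # Hypothetical electronic prediction, not from the slide.
    Annotation("hypothetical_gene", "kinase activity", "molecular function",
               "IEA", "hypothetical prediction"),
]

experimental = [a for a in annotations if a.is_experimental]
for a in experimental:
    print(a.gene, a.go_term, a.evidence_code, a.reference, sep=" | ")
```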
18
New online submission form Autocomplete (just start typing to get a list of matching terms)
19
New online submission form
21
What is the result of TAIR’s effort to capture gene function? How many genes have experimental gene function in TAIR? Published papers Gene function Journal collaborations Direct submission
22
Genes in TAIR with experimental evidence for biological process, molecular function or cellular component
Number of genes: 9342 genes (May 31 2010)
23
Arabidopsis Gene Function in TAIR
[Chart. Axes: Year, Genes. Series: Protein coding genes, Predicted function, Experimental function]
26
GBrowse_syn
Tool by Sheldon McKay, CSHL
Alignment data from Pedro Pattyn, Van de Peer lab, U. of Ghent
27
GBrowse_syn
Species shown: A. lyrata, A. thaliana, poplar
28
NBrowse
Tool by H.-L. Kao, F. Piano, M. Schuman, M. Gibson, Kris Gunsalus, NYU
Interaction datasets curated by TAIR, BioGRID and IntAct
33
Arabidopsis lyrata
– Genes have been loaded
– Working on adding some gene function information and improving searching
35
Central registry for Gene Symbols
39
Helpdesk
42
RSS news feed
44
TAIR Facebook Page
45
TAIR Twitter Feed
46
TAIR Staff
Gene Function/GO: Tanya Berardini, Donghui Li
Genome Annotation: David Swarbreck, Philippe Lamesch, Rajkumar Sasidharan
Tech Team: Cynthia Lee, Shanker Singh, Bob Muller, Larry Ploetz, Chris Wilks (50%), ?
47
TAIR Sponsors: Funding Agencies: Host Institution: Partner: