Presentation is loading. Please wait.

Presentation is loading. Please wait.

New data and tools at TAIR (The Arabidopsis Information Resource)

Similar presentations


Presentation on theme: "New data and tools at TAIR (The Arabidopsis Information Resource)"— Presentation transcript:

1 New data and tools at TAIR (The Arabidopsis Information Resource)

2 Overview of TAIR Genome release Published papers Gene function Journal collaborations Direct submission RNA-seqProteomic Corrections Other data: Markers Ecotypes Gene symbols New genomes New tools Researchers Directly (TAIR pages) AND via other databases

3 TAIR10 Genome Release Genome release RNA-seqProteomic Corrections No assembly updates Will incorporate: –200M Ecker and Mockler RNA-seq reads –Additional proteomics data –Individual gene structure corrections sent to us

4 Mapping and Assembly 1.Mapping RNA-seq sequences (Tophat (C. Trapnell), Supersplat (T.C. Mockler)) Peptides (6-frame translation, spliced exon graph) 2.Assembly approaches Augustus (M. Stanke) o Uses spliced RNA seq reads, peptides o Aim: Identify additional splice-variants, update existing genes TAU (T.C. Mockler) o Uses spliced RNA seq reads o Aim: Identify additional splice-variants Cufflinks (C. Trapnell) o Uses spliced and unspliced RNA seq data o Aim: Identify novel genes

5 Preliminary Results Augustus/TAU/Cufflinks predicted models are classified into categories: Novel genes 21 Updated genes 812 Splice-variants2134 B-list1586 Rejects2318

6 TAIR10 Genome Release Genome release RNA-seqProteomic Corrections No assembly updates Will incorporate: –200M Ecker and Mockler RNA-seq reads –Additional proteomics data –Individual gene structure corrections sent to us Release expected in August 2010

7 Experimentally Verified Gene Function From research articles read by TAIR curators From TAIR’s collaboration with journals From direct submissions by researchers to TAIR Published papers Gene function Journal collaborations Direct submission Where does it come from???

8 How? –Papers are prioritized according to novelty of gene function results –Highest priority papers are read and gene function is extracted Why? –A lot of high quality experimental gene function information is only available in the form of articles How many? –About 1/3 of all new articles containing gene function data are curated at TAIR each year Published papers Gene function Literature Curation

9 How? –Author instructions, Excel sheet or online form Why? –To capture a larger fraction of gene function data –Because publication is the right time to get the data into TAIR What journals? Gene function Journal collaborations Journal Collaboration

10

11 How? –Author instructions, Excel sheet or online form Why? –To capture a larger fraction of gene function data –Because publication is the right time to get the data into TAIR What journals? Gene function Journal collaborations 2010: Journal of Integrative Plant Biology Journal of Experimental Botany Plant Science Environmental Botany Plant Physiology and Biochemistry Plant, Cell and Environment Plant Physiology (2008) The Plant Journal (2009) Journal Collaboration

12 Direct Submission of Gene Function How? –Excel sheet or online form Why? –To capture more data with a small curation team –Because researchers are the experts on the genes they study Gene function Direct submission

13 New online submission form 17986450

14

15 Why Gene Ontology? Standardization allows comparison across experiments and species Hierarchical structure allows high level categorization Well structured ontology framework facilitates computational analysis Attached to data source (peer reviewed published research) Experimental evidence can be distinguished from predictions

16 Example Gene Ontology annotations GeneGO termEvidenceReference Phot1PhototropismMutant phenotypeHuala et al 1997 Phot1CytoplasmDirect assaySakamoto et al 2002 Phot1Serine / threonine kinase activity Direct assayChristie et al 1998 Biological process Cellular component Molecular function 3 GO flavors

17

18 New online submission form Autocomplete (just start typing to get a list of matching terms)

19 New online submission form

20

21 What is the result of TAIR’s effort to capture gene function? How many genes have experimental gene function in TAIR? Published papers Gene function Journal collaborations Direct submission

22 Number of genes 9342 genes (May 31 2010) Genes in TAIR with experimental evidence for biological process, molecular function or cellular component

23 Arabidopsis Gene Function in TAIR Year Genes Protein coding genes Predicted function Experimental function

24

25 Overview of TAIR Genome release Published papers Gene function Journal collaborations Direct submission RNA-seqProteomic Corrections Other data: Markers Ecotypes Gene symbols New genomes New tools Researchers Directly (TAIR pages) AND via other databases

26 GBrowse_syn Tool by Sheldon McKay, CSHL Alignment data from Pedro Pattyn, Van de Peer lab, U. of Ghent

27 GBrowse_syn A. lyrata A. thaliana poplar

28 NBrowse Tool by H.-L. Kao, F. Piano, M. Schuman, M. Gibson, Kris Gunsalus, NYU Interaction datasets curated by TAIR, BioGRID and IntAct

29 NBrowse Tool by H.-L. Kao, F. Piano, M. Schuman, M. Gibson, Kris Gunsalus, NYU Interaction datasets curated by TAIR, BioGRID and IntAct

30 NBrowse Tool by H.-L. Kao, F. Piano, M. Schuman, M. Gibson, Kris Gunsalus, NYU Interaction datasets curated by TAIR, BioGRID and IntAct

31

32

33 Genes have been loaded Working on adding some gene function information and improving searching Arabidopsis lyrata

34 Overview of TAIR Genome release Published papers Gene function Journal collaborations Direct submission RNA-seqProteomic Corrections Other data: Markers Ecotypes Gene symbols New genomes New tools Researchers Directly (TAIR pages) AND via other databases

35 Central registry for Gene Symbols

36

37

38

39 Helpdesk

40

41

42 RSS news feed

43

44 TAIR Facebook Page

45 TAIR Twitter Feed

46 Tanya Berardini Donghui Li Gene Function/GO: Bob Muller Larry Ploetz Chris Wilks (50%) ? David Swarbreck Philippe Lamesch Rajkumar Sasidharan Genome Annotation: TAIR Staff Tech Team: Cynthia Lee Shanker Singh

47 TAIR Sponsors: Funding Agencies: Host Institution: Partner:


Download ppt "New data and tools at TAIR (The Arabidopsis Information Resource)"

Similar presentations


Ads by Google