TAIR, PMN, SGN and Gramene workshop: Focus on comparative genomics and new tools. Philippe Lamesch, A. S. Karthikeyan, Aureliano Bombarely Gomez, Pankaj Jaiswal

Presentation transcript:

TAIR, PMN, SGN and Gramene workshop Focus on comparative genomics and new tools Philippe Lamesch, A. S. Karthikeyan, Aureliano Bombarely Gomez, Pankaj Jaiswal

TAIR: Genome Browsers, Synteny Viewer, Protein Interaction Viewer

Tools at TAIR

Genome Browsers at TAIR Two options (Seqviewer & GBrowse)

Seqviewer

GBrowse: Header, Main Browser Window, Track Menu

GBrowse - Header Search by: Feature Name AT1G01040 AY Position Chr2: Configure Vista plot & fasta download
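The header's two search modes (feature name or position) can be told apart by the shape of the query string. The following is a minimal sketch, not TAIR's or GBrowse's actual code: the function name `classify_query` is hypothetical, the locus pattern follows the AGI identifier form seen in AT1G01040, and the `ChrN:start..end` position syntax and the coordinates in the usage note are assumptions for illustration.

```python
import re

# AGI locus identifiers look like AT1G01040: "AT", a chromosome code
# (1-5, C for chloroplast, M for mitochondrion), "G", then five digits.
AGI_LOCUS = re.compile(r"^AT[1-5CM]G\d{5}$", re.IGNORECASE)

# Assumed position syntax for this sketch: ChrN:start..end
POSITION = re.compile(r"^Chr([1-5CM]):(\d+)\.\.(\d+)$", re.IGNORECASE)


def classify_query(query):
    """Return 'locus', 'position', or 'other' for a search string."""
    q = query.strip()
    if AGI_LOCUS.match(q):
        return "locus"
    if POSITION.match(q):
        return "position"
    # Anything else (for example a sequence accession) would need
    # its own lookup; this sketch does not try to recognize it.
    return "other"
```

For example, `classify_query("AT1G01040")` returns `"locus"`, while a made-up range such as `"Chr2:1000..2000"` returns `"position"`.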

Track menu

Proteomics Data: Baerenfaller et al. 2008; Castellana et al. 2008

Reposition tracks

Promoter Elements Yamamoto YY, Obokata J. (2008) Nucleic Acids Res 36, D977-D981

Tracks updated regularly

Full support No support

VISTA plot GBrowse track (Dubchak et al.)

TAIR GBrowse: Nucleotide conservation across large evolutionary time spans. Arabidopsis compared with Populus (92 Mya), Medicago (92 Mya), Oryza (160 Mya), Selaginella (400 Mya) and Physcomitrella (450 Mya)
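A conservation plot of this kind is commonly drawn from percent identity computed in a sliding window along a pairwise alignment. The sketch below illustrates that general idea only; it is not the VISTA implementation, and the function name `window_identity`, the window size and the toy sequences are assumptions.

```python
def window_identity(aln_a, aln_b, window=100, step=1):
    """Percent identity per window over two aligned sequences.

    aln_a and aln_b are rows of a pairwise alignment: equal length,
    with '-' marking gaps. Returns a list of (window_start, percent).
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    results = []
    for start in range(0, len(aln_a) - window + 1, step):
        col_a = aln_a[start:start + window]
        col_b = aln_b[start:start + window]
        # A column counts as conserved only if both rows carry the
        # same base; gap columns never count as matches.
        matches = sum(1 for x, y in zip(col_a, col_b) if x == y and x != "-")
        results.append((start, 100.0 * matches / window))
    return results
```

With the toy alignment `window_identity("ACGTACGT", "ACGTTCGT", window=4, step=4)` the first window is fully conserved and the second contains one mismatch, giving `[(0, 100.0), (4, 75.0)]`.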

TAIR GBrowse: Phot1 blue light photoreceptor. Arabidopsis compared with Populus (92 Mya), Medicago (92 Mya), Oryza (160 Mya), Selaginella (400 Mya) and Physcomitrella (450 Mya)

LBL:

Orthologs and Gene Families
INSECTS: Aedes aegypti (Yellow fever mosquito), Anopheles gambiae (Malaria mosquito), Apis mellifera (Western honeybee), Drosophila melanogaster (Fly), Drosophila pseudoobscura (Fly)
MAMMALS: Bos taurus (Cow), Canis familiaris (Dog), Homo sapiens (Human), Mus musculus (Mouse), Macaca mulatta (Rhesus macaque), Monodelphis domestica (Opossum), Pan troglodytes (Chimpanzee), Rattus norvegicus (Rat)
YEAST: Candida glabrata (Haploid yeast), Cryptococcus neoformans (Yeast-like fungus), Debaryomyces hansenii (Yeast), Kluyveromyces lactis (Yeast), Yarrowia lipolytica (Yeast), Schizosaccharomyces pombe (Fission yeast), Saccharomyces cerevisiae (Budding yeast)
FISH: Takifugu rubripes (Pufferfish/Fugu), Tetraodon nigroviridis (Pufferfish/Green-spotted), Gasterosteus aculeatus (Stickleback), Danio rerio (Zebrafish)
NEMATODES: Caenorhabditis elegans (Nematode), Caenorhabditis briggsae (Nematode), Caenorhabditis remanei (Nematode)
PLANTS: Oryza sativa (Rice)
OTHER: Gallus gallus (Chicken), Xenopus tropicalis (Frog), Ciona intestinalis (Sea squirt), Entamoeba histolytica (Amoebozoa), Dictyostelium discoideum (Slime mold)

Orthologs and Gene Families

Other ways to navigate between genomes: Brassica and radish sequences. Brassica: 840,000 ESTs, 2,100 cDNAs. Raphanus: 287,000 ESTs. Nucleotide alignments to Arabidopsis using CAT (Cross-species Alignment Tool, Li et al. 2007). TAIR GBrowse

TAIR survey April 2008

What tools should we add to TAIR?

TAIR survey April 2008: What tools should we add to TAIR? Synteny Viewer; Protein-Protein Interaction Viewer

Synteny Viewer. GBrowse_syn is a GBrowse-based synteny browser developed by Sheldon McKay. It helps to study and analyze syntenic regions, homologous genes and other conserved elements between sequences. By comparing less studied genomes to the well-annotated Arabidopsis genome in GBrowse_syn, scientists can identify novel genes and putative regulatory elements. The first version of the TAIR synteny viewer contains A. thaliana to A. lyrata genome alignments (more will be added soon).
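One simple way to picture a syntenic region is as a run of homologous gene pairs that stay adjacent and in the same order in both genomes. The sketch below illustrates that idea on gene-order indices; it is a toy under stated assumptions, not how GBrowse_syn works (which displays precomputed genome alignments). The function name `collinear_blocks` and the input format are hypothetical, and only same-orientation runs are detected.

```python
def collinear_blocks(pairs, min_size=2):
    """Group homologous gene pairs into collinear runs.

    pairs: iterable of (index_in_genome_a, index_in_genome_b), where
    each index is the gene's rank along its chromosome.
    Returns a list of blocks; each block is a list of pairs whose
    indices increase by exactly one in both genomes.
    """
    blocks, current = [], []
    for a, b in sorted(pairs):
        # Extend the run only if this pair directly follows the
        # previous one in both genomes (forward orientation only).
        if current and a == current[-1][0] + 1 and b == current[-1][1] + 1:
            current.append((a, b))
        else:
            if len(current) >= min_size:
                blocks.append(current)
            current = [(a, b)]
    if len(current) >= min_size:
        blocks.append(current)
    return blocks
```

A real comparison would also need to tolerate inversions, gaps from gene loss, and duplicated genes, which this sketch deliberately ignores.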

Synteny Viewer

TAIR Protein-Protein interaction viewer using N-Browse. Protein-protein interactions are curated by IntAct, BioGRID and TAIR. The viewer uses the generic network browser N-Browse developed by the Gunsalus Lab (Curr Protoc Bioinformatics Sep; Chapter 9:Unit 9.11). The first version contains interactions determined experimentally. Interactions can be filtered by type of experiment and biological modules. Users can overlay their own interaction set on the curated set.
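Filtering by experiment type and overlaying a user's interaction set on the curated set are both simple set operations. The sketch below shows one way to express them; it does not reflect N-Browse's data model or API. The `Interaction` record, its field names, the function names and the gene names in the usage note are all hypothetical.

```python
from collections import namedtuple

# Hypothetical record: two interactors, the experiment type, and the
# database that curated the interaction.
Interaction = namedtuple("Interaction", "a b method source")


def filter_by_method(interactions, methods):
    """Keep only interactions detected by one of the given methods."""
    wanted = set(methods)
    return [i for i in interactions if i.method in wanted]


def overlay(curated, user_pairs):
    """Compare a user's interactor pairs with the curated set.

    Pairs are treated as unordered, so (x, y) and (y, x) are the same
    interaction. Returns the pairs found in both sets and the pairs
    found only in the user's set.
    """
    curated_pairs = {frozenset((i.a, i.b)) for i in curated}
    user = {frozenset(p) for p in user_pairs}
    return {"shared": user & curated_pairs, "user_only": user - curated_pairs}
```

For example, with a curated list containing `Interaction("geneA", "geneB", "two hybrid", "TAIR")`, calling `overlay(curated, [("geneB", "geneA"), ("geneA", "geneC")])` reports the first pair as shared and the second as user-only.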

Minimal information about an Interaction

Arabidopsis Protein-Protein interaction viewer: > 1,500 interactions in the TAIR N-Browse Interaction Viewer

Acknowledgments. PIs: Eva Huala, Sue Rhee. Curators: David Swarbreck, Donghui Li, Tanya Berardini, Kate Dreher, Peifen Zhang. TAIR Tech Team: Vanessa Kirkuo, Chris Wilks, Tom Meyer, Cindy Lee, Raymond Chetty, Bob Muller. N-Browse: Kris Gunsalus (NYU), Mark Gibson. GBrowse_syn: Sheldon McKay (CSHL). VISTA Browser: Inna Dubchak (LBNL)

Other ways to navigate between genomes: Orthologs and gene families TAIR GBrowse: