BRIITE SEATTLE, 2003 SCIENTIFIC DATA MANAGEMENT WORKING GROUP.

NEEDS
  MASSIVE DISPARATE DATA SETS
    IMAGE DATA (CONFOCAL, ETC.)
    DNA/PROTEIN SEQUENCE
    GENE EXPRESSION
    PROTEOMICS (GELS, ICAT, MASS SPEC)
    STRUCTURE...

PROBLEMS
  RECAPITULATION OF BOB’S PRESENTATION
  BETTER GRAPH REPRESENTATIONS AND QUERIES
  RATE OF CHANGE (2 YR LIFESPAN)
  DO AS LITTLE AS POSSIBLE – NEVER OVERBUILD
  SAVE ALL BASIC DATA
  ERRORS UNAVOIDABLE
  HUMAN INTERPRETATION OF SEMANTICS
  TRANSLATIONAL ERRORS
  PERVASIVE…..SEMANTICS
  LACK OF ACCEPTED VOCABULARIES
  NEVER BE ONE ONTOLOGY
  EXAMPLES OF ONTOLOGIES
  DATA INTEGRATION CHALLENGES
  TECHNICAL INTEGRATION VS. SCIENTIFIC INTEGRATION

WHAT WORKS
  BUILD AS SMALL, MODULAR AS POSSIBLE – EXPECT IT TO CHANGE
  USE ESTABLISHED DB DESIGN, DEVELOPMENT, AND QC PRACTICES TO FACILITATE CHANGE
  XML SCHEMA TO BUILD DB
    EASY TO MODIFY
    HANDLES DATA FORMAT CHANGES OVER TIME
    FACILITATES SHARING DATA BETWEEN INSTITUTIONS
  USE OPEN SOURCE
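
The "XML schema to build DB" point could look roughly like the sketch below. It is only an illustration, not the working group's tooling: the `sample` schema, its field names, the `SQL_TYPES` mapping, and the `ddl_from_xsd` helper are all hypothetical, and SQLite stands in for whatever database a site actually runs. The idea is that the schema file is the single place to edit when a data format changes, and the table definition is regenerated from it.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Namespace prefix ElementTree uses for XML Schema elements.
XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical schema, for illustration only (not from the slides).
XSD = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="sample">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="sample_id" type="xs:integer"/>
        <xs:element name="organism" type="xs:string"/>
        <xs:element name="concentration" type="xs:decimal"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>
"""

# Assumed mapping from a few XSD simple types to SQLite column types.
SQL_TYPES = {"xs:integer": "INTEGER", "xs:decimal": "REAL", "xs:string": "TEXT"}


def ddl_from_xsd(xsd_text):
    """Return one CREATE TABLE statement per top-level element in the schema."""
    root = ET.fromstring(xsd_text)
    statements = []
    for table in root.findall(f"{XS}element"):
        # Each child element of the sequence becomes one column.
        fields = table.findall(f"{XS}complexType/{XS}sequence/{XS}element")
        columns = [
            f'{f.get("name")} {SQL_TYPES.get(f.get("type"), "TEXT")}'
            for f in fields
        ]
        statements.append(
            f'CREATE TABLE {table.get("name")} ({", ".join(columns)})'
        )
    return statements


# Build the tables in an in-memory database and read the columns back.
statements = ddl_from_xsd(XSD)
conn = sqlite3.connect(":memory:")
for statement in statements:
    conn.execute(statement)
column_names = [row[1] for row in conn.execute("PRAGMA table_info(sample)")]
print(statements[0])
print(column_names)
```

Adding a field to the schema and rerunning the script yields a new table definition without hand-edited SQL. Migrating existing rows is a separate problem this sketch does not address.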

AVAILABLE TO SHARE
  WARREN KIBBE
    NOTIS
    SECURITY MODULES (JAVA, C, COLDFUSION USING ORACLE)
    SAMPLE TRACKING
    CORE FACILITY ORDERING SYSTEM
  TONY PAN
    DISTRIBUTED METADATA MGMT SYSTEM (
    DISTRIBUTED PROCESS EXECUTION – DATACUTTER (
  CLAYTON NAEVE
    HCNETDAT- A GENE ANNOTATION DATABASE (
    SIMS (LIMS, DB, ONLINE ORDERING/TRACKING, ETC.)
  NAT GOODMAN
    SBEAMS (RELATIONAL SCHEMA FOR HT LAB DATA) (see

AVAILABLE TO SHARE
  MICHAEL OCHS
    AUTOMATED ANNOTATION PIPELINE
    FUNCTIONAL GENOMICS DATA PIPELINE
    DISTRIBUTED BLAST (BEOBLAST) – A QUEUEING SYSTEM
    FLOWLIMS – FLOW CYTOMETRY DATA SYSTEM