Integrating Literature and Experimental Data Fan Meng, Ph.D. Microarray Laboratory Psychiatry Department and Molecular & Behavioral Neuroscience Institute.


Integrating Literature and Experimental Data
Fan Meng, Ph.D.
Microarray Laboratory, Psychiatry Department and Molecular & Behavioral Neuroscience Institute
University of Michigan

High Throughput Data Analysis Overview
Raw Data: Expression/Genotype/Sequence
Molecular → Gene/Transcript/SNP/Genome
System → Pathway/Network/Gene Set
Integrative Exploration → Hypothesis
Diagram labels: freewheeling, rigid, glamorous, dull

MGREP Concept Mapping Engine
Figure 1. Overview of our free text-to-ontology mapping method. Diagram boxes: Single Word Variation, Concepts, Remove Common Words, Combine with Word Order Permutation, Radix-tree Match.
Key Idea: Classical concept-matching algorithms take the time-consuming approach of generating concept variations during the match itself. mgrep instead pre-generates the concept variations and uses highly efficient string-matching algorithms, achieving a speedup of two orders of magnitude over MetaMap.
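
The key idea above can be illustrated with a small sketch. This is not mgrep's code: the stop-word list, the concept IDs (C1, C2) and the function names are assumptions made for illustration, a plain word-level trie stands in for the radix tree, and the single-word-variation step is left out. It only shows the pattern of generating variations once, ahead of time, so that matching is a simple lookup.

```python
import re
from itertools import permutations

STOP_WORDS = {"of", "the", "and", "in"}  # assumed common-word list
END = ""  # trie key marking "a concept ends here"; never a real token


def variations(name):
    """Pre-generate variants of a concept name: drop common words,
    then emit every word-order permutation of what is left."""
    words = [w for w in re.findall(r"[a-z0-9]+", name.lower())
             if w not in STOP_WORDS]
    if len(words) > 4:  # keep the permutation count bounded
        return {tuple(words)}
    return set(permutations(words))


def build_trie(dictionary):
    """dictionary maps a concept ID to a list of names for that concept."""
    root = {}
    for concept_id, names in dictionary.items():
        for name in names:
            for variant in variations(name):
                node = root
                for word in variant:
                    node = node.setdefault(word, {})
                node.setdefault(END, set()).add(concept_id)
    return root


def annotate(text, trie):
    """Return (matched words, concept ID) for the longest match
    starting at each token position."""
    tokens = [w for w in re.findall(r"[a-z0-9]+", text.lower())
              if w not in STOP_WORDS]
    hits = []
    for start in range(len(tokens)):
        node, best = trie, None
        for end in range(start, len(tokens)):
            node = node.get(tokens[end])
            if node is None:
                break
            if END in node:
                best = (end, node[END])
        if best is not None:
            end, ids = best
            for concept_id in sorted(ids):
                hits.append((" ".join(tokens[start:end + 1]), concept_id))
    return hits


trie = build_trie({"C1": ["cancer of the breast"],
                   "C2": ["Diabetes Mellitus"]})
print(annotate("Patients with breast cancer and diabetes mellitus.", trie))
```

All the permutation work happens in build_trie, which runs once per dictionary. annotate only walks the trie, so its cost per document does not grow with the number of variations per concept.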

Evaluation of MGREP by NCBO
Shah NH, Bhatia N, Jonquet C, Rubin D, Chiang AP, Musen MA (2009) Comparison of concept recognizers for building the Open Biomedical Annotator. BMC Bioinformatics Sep 17;10 Suppl 9:S14.
Table: Precision of Mgrep and MetaMap using the 'diseases' dictionary
Columns: Data Source, Mgrep, MetaMap
Data sources: Clinical Trials, Gold Miner, GEO, MedLine

MGREP in NCBO Annotator Web Service

PubAnatomy
- Integrate Medline literature with external data
- Enable efficient visual query
- Open architecture

Linking Literature and Experimental Data
Mapping Medline to brain structures
Integrating multiple data sets
- Gene expression from the Allen Brain Atlas
- Brain structure relationship from NeuroName
- Protein-protein interaction from MiMI
Graphic presentation of data
- Allen Brain Atlas
- Protein-protein interaction network
- Gene co-expression network

PubAnatomy Architecture
- Visualization components: Flex
- Server-side web services: algorithms and graphics
- Backend database: Oracle
Architecture diagram labels, by layer:
- Visualization Components: PubAnatomy UI, user selection
- Server-Side Web Services: internal services (service I1, service I2, …), user plug-ins (plugin U1, plugin U2, …), open API
- Internal: algorithm I1, algorithm I2, dataset I1, dataset I2, …
- User: algorithm U1, algorithm U2, dataset U1, dataset U2, …
- Backend Database: databases, BioNLP, Literature, …
- Integration
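
The split between internal services and user plug-ins behind one open API can be sketched as a small registry. Everything here is hypothetical: the class, the method names and the toy services are illustrative assumptions, written in Python rather than the Flex and Oracle stack named on the slide, and they do not describe PubAnatomy's actual interfaces.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A service takes the user's selection (e.g. brain structure names)
# and returns one value per selected item.
Service = Callable[[List[str]], Dict[str, float]]


class ServiceRegistry:
    """Hypothetical registry: internal services and user plug-ins are
    registered and called through the same interface."""

    def __init__(self) -> None:
        self._services: Dict[str, Tuple[Service, bool]] = {}

    def register(self, name: str, service: Service,
                 user_plugin: bool = False) -> None:
        if name in self._services:
            raise ValueError(f"service {name!r} is already registered")
        self._services[name] = (service, user_plugin)

    def names(self, user_plugin: Optional[bool] = None) -> List[str]:
        """List service names; filter by origin when user_plugin is set."""
        return sorted(name for name, (_, is_user) in self._services.items()
                      if user_plugin is None or is_user == user_plugin)

    def run(self, name: str, selection: List[str]) -> Dict[str, float]:
        service, _ = self._services[name]
        return service(selection)


registry = ServiceRegistry()
# Toy stand-ins for "service I1" and "plugin U1" from the diagram.
registry.register("I1", lambda sel: {s: 1.0 for s in sel})
registry.register("U1", lambda sel: {s: float(len(s)) for s in sel},
                  user_plugin=True)

print(registry.names())
print(registry.run("U1", ["hippocampus"]))
```

Because the UI layer only calls run with a service name and a selection, a user-contributed algorithm or dataset can be added without changing the visualization code.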

PubAnatomy Interface