University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace: An Interactive Environment for Functional Analysis of Social Behavior.

Slides:



Advertisements
Similar presentations
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Advertisements

Zoology 305 Library Databases/Indexes Lab Goals for session: 1) Meet your librarian Kevin Messner 2) Understand.
NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
Searching Pubmed Database استخدام قاعدة المعلومات Pubmed د. سيناء عبد المحسن العقيل قسم الصيدلة الإكلينيكية برنامج مهارات البحث العلمي.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
The Thomson Reuters CITATION CONNECTION Digital Library st March – 3 rd April 2014, Jasná David Horký Country Manager – Central and Eastern Europe.
Gene Ontology John Pinney
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
Bioinformatics Director Lecture University of Michigan Medical School February 7, 2000 Building Analysis Environments Beyond the Genome and the Web Bruce.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Michigan Life Sciences Corridor Bioinformatics, University of Michigan March 14, 2001 Building Analysis Environments Beyond the Genome and the Web Bruce.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
High-Performance Digital Library Classification Systems: PI: Hsinchun Chen, The University of Arizona From Information Retrieval to Knowledge Management.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
BeeSpace: An Interactive Environment for Analyzing Nature and Nurture in Societal Roles Bruce Schatz Institute for Genomic Biology University of Illinois.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Bioinformatics Seminar Department of Computer Science, UIUC February 25, 2005 Analysis Environments For Functional Genomics Bruce R. Schatz CANIS Laboratory.
Information Systems Basic Core Specialization Clinical Imaging BioInformatics Public Health Computer Science Methods (formal models) Biomedical Decision.
Bioinformatics and medicine: Are we meeting the challenge?
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
International Conference on Digital Libraries November 16, 2000 Kyoto, Japan Digital Libraries of Community Knowledge: The Coming World of the Interspace.
IEEE Knowledge Media Networking KMN’02 Keynote Address, CRL, Kyoto Japan, July 11, 2002 Concept Switching in the Interspace: Networking Infrastructure.
Integrated Biomedical Information for Better Health Workprogramme Call 4 IST Conference- Networking Session.
Helping scientists collaborate BioCAD. ©2003 All Rights Reserved.
Automatically Generating Gene Summaries from Biomedical Literature (To appear in Proceedings of PSB 2006) X. LING, J. JIANG, X. He, Q.~Z. MEI, C.~X. ZHAI,
CNI Spring Meeting April 26, 1999 Washington, DC THE NET OF THE 21st CENTURY: Concepts across the Interspace Bruce Schatz CANIS Laboratory Graduate School.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Department of Computer Science seminar University of Illinois, February 14, 2005 The Evolution of the Net: Predicting Global Infrastructure Bruce R. Schatz.
University of Illinois at Urbana-Champaign BeeSpace Navigator v4.0 and Gene Summarizer beespace.uiuc.edu `
BeeSpace: An Interactive Environment for Analyzing Nature and Nurture in Societal Roles Bruce Schatz Institute for Genomic Biology University of Illinois.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Master of Science in Biological Informatics PROGRAM DESCRIPTION The MS in Biological Informatics program program aims.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
CODE (Committee on Digital Environment) July 26, 2000 Rice University THE NET OF THE 21st CENTURY: Concepts across the Interspace Bruce Schatz CANIS Laboratory.
Mining the Biomedical Research Literature Ken Baclawski.
Workshop on The Transformation of Science Max Planck Society, Elmau, Germany June 1, 1999 TOWARDS INFORMATIONAL SCIENCE Indexing and Analyzing the Knowledge.
Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.
Bioinformatics and Computational Biology
Revolutionary System Models, The Net, & The Public Interest The Interspace Prototype ( ) Digital Libraries Initiative ( ) Worm Community.
Revolution & Kids: Building the Future of the Net & Understanding the Structures of the World Bruce R. Schatz CANIS - Community Systems Laboratory University.
BeeSpace Informatics: Interactive System for Functional Analysis Bruce Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
BeeSpace: An Interactive Environment for Functional Analysis of Social Behavior Bruce Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Analysis Environments For Functional Genomics Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign
Phenotype And Trait Ontology (PATO) and plant phenotypes
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
University of Illinois at Urbana-Champaign. BeeSpace Project 5-year NSF-funded project Project Goals  Develop open bioinformatics resources  Support.
DISCUSSION Using a Literature-based NMF Model for Discovering Gene Functional Relationships Using a Literature-based NMF Model for Discovering Gene Functional.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Graduate School of Informatics Kyoto University, November 14, 2001 Functions of the Interspace Infrastructure for Concept Spaces Bruce Schatz CANIS Laboratory.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Networks and Interactions
Biological Databases By: Komal Arora.
Applications of the Interspace Analysis for Community Repositories
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
DATABASES By: Hanna Ben-Or Phone:
Presentation transcript:

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace: An Interactive Environment for Functional Analysis of Social Behavior Bruce Schatz, Principal Investigator Graduate School of Library & Information Science (GSLIS) Department of Computer Science, Program in Neuroscience Theme for Genomics of Neural and Behavioral Plasticity IGB Thematic Research Seminar, November 2, 2004

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Bee Counted – Vote Today!

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace FIBR Project BeeSpace project is NSF FIBR flagship Frontiers Integrative Biological Research, $5M for 5 years at University of Illinois Nature-Nurture using honey bee as model Genome technologies in wet lab and dry lab biology Localized Gene Expression for Normal Social Behavior Gene Robinson, Entomology (behavioral expressions) Susan Fahrbach, Entomology (anatomical localization) Sandra Rodriguez-Zas, Animal Sciences (data analysis) Interactive Information System for Functional Analysis Bruce Schatz, Library & Information Science (info systems) ChengXiang Zhai, Computer Science (text analysis) Chip Bruce, Library & Information Science (user support)

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Post-Genome Informatics Classical Organisms have extensive Genetic Descriptions There will be NO more classical organisms beyond Mice and Men other than Worms and Flies, Yeasts and Weeds. So must use comparative genomics to classical organisms, Via sequence homologies and literature analysis. Automatic annotation of genes to standard classifications, Such as Gene Ontology via sequence homology. Automatic analysis of functions to scientific literature, Such as concept spaces via text mining. Descriptions in Literature MUST be used for future interactive environments for functional analysis!

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Informational Science Computational Science is widely accepted as the Third Branch of Science (beyond Experimental and Theoretical) Genes are Computed, Proteins are Computed, Sequence “equivalences” are Computed. Informational Science is coming to be accepted as the Fourth Branch of Science Based on Information Science technologies for Functional Mining of Information Sources Comparative Analysis within the Dry Lab of Biological Knowledge

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Conceptual Navigation in BeeSpace

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Biology: The Model Organism The Western Honey Bee, Apis mellifera has become a primary model for social behavior Complex social behavior in controllable urban environment Normal Behavior – honey bees live in the wild Controllable Environment – hives can be modified Small size manageable with current genomic technology Capture bees on-the-fly during normal behavior Record gene expressions for whole-brain or brain-region

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Informatics: From Bases to Spaces data Bases support genome data e.g. FlyBase has sequences and maps Genes annotated by GeneOntology and linked to literature BeeBase (Christine Elsik, Texas A&M) Uses computed homologies to annotate genes information Spaces support biomedical literature e.g. BeeSpace uses automatically generated conceptual relationships to navigate functions

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace Software Environment Will build a Concept Space of Biomedical Literature for Functional Analysis of Bee Genes -Partition Literature into Community Collections -Extract and Index Concepts within Collections -Navigate Concepts within Documents -Follow Links from Documents into Databases Locate Candidate Genes in Related Literatures then follow links into Genome Databases

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace Software Implementation Natural Language Processing Identify noun phrases Recognize biological entities Statistical Information Retrieval Compute statistical contexts Support conceptual navigation Network Information System Concept switch across community collections Semantic Links into biological databases

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace Information Sources Biomedical Literature -Medline (medicine) -Biosis (biology) -Agricola, CAB Abstracts, Agris (agriculture) Model Organisms (heredity) -Gene Descriptions (FlyBase, WormBase) Natural Histories (environment) -BeeKeeping Books (Cornell Library, Harvard Press)

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Worm Community System (1991) WCS Information Sources Literature Biosis, Medline, newsletters, meetings Data Genes, Maps, Sequences, strains, cells WCS Interactive Environment Browsingsearch, navigation Filteringselection, analysis Sharinglinking, publishing WCS: 250 users at 50 labs across Internet (1991) Flagship in NSF National Collaboratory program

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY WCS Molecular

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY WCS Cellular

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY WCS PPCS demo

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Medical Concept Spaces (1998) Obtain discipline-scale collection Medline from NLM, 10M bibliographic abstracts human classification: Medical Subject Headings Partition discipline into Community Repositories 4 core terms per abstract for MeSH classification 32K nodes with core terms (classification tree) Community is all abstracts classified by core term 40M abstracts containing 280M concepts computation took 2 days on NCSA Origin 2000 Simulating World of Medical Communities 10K repositories with > 1K abstracts (1K w/ > 10K)

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Navigation in MedSpace For a patient with Rheumatoid Arthritis Find a drug that reduces the pain (analgesic) but does not cause stomach (gastrointestinal) bleeding Choose Domain

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Concept Search

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Concept Navigation

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Retrieve Document

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Biomedical Session

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Categories and Concepts

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Concept Switching

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Document Retrieval

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Biological Concept Spaces (2005) Compute concept spaces for All of Biology BioSpace across entire biomedical literature 50M abstracts across 50K repositories Use Gene Ontology to partition literature into biological communities for functional analysis GO same scale as MeSH but adequate coverage? GO light on social behavior (biological process)

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Interactive Functional Analysis BeeSpace will enable users to navigate a uniform space of diverse databases and literature sources for hypothesis development and testing, with a software system that goes beyond a searchable database, using statistical literature analyses to discover functional relationships between genes and behavior. Genes to Behaviors Behaviors to Genes Concepts to Concepts Clusters to Clusters Navigation across Sources

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace Information Sources General for All Spaces: Scientific Literature -Medline, Biosis, Agricola, Agris, CAB Abstracts -partitioned by organisms and by functions Model Organisms -Gene Descriptions (FlyBase, WormBase, MGI, SCD, TAIR) Special Sources for BeeSpace: -Natural History Books (Cornell Library, Harvard Press)

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY XSpace Information Sources Organize Genome Databases (XBase) Compute Gene Descriptions from Model Organisms Partition Scientific Literature for Organism X Compute XSpace using Semantic Indexing Technology Boost the Functional Analysis from Special Sources Collecting Useful Data about Natural Histories e.g. CowSpace Leverage in AIPL Databases

University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY Beyond BeeSpace The Analysis Environment technology is GENERAL! BirdSpace? BehaviorSpace? BrainSpace? SoySpace? CowSpace? IGBSpace? BioSpace Internet will evolve into Interspace…