WikiNeuron: Semantic Wiki of Collective Minds in Neuroscience Kei Cheung, Ph.D. Yale Center for Medical Informatics NCBO Seminar Series, March 18, 2009.



Nature’s Special Issue: Big Data
Big influxes of data have transformed researchers’ understanding of nature
– ~1.8 million named species: sequences, genes, proteins, interactions, pathways, … plus variants
We need more computers and people
Wikiomics – mashup of people, data, and computers

Short Head vs. Long Tail of Knowledge
Traditional media revolves around the Short Head: a small number of publishers putting out lots of content
“Web 2.0” media revolves around community-generated content: a huge population of individuals each generating a (relatively) small amount of content
The Short Head: Newspapers, TV/Hollywood, Consumer Reports, Olympics, Encyclopedia Britannica
The Long Tail: Blogs, YouTube, Amazon reviews, American Idol, Wikipedia
“Community intelligence”
Figure axes: Users, Content

The Long Tail of Encyclopedias
July 2008
An expert-led investigation carried out by Nature … revealed numerous errors in both encyclopaedias, but among 42 entries tested, the difference in accuracy was not particularly great: the average science entry in Wikipedia contained around four inaccuracies; Britannica, about three.
Wiki: “… a website that allows the visitors themselves to easily add, remove, and otherwise edit and change available content, typically without the need for registration.”
Wikipedia: “the free encyclopedia that anyone can edit.”

Bio-Wiki Projects
WikiPathways, Gene Wiki, WikiGenes, WikiProteins, Proteopedia, SNPedia, …
Standalone wiki sites vs. sites that are tapped into Wikipedia

Something Wiki This Way Comes (Friend S, Schadt E. (2009) Nature 458(7234):13 ) “… it is possible to build frameworks that other people could add data to — and at that point, the scale and scope became very large. And we felt that it was right to go ahead and start that now.”

Neuroscience Wiki This Way Comes
If we have “calling on a million minds for community annotation in WikiProteins”, why not “calling on a trillion neurons for community annotation in WikiNeuron”?

WikiNeuron + Neuroscience Wiki This Way Comes Semantic & collaborative Neuroscience Wiki

Diverse types of brain data at different levels (courtesy of NIDA)

Barriers to Data Integration
Well-known problems
– Inconsistent and sparse annotation of scientific data
– Many different names for the same thing
– Different ways of classification
– No standards for data exchange or annotation at the semantic level
Find images of corticospinal tract?

Terminology is used inconsistently; there are many names for the same structure
Figure labels: Cerebral peduncle, Internal capsule, Corticospinal tract

Barriers to Data Integration (cont’d)
– What genes are found in the cerebral cortex?
That depends on your definition of cerebral cortex.

Cerebral Cortex
Atlas | Children | Parent
Genepaint | Neocortex, Olfactory cortex (Olfactory bulb; piriform cortex), hippocampus | Telencephalon
ABA | Cortical plate, Olfactory areas, Hippocampal Formation | Cerebrum
MBAT (cortex) | Hippocampus, Olfactory, Frontal, Perirhinal cortex, entorhinal cortex | Forebrain
MBL | Doesn’t appear | -
GENSAT | Not defined | Telencephalon
BrainInfo | frontal lobe, insula, temporal lobe, limbic lobe, occipital lobe | Telencephalon
Brainmaps | Entorhinal, insular, 6, 8, 4, A SII 17, Prp, SI | Telencephalon

A Semantic Mismatch between Wikipedia and DBpedia (figure panels: Wikipedia, DBpedia)

WikiNeuron
It is conceived as a platform for collaborative knowledge acquisition, annotation, and integration for the neurosciences
This prototype is developed by SenseLab in collaboration with NIF (Neuroscience Information Framework)
It is implemented using Semantic MediaWiki (SMW), a semantic extension of MediaWiki, the software that drives large-scale community projects like Wikipedia

Collaboration with NIF
The goal of NIF is to develop an inventory of information and other resources within a framework that enables neuroscientists to identify resources relevant to their research needs. It is funded by the NIH Blueprint for Neuroscience Research
Yale SenseLab is part of NIF (other members include UCSD, George Mason U., Caltech, and Cornell University)
Leverage NIF resources
– NIFSTD (NIF Standard) ontologies
– NeuroLex
Use NeuroLex to provide a skeletal structure (categories) as well as standard IDs and terms for identifying and annotating resources (data)
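The role given to NeuroLex here, supplying standard IDs and terms so that differently named resources can be annotated consistently, can be pictured with a small lookup sketch. The identifiers (`EX:0001`, `EX:0002`) and the synonym lists are placeholders made up for illustration; they are not actual NeuroLex entries, and the talk does not describe how the lookup is implemented.

```python
# Hypothetical term table: the IDs are placeholders, not real NeuroLex IDs.
STANDARD_TERMS = {
    "EX:0001": {
        "label": "Cerebellar Purkinje Neuron",
        "synonyms": ["Purkinje cell", "Purkinje neuron"],
    },
    "EX:0002": {
        "label": "GABA-A receptor",
        "synonyms": ["GABAA receptor"],
    },
}


def build_index(terms):
    """Map every lower-cased label and synonym to its standard ID."""
    index = {}
    for term_id, entry in terms.items():
        for name in [entry["label"], *entry["synonyms"]]:
            index[name.lower()] = term_id
    return index


def normalize(name, index):
    """Return the standard ID for a free-text name, or None if it is unknown."""
    return index.get(name.strip().lower())


INDEX = build_index(STANDARD_TERMS)
```

With such an index, two resources annotated as "Purkinje cell" and "Cerebellar Purkinje Neuron" resolve to the same ID, which is the property the "many names for the same thing" problem calls for.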

Current Structure of NIFSTD
Single inheritance trees with minimal cross-domain and intradomain properties
Human-readable definitions (not complete yet)
NIFSTD modules (diagram labels): Macroscopic Anatomy, Organism, NS Dysfunction, NS Function, Quality, Molecule, Investigation, Subcellular Anatomy, Macromolecule, Gene, Molecule Descriptors, Techniques, Reagent, Protocols, Cell, Resource, Instruments

“Bridge files” (diagram labels)
Domains: Anatomy, Cell Type, Cellular Component, Small Molecule
Terms: Neurotransmitter, Transmembrane Receptor, GABA, GABA-R, Transmitter Vesicle, Terminal Axon Bouton, Presynaptic density, Purkinje Cell, Neuron, Dentate Nucleus Neuron, CNS, Collection of Deep Cerebellar Nuclei, Purkinje Cell Layer, Dentate Nucleus, Cytoarchitectural Part of Cerebellar Cortex
Relations: Expressed in, Located in

Overview of SMW
It is page-centric. There are different types of pages:
– Categories: support hierarchical structure
  E.g., Person is a category; Scientist can be a subcategory of Person
– Articles: they are category instances/members
  E.g., the home page of Jone Smith is an article page of the category Person
– Properties: attributes that are used to annotate page contents and relate pages
  E.g., Address, Age, Sex, and Friends are properties of Jone Smith
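The page-centric model can be sketched as a small generator of wiki markup. `[[Category:Name]]` is standard MediaWiki category membership and `[[Property::value]]` is SMW's inline annotation syntax; the helper name `render_article` and the sample values (age 42, friends A and B) are illustrative assumptions, not taken from the talk.

```python
def render_article(title, categories, properties):
    """Return SMW wikitext for one article page.

    categories: list of category names the page belongs to
    properties: dict mapping a property name to a value or list of values
    """
    lines = [f"'''{title}'''", ""]
    for prop, value in properties.items():
        values = value if isinstance(value, list) else [value]
        for v in values:
            # [[Property::value]] both displays the value and records the annotation.
            lines.append(f"* {prop}: [[{prop}::{v}]]")
    lines.append("")
    for cat in categories:
        # [[Category:Name]] makes the article a member of that category.
        lines.append(f"[[Category:{cat}]]")
    return "\n".join(lines)


# Hypothetical values for the slide's example person.
page = render_article(
    "Jone Smith",
    categories=["Person"],
    properties={"Age": 42, "Friends": ["A", "B"]},
)
```

A property whose value is another page title, as with Friends here, is what lets properties relate pages to each other rather than only describe them.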

Overview of SMW (cont’d)
It provides an internal semantic query language
It supports a SPARQL endpoint
It supports Linked Open Data through a utility that allows RDF data export
It has extensions, such as the Halo extension, that allow ontologies to be incorporated into the semantic annotation of wiki content
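As an illustration of the internal query language, the sketch below assembles an SMW inline `#ask` query as a string. The `{{#ask: … }}` form with `[[Category:…]]` and `[[Property::value]]` conditions and `|?Property` printouts is SMW syntax; the property names `Has transmitter` and `Has receptor` are assumed for the example and may not match the properties actually defined in WikiNeuron.

```python
def build_ask_query(category, conditions, printouts):
    """Build an SMW inline query string.

    category:   category the result pages must belong to
    conditions: dict of property name -> required value
    printouts:  list of property names to show for each result
    """
    parts = [f"[[Category:{category}]]"]
    parts += [f"[[{prop}::{value}]]" for prop, value in conditions.items()]
    parts += [f"|?{prop}" for prop in printouts]
    return "{{#ask: " + " ".join(parts) + " }}"


# Hypothetical property names, for illustration only.
query = build_ask_query(
    "Neuron",
    conditions={"Has transmitter": "Dopamine"},
    printouts=["Has receptor"],
)
```

Pasted into a wiki page, a query of this shape would list the pages in the Neuron category that carry the given annotation, along with the requested property values.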

Semantic Wiki Structure
Categories (e.g., brain regions, neurons, molecules)
– These categories and their subcategories are used to represent diverse types of data at different levels
Article pages are generated from different sources and assigned to category pages
Category and article pages can have properties associated with them. These properties can also be used to relate category/article pages to one another.

Example Categories
Brain
– Brain Region
  Cerebellum, Hippocampus, Neocortex, …
– Neuron
  Principal neuron – CA1 Pyramidal Neuron, Cerebellar Purkinje Neuron, …
  Interneuron – Cerebellar Granule Cell
– Neuronal Properties (Synapses)
  Receptor – GABA-A receptor, …
  Transmitter – Dopamine, …
  Current – IA, …
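The example hierarchy can be turned into category pages mechanically: in MediaWiki, a category becomes a subcategory by placing `[[Category:Parent]]` on its own category page. The sketch below assumes the upper levels of the slide's hierarchy are all categories; whether the leaf entries (e.g., Cerebellum, Dopamine) are categories or articles is not stated, so they are left out.

```python
# Upper levels of the example hierarchy, as nested dicts (name -> children).
CATEGORY_TREE = {
    "Brain": {
        "Brain Region": {},
        "Neuron": {"Principal neuron": {}, "Interneuron": {}},
        "Neuronal Properties": {"Receptor": {}, "Transmitter": {}, "Current": {}},
    }
}


def category_pages(tree, parent=None):
    """Yield (page_title, wikitext) for each category, depth-first."""
    for name, children in tree.items():
        # A subcategory page declares its parent; a root category page is empty.
        body = f"[[Category:{parent}]]" if parent else ""
        yield f"Category:{name}", body
        yield from category_pages(children, parent=name)


pages = dict(category_pages(CATEGORY_TREE))
```

Here `pages["Category:Interneuron"]` holds `[[Category:Neuron]]`, so an article assigned to Interneuron is also reachable through Neuron and Brain.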

NeuroLex Categories

NeuroLex Categories (cont’d)

WikiNeuron Articles

WikiNeuron Articles (cont’d)

Semantic Trees of the Mind
Neuroanatomy/Neurophysiology Forest (other forests can exist)
Tree levels: Brain functions, Brain regions, Neurons, Synapses
Legend: Category page; Data/paper page; Property connecting Category pages; Property connecting Data/paper pages
Example data page text: “The diagram below shows the apical tufts of 2 cortical layer V pyramidal cells filled with biocytin and stained with a Texas red / avidin-D conjugate, then counterstained with a green fluorescent Nissl stain.”
See next slide

Automatic Generation and Import of Data/Literature Pages
Sources: Triplestore, Relational database, Paper, Multimedia data, Other (e.g., XML, CSV, …)
Mapping between the source data structure and the target semantic wiki page structure (a wiki template may facilitate this mapping)
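One way to picture the template-based mapping is a converter from tabular source rows to template calls, one generated page per row. This is a minimal sketch under assumptions: the template name `Neuron`, the column names, and the sample row are hypothetical, and the talk does not specify how WikiNeuron's import is actually implemented.

```python
import csv
import io


def rows_to_template_calls(csv_text, template, title_field):
    """Map each CSV row to a MediaWiki template call, keyed by page title."""
    pages = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        title = row[title_field]
        # Every remaining column becomes a named template parameter.
        params = "\n".join(
            f"|{key}={value}" for key, value in row.items() if key != title_field
        )
        pages[title] = "{{" + template + "\n" + params + "\n}}"
    return pages


# Hypothetical source data with made-up column names.
SAMPLE = "name,region,transmitter\nCerebellar Purkinje Neuron,Cerebellum,GABA\n"
pages = rows_to_template_calls(SAMPLE, template="Neuron", title_field="name")
```

The template itself would then decide how each parameter is displayed and which semantic property it is stored under, which keeps the source-to-wiki mapping in one place.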

Mapping tools
Get_external_data – CSV, XML
Open Biomedical Annotator – Literature
Triplestore (e.g., Virtuoso, AllegroGraph, Sesame, Oracle, …) – RDF/OWL
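For the triplestore route, a rough sketch of the idea is to group triples by subject and emit one page of SMW annotations per subject. The triples below are illustrative only (the relation name `Has transmitter` is assumed), and a real import would read them from an RDF store rather than from an in-memory list.

```python
from collections import defaultdict


def triples_to_pages(triples):
    """Group (subject, predicate, object) triples into one wikitext page per subject."""
    grouped = defaultdict(list)
    for subject, predicate, obj in triples:
        # Each triple becomes one [[Property::value]] annotation on the subject's page.
        grouped[subject].append(f"[[{predicate}::{obj}]]")
    return {subject: "\n".join(lines) for subject, lines in grouped.items()}


# Illustrative triples, not exported from any actual store.
TRIPLES = [
    ("Purkinje Cell", "Located in", "Purkinje Cell Layer"),
    ("Purkinje Cell", "Has transmitter", "GABA"),
    ("Dentate Nucleus Neuron", "Located in", "Dentate Nucleus"),
]
pages = triples_to_pages(TRIPLES)
```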

Literature annotation: NCBO’s Open Biomedical Annotator

Future Directions
Work with the NIF community to identify data sources that can be incorporated into WikiNeuron
Work with other communities such as NCBO, HCLS IG, SIOC, Semantic Wiki
Interface between Semantic Wiki, ontologies, social networking, and Semantic Web

Acknowledgement
SenseLab – Gordon Shepherd, Perry Miller, Luis Marenco, Matthew Holford
NIF – Maryann Martone, Stephen Larson
NCBO – Nigam Shah
Other – Yaron Koren

Demo
Live demo – http://bioinformatics.med.yale.edu/neurowiki/index.php/Main_Page
Screenshots

WikiNeuron (main page)

Image Map

End of Demo