Knowledge Engineering Start with the question: “What is an ‘atom’ of scientific knowledge?”

Slides:



Advertisements
Similar presentations
1 Probability and the Web Ken Baclawski Northeastern University VIStology, Inc.
Advertisements

Ontology Engineering approaches based on semi-automated curation of the primary literature Gully APC Burns, Tommy Ingulfsen, Donghui Feng and Ed Hovy Biomedical.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Stony Brook Model for General Education Assessment Pilot Report November 13, 2003 GEAR as a Catalyst for Change Beginning to Build a Campus- Wide Culture.
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer.
Overview of The Operations Research Modeling Approach.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Design and Evaluation of Iterative Systems n For most interactive systems, the ‘design it right first’ approach is not useful. n The 3 basic steps in the.
Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design Third Edition.
Personality, 9e Jerry M. Burger
1 FACS Data Management Workshop The Immunology Database and Analysis Portal (ImmPort) Perspective Bioinformatics Integration Support Contract (BISC) N01AI40076.
February Semantion Privately owned, founded in 2000 First commercial implementation of OASIS ebXML Registry and Repository.
Laboratory System Integration: The Evolution Of LIMS Michael S. Zachowski, Robert D. Walla, Richard P. Albert Astrix Technology Group 175 May Street Edison,
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design.
Healthcare Services as Collective Activity Susan Wakenshaw Xiao MA.
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer.
[ §3 : 1 ] 2. Life-Cycle Perspective Overview 2.1 Motivation 2.2 Waterfall Model 2.3 Requirements in Context.
SCIENTIFIC METHOD THE STEPS.
Taxonomies and Laws Lecture 10. Taxonomies and Laws Taxonomies enumerate scientifically relevant classes and organize them into a hierarchical structure,
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Crux flexible, structured data reporting for funding agencies.
Samad Paydar Web Technology Lab. Ferdowsi University of Mashhad 10 th August 2011.
Scientific Data Annotation and Analysis Lecture 7.
Copyright 2002 Prentice-Hall, Inc. 1.1 Modern Systems Analysis and Design Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 1 The Systems Development.
Research Design for Collaborative Computational Approaches and Scientific Workflows Deana Pennington January 8, 2007.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
GEON Cyberinfrastructure Workshop Beijing, China, July 21-23, 2006 Workflow-Driven Ontologies for the Geosciences Leonardo Salayandía The University of.
1 The Theoretical Framework. A theoretical framework is similar to the frame of the house. Just as the foundation supports a house, a theoretical framework.
ANKITHA CHOWDARY GARAPATI
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Bill Roberts, PresDB 07 Database Preservation: A success story and an unsolved problem Bill Roberts 23 March 2007 PresDB, Edinburgh.
BIRN Knowledge Engineering Working Group Chair: Gully APC Burns.
Master headline RDFizing the EBI Gene Expression Atlas James Malone, Electra Tapanari
Knowledge Engineering “Knowledge Engineering is an engineering discipline that involves integrating knowledge into computer systems in order to solve complex.
Proposed Research Problem Solving Environment for T. cruzi Intuitive querying of multiple sets of heterogeneous databases Formulate scientific workflows.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Major Science Project Process A blueprint for experiment success.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Module 4: Systems Development Chapter 13: Investigation and Analysis.
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
An Introduction to the Biomedical Informatics Research Network (BIRN) Gully APC Burns Information Sciences Institute University of Southern California.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
Versatile Information Systems, Inc International Semantic Web Conference An Application of Semantic Web Technologies to Situation.
Tools for Navigating and Analysis of Provenance Information Vikas Deora, Arnaud Contes and Omer Rana.
Using indicators for program management and program assessment Draft framework with examples Donna Podger (916)
High throughput biology data management and data intensive computing drivers George Michaels.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Unit 1 Lesson 2 Scientific Investigations Copyright © Houghton Mifflin Harcourt Publishing Company.
Chapter 1 The Systems Development Environment
Elucidating effects of nerve injury on gene expression using
Chapter 1 The Systems Development Environment
Chapter 1 The Systems Development Environment
What contribution can automated reasoning make to e-Science?
Chapter 1 The Systems Development Environment
Chapter 1 The Systems Development Environment
Ontology Evolution: A Methodological Overview
An ecosystem of contributions
Research in Psychology
Collaborative RO1 with NCBO
Chapter 1 The Systems Development Environment
Presentation transcript:

Knowledge Engineering Start with the question: “What is an ‘atom’ of scientific knowledge?”

Scientific assertions as ‘Computable, citable elements’ There are very large number of statements like ‘mice like cheese’ – semantics at this level are complicated! For example: – “Novel neurotrophic factor CDNF protects midbrain dopamine neurons in vivo” [Lindholm et al 2007] – “Hippocampo-hypothalamic connections: origin in subicular cortex, not ammon's horn.” [Swanson & Cowan 1975] – “Intravenous 2-deoxy-D-glucose injection rapidly elevates levels of the phosphorylated forms of p44/42 mitogen-activated protein kinases (extracellularly regulated kinases 1/2) in rat hypothalamic parvicellular paraventricular neurons.” [Khan & Watts 2004] Assertions vary in their levels of reliability, specificity. Can we introduce a generalized formalism that could support automated reasoning?

Cycles of Scientific Investigation (‘CoSI’)

e.g., ‘CDNF protects nigral dopaminergic neurons in-vivo’ This statistically- significant effect is the experimental basis for the findings of this study. Our ontology engineering approach is based on experimental variables from Lindholm, P. et al. (2007), Nature, 448(7149): p. 73-7

Knowledge Engineering from Experimental Design (‘KEfED’) Khan et al. (2007), J. Neurosci. 27: [expt 2]

KNOWLEDGE ENGINEERING FROM EXPERIMENTAL DESIGN ‘KEfED’ Project Overview

Project History The KEfED formalism has been under formulation since 2006 and received it’s first active funding in It has been initially developed in a demonstration project based on neural connectivity and has been developed for the Michael J Fox and Kinetics Foundations for Parkinson’s research. The initial user group consists of laboratory-based neuroanatomists and neuroendocrinologists. Early phases of the project involved development of initial prototypes to capture the design of a well-understood experimental design and to generate a knowledge base for experimental data from that design. We have developed numerous prototypes but have deployed a working system from the website in March, Ongoing enhancements to the system include (a) ontology support, (b) the representation of statistical relations and correlations, (c) coordination with the data management and information integration working groups.

BioScholar Application Develop a knowledge base framework for observations and interpretations from experiments. Scientists manually curate data by hand from publications into generic database driven by KEfED model Can reuse designs for multiple experiments Design process is intuitive, can build a database without informatics training Ideal for non-computational biologists. Java / Flex Web application, one click install Use Cases Scientists want to develop a generic knowledge base driven from a corpus of PDF files stored locally within a specific laboratory

Crux Application Scientists within a disease foundation must plot a whole research program How to keep track of hypotheses, experimental results and outcomes to plan the next phase of the project? System is just about to start year 2 of funding geared towards curation of raw data (not from publications). Possible framework to help scientists develop simple databases. Use Cases Decision makers at a disease foundation want to store raw data generated a generic knowledge base driven from a corpus of PDF files stored locally within a specific laboratory

Screenshots