An Ontology-centric Architecture for Extensible

Slides:



Advertisements
Similar presentations
Notes for teachers This presentation has been designed to complement the information provided in the Plant Phenomics Teacher Resource. Some of the slides.
Advertisements

OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Y. Jaques Yves Jaques ICIS Requirements Gathering, June 2008, Rome NeOn Lifecycle Support for Networked Ontologies.
© Geodise Project, University of Southampton, Applying the Semantic Web to Manage Knowledge on the Grid Feng Tao, Colin.
Soils to Satellites. NCRIS Capabilities Well Placed NCRIS capabilities have access to: Vast volumes of Data (uniformly and non-uniformly structured) High.
Soils to Satellites Logos used with consent. Content of this presentation except logos is released under TERN Attribution Licence v1.0
Digital Agriculture Alyssa Weirman, Business Manager, High Resolution Plant Phenomics Centre July 2013.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Environmentally Sustainable Australia Atlas of Living Australia presentation to Environmentally Sustainable Australia Expert Working Group Donald Hobern,
Crop Canopy Sensors for High Throughput Phenomic Systems
Plant phenomics Some background information A plant’s genotype is all of its genes. A plant’s phenotype is how it looks and performs: a plant’s phenotype.
1 Publishing Linked Sensor Data Semantic Sensor Networks Workshop 2010 In conjunction with the 9th International Semantic Web Conference (ISWC 2010), 7-11.
Event dashboard: Capturing user-defined semantics for event detection over real-time sensor data CSIRO LAND AND WATER Jonathan Yu | Research engineer Environmental.
6 Mark Tester Australian Centre for Plant Functional Genomics University of Adelaide Research developments in genetically modified grains.
TERN Eco-informatics – Managing and delivering ecological research data now and into the future Craig Walker Eco-informatics Facility Director Logos used.
An Overview of eResearch Activities in Australia Paul Davis, GrangeNet Jane Hunter, Uni of Qld.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
ÆKOS: A new paradigm for discovery and access to complex ecological data David Turner, Paul Chinnick, Andrew Graham, Matt Schneider, Craig Walker Logos.
Performing event detection over real-time sensor data using ontology-driven approaches CSIRO LAND AND WATER Jonathan Yu | Research software engineer Environmental.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
WPS Application Patterns at the Workshop “Models For Scientific Exploitation Of EO Data” ESRIN, October 2012 Albert Remke & Daniel Nüst 52°North Initiative.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Knowledge based Learning Experience Management on the Semantic Web Feng (Barry) TAO, Hugh Davis Learning Society Lab University of Southampton.
material assembled from the web pages at
High Resolution Plant Phenomics Centre
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
Interfacing Registry Systems December 2000.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
The Saguaro Digital Library for Natural Asset Management Dr. Sudha RamSudha Ram Advanced Database Research Group Dept. of MIS The University of Arizona.
Crystal25 Hunter Valley, Australia, 11 April 2007 Crystal25 Hunter Valley, Australia, 11 April 2007 JAINIS (JCU and Indiana Instrument Services): A Grid.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP2 – Media Semantics and Ontologies.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Annotations for the ALA Ron Chernich Principal Research Fellow University of Queensland, Australia.
Interoperability & Knowledge Sharing Advisor: Dr. Sudha Ram Dr. Jinsoo Park Kangsuk Kim (former MS Student) Yousub Hwang (Ph.D. Student)
Crop Ontology towards the semantic integration of open plant trait data Elizabeth Arnaud, Luca Matteis, Rosemary Shrestha, Milko Skofic,
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
Core 2: Bioinformatics NCBO-Berkeley. Core 2 Specific Aims 1.Apply ontologies  Software toolkit for describing and classifying data 2.Capture, manage,
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
ALA Metadata - Goals and Issues Donald Hobern, Director, Atlas of Living Australia 29 August 2008.
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic sewer pipe failure detection: Linked data approaches for discovering events Jonathan Yu | Research software engineer Environmental Information.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Cyril Pommier et al. / Feedback from the RDA and WheatIS recommendations for Wheat Data Interoperability Adoption of the Wheat Data Interoperability Guidelines.
ARCHER Building data and information management tools for the complete research life-cycle July 2006.
QTL for vigor traits (LA, plant height, growth rate)
Overview: Fedora Architecture and Software Features
Innovate. Improve. Grow. WEAVER: HEXAPOD ROBOT WITH 5DOF LIMBS FOR NAVIGATING ON UNSTRUCTURED TERRAIN.
Notes for teachers This presentation has been designed to complement the information provided in the Plant Phenomics Teacher Resource. Some of the slides.
The VITO Earth Observation LTDA Facility
The Importance of “Genomes to Fields”
Geospatial and Problem Specific Semantics Danielle Forsyth, CEO and Co-Founder Thetus Corporation 20 June, 2006.
NSDL Data Repository (NDR)
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Presentation transcript:

An Ontology-centric Architecture for Extensible Scientic Data Management Systems Gavin Kennedy1,2 Dr Yuan-Fang Li3 2: School of ITEE, University of Queensland, St Lucia, QLD 3: Clayton School of IT, Monash University, Clayton, VIC Gavin.kennedy@csiro.au Novel High Resolution tools at the HRPPC Dr Xavier Sirault1 Dr Bob Furbank1 1: CSIRO Plant Industry, Black Mountain Cnr Clunies Ross St & Barry Drive Canberra, ACT 2601 Xavier.sirault@csiro.au

What is Plant Phenomics? Phenome = Genome X Environment Genomics is accelerating gene discovery but how do we capitalise on these data sets to establish gene function and development of new genotypes for agriculture? High throughput and high resolution analysis capacity now the factor limiting discovery of new traits and varieties “ In the next 50 years we must produce more food than we have consumed in the history of mankind” Megan Clarke, CSIRO CEO 2009

Phenomics from the Leaf to the Field Imagine a plant breeder walking his trials logging plant performance distributed sensors with his mobile phone or logging on to Phenonet from home to view his wheat in real time

HRPPC: Canberra node of the Australian Plant Phenomics Facility Role Deep phenotyping Development of next generation tools to probe plant function and performance (come and see us) Brachypodium distachyon Arabidopsis thaliana Infrastructure: 1500 m2 lab space 245 m2 greenhouse 260 m2 growth cabinets Analytical tools packaged in: 1- Model Plant Module (HTP) 2- Crop-Plant Shoot Module (MTP) 3- Crop-Plant Root Module (MTP) 4- Crop-Plant Field Module (HTP) Gossypium species Triticum and Hordeum species, Vigna unguiculata (cowpea), Cicer arietinum (chickpea), Zea mays (maize), Sorghum bicolor, … 4

Capitalising on new imaging technologies Plant Morphology Plant Function Visible imaging Plant area, biomass, structure Senescence, relative chlorophyll content, pathogenic lesions Far Infrared imaging Canopy / leaf temperature Water use / salt tolerance Chlorophyll Fluorescence imaging Physiological state of photosynthetic machinery Near IR imaging Tissue water content Soil water content FTIR Imaging Spectroscopy / Hyperspectral imaging Cellular localisation of metabolites (sugars, protein, aromatics) Carbohydrates, pigments and proteins 5

Addressing issues with fluorescence and environmental control PlantScan: next generation phenotyping platform for n-dimensional Models Light Detection and Ranging (LiDAR) Micro-bolometer sensors (Far-Infrared) 4-CCD line scanner (NIR and visible split) Addressing issues with fluorescence and environmental control

Automated features extraction and quantification of n-dimensional models Jurgen Fripp CSIRO ICT E-Health Brisbane Automated segmentation – extracted stem Bounding box extraction and Delauney triangulation for convex 3D hull Volume over time Height and total volume extraction Sirault, Fripp and Furbank (in preparation)

An integrated phenotyping platform for Model Plants PAM Fluorescence imaging Far Infrared imaging Visible imaging for growth Climate controlled in equilibration chamber and imaging chambers 2500 plants per day Applications: 1001 genomes project - 65 re-sequenced Arabidopsis thaliana ecotypes under analysis - with Detlef Weigel USDA Brachypodium distachyon project

www.phenonet.com Distributed Sensor Network for Phenomics Measure and log range of environmental factors on field trials. Zigby wireless transmitters: Thermopile Temp Sensor Humidity Ambient Temp Soil Moisture Imaging: Estimate biomass; greeness index for fertilization; detect flowering; estimate yield. Imaging constrained: Develop smarter portable platforms.

Ontologies Ontologies are a set of formalised terms that allow us to represent knowledge about concepts and relationships in a domain. Annotating with ontologies means describing a domain object or process. Modelling with ontologies means classifying a domain object or process, and its relationship to other domain concepts. This image shows the wheat plant on the left has increased “salt tolerance (TO:0006001)” OBI:0000050 : “platform” “A platform is an object_aggregate that is the set of instruments and software needed to perform a process. “

Ontologies Evolutionary Changes in Domain, Model & Data Expressed in OWL (& RDF Schema) Provides syntax & semantics - enables reasoning Expressivity vs decidability Validation via reasoning Designed to be open & interoperable Facilitates sharing, reuse & Integration Maturing technology stacks APIs, reasoners, triple stores, query engines

PODD The Phenomics Ontology Driven Data repository PlantScan The Phenomics Ontology Driven Data repository A research data and metadata repository. Managing Phenomics Data from Multiple Heterogeneous High Volume High Resolution Data Generation Platforms A methodology for managing and publishing research data outputs. A semantic web data resource. Phenonet Data Phenomobile TrayScan Metadata PODD Metadata Repository PODD Data Stores Data Metadata

Putting the OD in PODD Basics: Ontologies as domain models for research data Model domain objects as ontological objects Base ontology: domain independent Phenomics ontology: domain specific Organizes data logically Represented as metadata objects Parent-child relationship Referential relationship Drives all operations in the data lifecycle Domain Concepts OWL Classes Attributes and relations OWL Predicates Domain Objects OWL Individuals Comments, descriptions OWL Annotations

Observation/Phenotype The PODD Ontology Project Project Plan Investigation Platform Analysis Event Genotype Treatment Material Material Container Data Environment Design Gene Sex Observation/Phenotype Treatment Archive Data Sequence Measurement Measurement Parameter

PODD Architecture Objects represented semantically Semantics (metadata) captured in RDF Repository operations on RDF: Ingestion, retrieval, update, query & search, export Backend Object Management: Fedora Commons Fedora objects mapped to Java objects for: Business Logic Layer Interface Layer

Future Work Annotation Services Ontological tagging of PODD objects Annotation tools, search/discovery tools, browsers, etc. Virtual Laboratory Environment Support Phenome to Genome (and back) discovery processes Analyse linkages across data resources Workflows for statistical inferences & mathematical modelling. Visualisation tools etc...

Resources Plant Phenomics Test Instance: http://poddtest.plantphenomics.org.au/ Plant Phenomics Production Instance: http://podd.plantphenomics.org.au/ Mouse Phenomics Production Instance: http://podd.australianphenomics.org.au PODD Project Website: http://projects.arcs.org.au/trac/podd Contact: Gavin.Kennedy@csiro.au Ph: +61413 337 819 This work is part of a National eResearch Architecture Taskforce (NeAT) project, supported by the Australian National Data Service (ANDS) through the Education Investment Fund (EIF) Super Science Initiative, and the Australian Research Collaboration Service (ARCS) through the National Collaborative Research Infrastructure Strategy Program.

The Team PODD Project Manager Gavin Kennedy University of Queensland eResearch Lab: Faith Davies (Developer) Simon McNaughton (Developer) Jane Hunter (eResearch Lab Leader) APPF/HRPCC/CSIRO Xavier Sirault (Science Leader, HRPPC) Xueqin Wang (Tester, Documentor) Bob Furbank (APPF HRPPC Leader) APPF/Plant Accelerator/Uni of Adelaide Bogdan Masznicz (Bioinformatician) Mark Tester (APPF TPA Leader) APN Philip Wu (Developer) Martin Hamilton (Developer) Adrienne McKenzie (APN Head of Network Services) Monash Univesity Yuan-Fang Li (Designer) NeAT Andrew Treloar (Deputy Director ANDS) Paul Coddington (Projects Manager, ARCS) ALA Donald Hobern (Director, ALA)