Free Open-Source, Open- Platform System for Information Mash-Up and Exploration in Earth Science Tawan Banchuen, Will Smart, Brandon Whitehead, Mark Gahegan,

Slides:



Advertisements
Similar presentations
Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Online Qualitative Data Resources: Best Practice in Metadata Creation.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan
IS 466 ADVANCED TOPICS IN INFORMATION SYSTEMS LECTURER : NOUF ALMUJALLY 20 – 11 – 2011 College Of Computer Science and Information, Information Systems.
Providing useful perspectives onto massive digital collections Mark Gahegan Tawan Banchuen, Will Smart, Brandon Whitehead Centre for eResearch University.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
GenSpace: Exploring Social Networking Metaphors for Knowledge Sharing and Scientific Collaborative Work Chris Murphy, Swapneel Sheth, Gail Kaiser, Lauren.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Domain Modelling the upper levels of the eframework Yvonne Howard Hilary Dexter David Millard Learning Societies LabDistributed Learning, University of.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Business Driven Technology Unit 4
Lecturer: Ghadah Aldehim
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Web 2.0: Concepts and Applications 4 Organizing Information.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Learning Unit Documents and Examples. Learning Units - basic building block of a course For iGETT a Learning Unit consists of –Three parts Instructor.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
PLATFORM INDEPENDENT SOFTWARE DEVELOPMENT MONITORING Mária Bieliková, Karol Rástočný, Eduard Kuric, et. al.
NEPTUNE Canada Workshop Oceans 2.0 Project Environment NEPTUNE Canada DMAS Team Victoria, BC February 16, 2009.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Web-centric Computing: Computing, Hypertext, & the WWW.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Language and Computation Day University of Essex 4 October 2005.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
L6-S1 UML Overview 2003 SJSU -- CmpE Advanced Object-Oriented Analysis & Design Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I College.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
Semantic Visualization What do we mean when we talk about visualization? - Understanding data - Showing the relationships between elements of data Overviews.
Semantic Web Technologies Brief Readings Discussion Class work: Research topics and Project discussion Research Presentation Topics assigned Building lightweight.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
Introduction to the Semantic Web and Linked Data
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Digital Libraries Lillian N. Cassel Spring A digital library An informal definition of a digital library is a managed collection of information,
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Geographic Visualization to Support Epidemiology in Bulgaria Anthony C. Robinson GeoVISTA Center Department of Geography The Pennsylvania State University.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Chapter 10. The Explorer System in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans On, Kyoung-Woon Biointelligence Laboratory.
Five fantastic Fedora Commons projects in five minutes, in no particular order Carol Minton Morris Communications Director National Science Digital Library,
Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Information Sharing on the Social Semantic Web Aman Shakya* and Hideaki Takeda National Institute of Informatics, Tokyo, Japan The Second NEA-JC Workshop.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
The TERENA-OER Portal Eli Shmueli IUCC- Israeli-Inter Universities Communication Center MEITAL- Inter-University Center for e-Learning
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Samad Paydar WTLab Research Group Ferdowsi University of Mashhad LD2SD: Linked Data Driven Software Development 24 th February.
1 Using DLESE: Finding Resources to Enhance Teaching Shelley Olds Holly Devaul 11 July 2004.
The Semantic Web By: Maulik Parikh.
An Approach to Software Preservation
Publishing Communities
Presentation transcript:

Free Open-Source, Open- Platform System for Information Mash-Up and Exploration in Earth Science Tawan Banchuen, Will Smart, Brandon Whitehead, Mark Gahegan, Sina Masoud-Ansari Center for eResearch & School of Environment The University of Auckland

Overview 1.Introduction and background to project 2.Application Development – Software system for integrating, browsing and understanding large information bases 3.Demonstration / sample results 4.Conclusion

Components of knowledge computing Rich descriptions of resource meaning Recommender systems Finding analogous situations Knowledge evaluation Ontology alignment tools Filters and query tools for locating resources Knowledge visualization tools (e.g. ConceptVista, CMap, ThinkBase) Workflow description Metadata scraping Ontology capture Use-case capture Tag clouds Ontologies, controlled vocabularies, taxonomies Metadata Knowledge bases RDF/OWL/KIF

What is an Ontology? 4 An ontology describes what we know or what is true, via a kind of logic An ontology can be as simple as a concept map showing terms used to describe a topic and the relationships between those terms Topic Terms

The problem Knowledge leaks from organizations – Some gets forgotten – Some leaves with its container – Some gets buried or lost in the infrastructure We are very poorly equipped to care for knowledge in computational infrastructure – Can we ‘surface’ more of the knowledge implicitly held in unstructured documents? – If so, can we put it to use effectively?

Complete conceptual neighborhood of a document ConceptVista, Gahegan et al.

Methods Lab Books Preprints Data Video Blogs Podcasts Codes Algorithms Models Presentations Ontologies Intermediate Results Intermediate Results Related Articles Related Articles Comments & Reviews Comments & Reviews Plans Reproducible, transparent science Composite research components Carole Goble, UK eScience

Methods Lab Books Preprints Data Video Blogs Podcasts Codes Algorithms Models Presentations Ontologies Intermediate Results Intermediate Results Related Articles Related Articles Comments & Reviews Comments & Reviews Connections run both ways… an open, linked web of science Plans Carole Goble, UK eScience

Application Development Software system for integrating, browsing and understanding large information bases

Alfred & SemDat Integration Data Sources Geospatial Data - Geoserver & Mapserver Ontological Data - Sesame Documents - webpages, PDFs, reports Visualization Map Concept graph Concept tree Web browser Analysis methods Visual exploration Relevant measurement Spatial and ontological queries

Eclipse Neon uDig Display WebpageDocumentNetworkFlow SPARQL query engine Relevance query engine Display queries Style queries The application has the following basic module types:

Single Sourcing

Eclipse is used as the base – Stable and industry-standard – Enables advanced coordination between our modules and many available third party modules The display modules provide a view on the dataset with rich interactivity – A user can focus on the information they want. The query engine is the smarts – Determines which information is relevant to the current selection – Determines how that information should be displayed

SPARQL queries Styling commands Rich markup for each displayed item Style queries mark-up displayed information based on semantics:

Standards: – Eclipse – Industry-standard base with standardized plug-in format NeOn – Existing eclipse application providing useful ontological plug- ins uDig – Existing eclipse application providing useful mapping and browser plug-ins Open source Open standard Active communities – OWL/RDF – Industry standard for representing ontologies – SPARQL – Query language – Jython/Python – Advanced styling and rendering of data

Geographic Context (Map View) Analysts can gain insights from geographic relationships between cases Distance – possible physical/chemical interactions, team collaboration Clusters – successes and failures Patterns – successes restricted to a particular team Possible explanations/theories

Drill Down to Related Document Analysts can drill down to investigate anindividual abstract/article for more details

We need far better information filters

Demonstration / sample results

Conclusions We are drowning in data / information / knowledge, yet are rewarded for producing more, not less zero sum game: if we are writing more, we must be reading less… Describing documents and other digital artifacts according to a variety of different facets holds considerable promise The semantic web is providing many ways to describe data collections We may not be able to capture what things mean directly, but we can provide some useful signifiers (clues) The traces that individuals leave behind can be very useful, both to themselves and to others. And it is comparatively inexpensive to capture and analyse Trust: Researchers need commitments over data custodianship that they can rely on into the long term. Not 4 year funding cycles for nationally significant datasets

Questions? Tawan Banchuen, PhD Lecturer at Auckland University