The Keyword Aggregator web service A tool and methodology for managing digital objects’ keywords IINFORMATION MANAGEMENT TECHNOLOGY, LAND & WATER David.

Slides:



Advertisements
Similar presentations
IDN Services and SERF Update Heather Weir. Earth Science Related Tools & Services Contains: –Descriptions of commercial and non-commercial, Earth science.
Advertisements

IDN Services and SERF Update Heather Weir
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
Technology Exploration – Semantics Karen Moe NASA Earth Science Technology Office WGISS-37 Meeting April 14-18, 2014.
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
Aligning Thesauri for an integrated Access to Cultural Heritage Collections Antoine ISAAC (including slides by Frank van Harmelen) STITCH Project UDC Conference.
Metadata Standards & Applications 7. Approaches to Models of Metadata Creation, Storage, and Retrieval.
How the University Library can help you with your term paper Computer Science SC Hester Mountifield Science Library x 8050
Accessing Cultural Heritage using Semantic Web Techniques Antoine ISAAC VU Amsterdam - KB Digital Access to Cultural Heritage Master March 20 th, 2008.
Vocabulary Services “Huuh - what is it good for…” (in WDTS anyway…) 4 th September 2009 Jonathan Yu CSIRO Land and Water.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
A water information R & D alliance between the Bureau of Meteorology and CSIRO’s Water for a Healthy Country Flagship Vocabulary Services, RDF, SKOS and.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
Using Vocabulary Services in Validation of Water Data May 2010 Simon Cox, JRC Jonathan Yu & David Ratcliffe, CSIRO.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Chuck Humphrey Data Library Co-ordinator University of Alberta May 16, Capitalising on Metadata Tool development plans IASSIST 2007.
Vocabularies in the VO Alasdair J G Gray Norman Gray Iadh Ounis.
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
7. Approaches to Models of Metadata Creation, Storage and Retrieval Metadata Standards and Applications.
Microsoft Academic Search Search | Explore | Discover Alex D. Wade Director - Scholarly Communication.
MD9.6 Release: Highlights Increased the character limit for all URL resources to 600 characters. Data_Center/Service_Provider Data_Set_Citation/Service_Citation.
Incorporating ARGOVOC in DSpace-based Agricultural Repositories Dr. Devika P. Madalli & Nabonita Guha Documentation Research & Training Centre Indian Statistical.
Reading Discussions Metcalfe’s Law paper What is metcalfe’s Law? Examples from the Web? How can we utilize it? How semantics contribute to social networks,
Digital Earth Communities GEOSS Interoperability for Weather Ocean and Water GEOSS Common Infrastructure Evolution Roberto Cossu ESA
NERC DataGrid NERC DataGrid Vocabulary Server Use Cases Vocabulary Workshop, RAL, February 25, 2009.
1 24 September BREAKOUT :30 1)Review of Metadata Standards Directory (DCC version and GitHub) 2)Introduction of Metadata Standards Catalog.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Exploring and Mapping Vocabularies IVOA InterOp Semantics Session Trieste Italy, May 2008 Alasdair J. G. Gray.
Search Engines Reyhaneh Salkhi Outline What is a search engine? How do search engines work? Which search engines are most useful and efficient? How can.
Google’s Deep-Web Crawl By Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, and Alon Halevy August 30, 2008 Speaker : Sahana Chiwane.
VIVO and Scholarly Repositories: Synergistic Opportunities.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Discovering Earth Science Data and Services Using NASA’s Global Change Master Directory: The Value for Earth Science Teachers Tyler Stevens NASA’s Global.
Terrier Workshop: 26 th February 2008 Alasdair J G Gray.
AGROVOC Thesaurus. 1980s: developed as multilingual structured thesaurus for agricultural terminology (“rice”) : parallel effort to express thesaurus.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Introduction to the Semantic Web and Linked Data
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Mining for Ideas on the Web Video: 4 min. 45 sec..
Adapting the Electronic Laboratory Notebook for the Semantic Era Tara Talbott, Michael Peterson, Jens Schwidder, James D. Myers 2005 International Symposium.
Vocabulary services in CSIRO’s Environmental Informatics I Simon Cox, Jonathan Yu 24 September 2015.
Discovering libraries’ gold through collection-level descriptions ELAG 2014, Bath Valentine Charles Data specialist.
“New Dimensions in KOS” CENDI/NKOS Workshop September 11, 2008 Washington, DC, USA An international conference to share and advance knowledge and experience.
The Application of Semantic Technologies to Scientific Archives J. Steven Hughes Daniel J. Crichton J. Steven Hughes Daniel J. Crichton Science Archives.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
Summer of Vocabs: Knowledge Organisation Water Resources Management - Environmental Information Infrastructures Megan Williams| Vacation Scholar 29 January.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Renovation of Eurostat dissemination chain
Science Keyword Aggregator I David Benn, Nick Car, Simon Cox, Jonathon Yu 18 September 2015.
EXtended Knowledge Organization System (XKOS) Prepared by Franck Cotton, Institut National de la Statistique et des Études Économiques Daniel W. Gillman,
SKOS : A language to describe simple knowledge structures for the web
LoCloud Conference - Sharing local cultural heritage online with LoCloud services Microservices in LoCloud Walter Koch Gerda Koch
LP DAAC Overview – Land Processes Distributed Active Archive Center Chris Doescher LP DAAC Project Manager (605) Chris Torbert.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Usage of BODC parameter vocabularies
Ilya Zaslavsky Jeffrey Grethe amarnath Gupta burak Ozyurt
PREMIS Tools and Services
International Marketing and Output Database Conference 2005
Web archives as a research subject
Knowledge Sharing Mechanism in Social Networking for Learning
Presentation transcript:

The Keyword Aggregator web service A tool and methodology for managing digital objects’ keywords IINFORMATION MANAGEMENT TECHNOLOGY, LAND & WATER David Benn | Software Engineer, IMT Scientific Computing, CSIRO 1 December 2015 scikey.org

Agenda 2 | The Keyword Aggregator | David Benn What problem are we solving? Finding suitable (science) keywords for publications What have we built (to address the problem)? The Keyword Aggregator: –web service, example widget, vocabularies, related tools What remains to be done?

Collaboration 3 | The Keyword Aggregator | David Benn CSIRO Land & Water Nick Car * Simon Cox Jonathon Yu CSIRO Information Management Technology (IMT) David Benn IMT Scientific Computing eResearch projects x 2 –1 day per week for 6 months

What problem are we solving?

Publication Keywords 5 | The Keyword Aggregator | David Benn Data publication, Software publication, Journal paper, Conference paper …

Publication Keywords 6 | The Keyword Aggregator | David Benn

Publication Keywords: Controlled Vocabulary 7 | The Keyword Aggregator | David Benn

Publication Keywords: Free Entry 8 | The Keyword Aggregator | David Benn

What have we built? Keyword Aggregator

Aggregated Keyword Source 10 | The Keyword Aggregator | David Benn ?

Folksonomies 11 | The Keyword Aggregator | David Benn lowest ranking

Design Goals 12 | The Keyword Aggregator | David Benn Fast keyword search, even with many vocabs. Relevant search results Various search strategies may be needed, not just full-text search. Allow for folksonomy-style use. Web service and demo client (widget) For direct use or as a reference implementation (e.g. ZK vs jQuery). Simple management of separate vocabularies.

Keyword Aggregator 13 | The Keyword Aggregator | David Benn Web Service Vocab 1 Vocab 2 Vocab n … Folkso nomy

14 | The Keyword Aggregator | David Benn Keyword Aggregator

REST API: | The Keyword Aggregator | David Benn {"head": {"vars": ["graph_name", "term", "sum", "text_value", "prefLabel"]}, "results": {"bindings": [ [{"term": {"type": "uri", "value": " "graph_name": {"type": "uri", "value": " "vocab_subject": {"xml:lang": "en", "type": "literal", "value": "science"}, "text_value": {"xml:lang": "en", "type": "literal", "value: "EARTH SCIENCE > Agriculture > Animal Science > Animal Physiology and Biochemistry"}, "sum": {"datatype": " "type": "typed-literal", "value": "20"}, "vocab_title": {"xml:lang": "en", "type": "literal", "value": "GCMD Science Keywords V6"}, "vocab_status": {"type": "uri", "value": " "prefLabel": {"xml:lang": "en", "type": "literal", "value": "EARTH SCIENCE > Agriculture > Animal Science > Animal Physiology and Biochemistry"}}] }

Usage Stats: in relational database 16 | The Keyword Aggregator | David Benn KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| KWAG|test| |none| …

What have we built? Vocabularies

SKOS: 18 | The Keyword Aggregator | David Benn

Vocabularies 19 | The Keyword Aggregator | David Benn MODSIM 2011, 2013 keyword analysis.

Vocabularies 20 | The Keyword Aggregator | David Benn GCMD: Global Change Master Directory science keywords

Vocabularies 21 | The Keyword Aggregator | David Benn Wikipedia Computer Science

Discoverable Vocabularies 22 | The Keyword Aggregator | David Benn

Vocabulary Metadata 23 | The Keyword Aggregator | David Benn

24 | The Keyword Aggregator | David Benn Vocab Metadata Generation

25 | The Keyword Aggregator | David Benn

Vocab-of-vocabs concept A SKOS ConceptScheme can point its hasTopConcept property to a Concept outside itself. Useful for broad vocabs where specialisations exist –e.g. science keywords A “vocabulary-of-vocabularies”| Nicholas Car 26 | hasTopConcept

Vocab-of-vocabs concept Allow a single, integrated, vocab set to be used by search tools No change to underlying vocabulary A “vocabulary-of-vocabularies”| Nicholas Car 27 | skos:ConceptScheme GCMD Terms skos:Concept OCEANS skos:Concept MARINE SEDIMENTS skos:Concept TURBIDITY skos:ConceptScheme Turbidity Types skos:Concept Turbidity Type 1 skos:Concept Turbidity Type 2 hasTopConcept Vocab 2 Vocab 1

Search Strategies Explored 28 | The Keyword Aggregator | David Benn Simple matching in vocab text elements Weighted semantic Assign weights to text matches in different SKOS elements –e.g. skos:prefLabel, skos:altLabel, skos:definition, dc:description Hierarchical Exploits explicit broader/narrower relationships present in some vocabs Historical/popularity based

Administration 29 | The Keyword Aggregator | David Benn

What remains to be done?

Future Work: publication, improvements 31 | The Keyword Aggregator | David Benn Software publication in CSIRO Data Access Portal (in draft) Search result “decoration” from JSON to enhance widget keyword selection. Automate ingestion of arbitrary number of known vocabularies. Streamline vocabulary submission process. Mine arbitrary/federated metadata for vocabulary existence. Inter-vocabulary individual term linking search strategy (e.g. skos:related). Use of previously chosen keywords to inform search result prioritisation. Scale search performance with large or many vocabularies. Web-time hierarchical search: pre-compute, more resources, better graph engine.

IMT Scientific Computing, CSIRO David Benn Software Engineer t e w my.csiro.au/B/D/David-Benn.aspx Thank you IMT/SCIENTIFIC COMPUTING