Fusing Corporate Thesaurus Management with Linked Data using PoolParty
Thomas Schandl

PoolParty at a glance
- Developed by punkt. netServices
- Current release: PoolParty 2.8
- Main focus on three application areas:
  - SKOS Thesaurus Management
  - Linked Data (publishing & consuming)
  - Semantic Search & Semantic Indexing

Challenge for Content Management
1. Annotation: Add meaning to the content
2. Link content: Bring content together in a meaningful way
3. Make content searchable: Add background knowledge to the content

Traditional approach to annotate content with metadata
Text: "Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices."
Tags: Apple, application, merchandise, iPod touch, iPad, iPhone

Semantic Web approach: Concepts & Relations instead of simple text
Text: "Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices."
Concepts: Apple, Apple Inc., iPhone, iPhone 3GS, iPhone 3G

PoolParty in a nutshell
- W3C Semantic Web standards: management of multilingual (corporate) thesauri & taxonomies on top of Semantic Web standards (SKOS, RDF, OWL & SPARQL)
- Usability: easy-to-use, web-based AJAX user interface
- Scalable Semantic Technologies: RDF Triple Store (SAIL), (Lucene) index engine and a phrase-extraction component
- Service oriented: PoolParty Server offers a Java API & several interfaces: HTTP web services, SPARQL endpoint, Linked Data

PoolParty GUI

Full compatibility with SKOS/RDF

Some highlights: PoolParty thesaurus management
- Drag & drop, Auto-Complete
- Document analysis: phrase extraction
- Enrich concepts by using linked data
- Publish thesauri as linked data
- Advanced reporting functionality
- Import and validation of thesauri and CSV files
- Thesaurus quality checker
- Wiki-style collaborative editing of thesauri
- Visual browsing and map navigation

Built-in automatic phrase extraction
- Supports different formats (html, doc, pdf, ppt, …)
- Thesaurus-based extraction
- Can be integrated with CMS, CRM etc.
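To make "thesaurus-based extraction" concrete, here is a minimal sketch in Python. It is not PoolParty's extraction component: the concept IDs, the labels and the matching strategy (case-insensitive, longest label first) are all assumptions chosen for illustration.

```python
import re
from collections import Counter

# Hypothetical mini-thesaurus: concept ID -> labels (all names are made up).
THESAURUS = {
    "ex:AppleInc": ["Apple Inc.", "Apple"],
    "ex:iPhone": ["iPhone", "Apple iPhone"],
    "ex:iPad": ["iPad"],
}

def build_matcher(thesaurus):
    """Compile one regex over all labels and map each label to its concept."""
    label_to_concept = {}
    for concept, labels in thesaurus.items():
        for label in labels:
            label_to_concept[label.lower()] = concept
    # Longest labels first, so "Apple Inc." wins over the shorter "Apple".
    alternatives = sorted(label_to_concept, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(?:" + "|".join(re.escape(a) for a in alternatives) + r")(?!\w)",
        re.IGNORECASE,
    )
    return pattern, label_to_concept

def extract_concepts(text, thesaurus):
    """Return {concept ID: number of label occurrences found in text}."""
    pattern, label_to_concept = build_matcher(thesaurus)
    counts = Counter(
        label_to_concept[match.group(0).lower()] for match in pattern.finditer(text)
    )
    return dict(counts)

TEXT = (
    "Apple is in the process of launching an application to allow iPhone, "
    "iPad and iPod Touch users to purchase Apple merchandise."
)
RESULT = extract_concepts(TEXT, THESAURUS)
```

Because matching runs against thesaurus labels rather than raw words, "application" is not mistaken for "Apple", and synonyms of one concept are counted together.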

Some Applications on top of PoolParty
- Tag recommendation: support users and content managers when annotating text
- Semantic Indexing: PoolParty TagEvent Store as a basis for a semantic index (→ IndexBuilder)
- Similarity search: "Similarity" is configurable: certain features of a document can be "boosted" (example: persons, places / user tags etc.)
- Semantic Search and Navigation: Thesaurus can be used for faceted and moderated search (examples: emteba.at, ecoi.net)
- Search Engine Dictionaries: provide company or domain specific terms for search engine dictionary

Similarity search: finding the unexpected…
- Expert #4532: Senior Product Manager Enterprise Wiki at MitchelLake Consulting in Sydney Area …
- Project #AZ67: Integration of Confluence, which is a web-based corporate wiki. It is developed and marketed by Atlassian, Australia. …
- Match: same topic, near location
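The idea of a configurable similarity with "boosted" features can be sketched as a weighted overlap score. This is an illustrative assumption, not PoolParty's algorithm: the feature names, the concept IDs and the use of Jaccard overlap are all hypothetical.

```python
def jaccard(a, b):
    """Share of common elements between two sets (0.0 if both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def similarity(doc_a, doc_b, boosts):
    """Weighted average of per-feature overlaps; boosts maps feature -> weight."""
    total = sum(boosts.values())
    if not total:
        return 0.0
    score = sum(
        weight * jaccard(doc_a.get(feature, set()), doc_b.get(feature, set()))
        for feature, weight in boosts.items()
    )
    return score / total

# Hypothetical annotations for the expert and the project on the slide above.
EXPERT = {"topics": {"ex:Wiki"}, "places": {"ex:Australia"}}
PROJECT = {"topics": {"ex:Wiki", "ex:Confluence"}, "places": {"ex:Australia"}}

PLACE_BOOSTED = similarity(EXPERT, PROJECT, {"topics": 1.0, "places": 2.0})
TOPIC_BOOSTED = similarity(EXPERT, PROJECT, {"topics": 2.0, "places": 1.0})
```

With these made-up annotations the pair scores higher when places are boosted, because the location matches exactly while the topics only partly overlap.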

PoolParty DemoZone
- compare thesaurus-based approach with traditional approach
- tag recommender
- similar documents
- find images which fit your document
- browser bookmarklet

WordPress Glossary Plugin
- automatic generation of glossaries for WordPress blogs
- SKOS compatibility
- automatic link detection and linkage with glossary term

Programmatic access via Web Services
- getProposedTagsForDocument
- addTaggingEvent
- getTagFrequencies
- addDocumentToSimilarityIndex
- findSimilarDocuments
- getConceptSuggestions
- …

Programmatic access – Example: emteba.at

PoolParty Linked Data Features in Detail

SKOS Thesauri + Linked Data

Linked Data – Benefits & Application Scenarios
- Thesaurus Management
  - Automatic population of thesauri
  - (Semi) Automatic categorization of new concepts
- End User
  - Content augmentation
  - Improved recommender services
  - Improved navigation elements, e.g. in web-shops
- Content Provider
  - Improved SEO
  - Reduced costs of content management
  - New services and mashups

Publishing Linked Data with PoolParty
- using linked data patterns and "Cool URIs"
- Linked Data front-end
- Additionally: Wiki front-end, SPARQL endpoint

Linked Data front-end

Consuming Linked Data
- advanced linked data look-up services
- expandable number of linked data sources already integrated
- linked data synchronisation mechanisms (beta)

Linked Data Screencast
(Here comes a screencast)

Using SKOS context to link concepts to LD resources and semi-automatic population of thesaurus
- Example: Thesaurus about arts and artists
- Concept "Painters" with narrower terms (NT): Kandinsky, Rembrandt and Berners-Lee
- Using broader and sibling concepts to help disambiguate and suggest the painter Berners-Lee
- Finding mutual categories from DBpedia or Freebase
- Suggesting more NTs for Painters using LD categories
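The steps on this slide can be sketched as set operations over categories. Everything below is made-up illustration data, not real DBpedia or Freebase content, and the scoring rule (count of shared categories) is an assumption rather than PoolParty's actual heuristic.

```python
# Hypothetical linked-data candidates for the ambiguous label "Berners-Lee".
CANDIDATES = {
    "ex:Berners-Lee_(painter)": {"cat:Painters", "cat:Artists"},
    "ex:Berners-Lee_(computer_scientist)": {"cat:Computer_scientists"},
}

# Hypothetical categories of already linked sibling concepts under "Painters".
SIBLING_CATEGORIES = {
    "Kandinsky": {"cat:Painters", "cat:Artists", "cat:Russian_artists"},
    "Rembrandt": {"cat:Painters", "cat:Artists", "cat:Dutch_artists"},
}

# Hypothetical category membership in the linked-data source.
CATEGORY_MEMBERS = {
    "cat:Painters": {"Kandinsky", "Rembrandt", "Berners-Lee", "Vermeer"},
}

def mutual_categories(sibling_categories):
    """Categories shared by all sibling concepts (the SKOS context)."""
    sets = list(sibling_categories.values())
    return set.intersection(*sets) if sets else set()

def disambiguate(candidates, context):
    """Pick the candidate resource sharing the most categories with the context."""
    return max(candidates, key=lambda uri: len(candidates[uri] & context))

def suggest_narrower(category, category_members, existing):
    """Members of a category that are not yet narrower terms in the thesaurus."""
    return sorted(category_members.get(category, set()) - set(existing))

CONTEXT = mutual_categories(SIBLING_CATEGORIES)
BEST = disambiguate(CANDIDATES, CONTEXT)
SUGGESTED = suggest_narrower(
    "cat:Painters", CATEGORY_MEMBERS, ["Kandinsky", "Rembrandt", "Berners-Lee"]
)
```

In practice a human editor would confirm both the chosen resource and the suggested narrower terms, which is what makes the population semi-automatic.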

PoolParty Semantic Search

More background knowledge from thesauri and linked data can improve semantic search
- better disambiguation of search terms
- background knowledge of search terms helps to "expand queries"
- better similarity search because of more metadata
- content augmentation through linked data
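Query expansion from thesaurus knowledge can be sketched as follows. The thesaurus structure, the choice to include narrower concepts, and the boolean query syntax are assumptions for illustration; a real search engine will have its own query language.

```python
# Hypothetical thesaurus entry: preferred label -> alternative labels and
# narrower concepts (structure made up for this sketch).
THESAURUS = {
    "iPhone": {
        "alt": ["Apple iPhone"],
        "narrower": ["iPhone 3G", "iPhone 3GS"],
    },
}

def expand_query(term, thesaurus, include_narrower=True):
    """Return the search term plus its synonyms and, optionally, narrower terms."""
    entry = thesaurus.get(term)
    if entry is None:
        return [term]  # unknown term: search for it unchanged
    terms = [term] + list(entry.get("alt", []))
    if include_narrower:
        terms += list(entry.get("narrower", []))
    return terms

def to_boolean_query(terms):
    """Join the expanded terms into a generic OR query of quoted phrases."""
    return " OR ".join(f'"{t}"' for t in terms)

EXPANDED = to_boolean_query(expand_query("iPhone", THESAURUS))
```

Whether narrower terms belong in the expansion is a design choice: they raise recall but can dilute precision, so the sketch makes them optional.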

Semantic Services provided by PoolParty
- Search assistants (Auto-Complete, faceted search): Improve user's search experience
- Moderated Search: Creating complex queries
- Tag Recommendation: Identifying the meaning of a document
- Similarity Search (Recommender Systems): Understanding relations

Search Assistants
- clever auto-complete
- query expansion
- faceted search
- visual search
- Google synonyms

Moderated Search
- thesaurus helps to create complex queries
- supports multilinguality
- helps to explore a domain without deep knowledge

Tag Recommendation
- annotation of documents with low effort
- motivation for people to annotate documents
- basis for building a semantic index

Similarity Search
- improved similarity detection on top of additional background knowledge
- build recommender systems for web-shops or knowledge management systems
- help people to skim large document collections
- detect hidden relations between documents

Integration of thesauri with Enterprise Search
- PoolParty Reporting: export parts of thesauri into individual XML formats and synchronize with search engine
- PoolParty Web Services: integrate thesauri into search engine with real-time queries
- Possible integrations with enterprise search engine: Autocomplete-Server, Entity dictionary, Query rewriting, Moderated search, Enrich semantic index
- improved semantic enterprise search
- all metadata can be administered in one single place
- expandable via linked data mechanisms

PoolParty Thesaurus Management Advanced Features

Multilinguality

Concept mapping
- skos:exactMatch, skos:closeMatch
- → used for linked data mapping
- → used for concept mapping, e.g. after having imported a thesaurus
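As a rough illustration of what such a mapping looks like as data, the sketch below writes one mapping statement in N-Triples syntax. The SKOS namespace is the standard one; the thesaurus URI is hypothetical, the DBpedia URI is only an example target, and the helper function is not part of any PoolParty API.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
MAPPING_RELATIONS = {"exactMatch", "closeMatch"}

def mapping_triple(subject_uri, relation, target_uri):
    """Serialize one SKOS mapping statement as a line of N-Triples."""
    if relation not in MAPPING_RELATIONS:
        raise ValueError(f"unsupported mapping relation: {relation}")
    return f"<{subject_uri}> <{SKOS}{relation}> <{target_uri}> ."

# Hypothetical thesaurus concept mapped to an example linked-data resource.
TRIPLE = mapping_triple(
    "http://example.org/thesaurus/Kandinsky",
    "exactMatch",
    "http://dbpedia.org/resource/Wassily_Kandinsky",
)
```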

Associating notes with concepts
- skos:historyNote, skos:changeNote, skos:editorialNote
- → used to trace meanings of a concept
- → used to discuss meanings of a concept

Introduce individual relations between concepts
- Create your own individual inverse or symmetric relations between concepts
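What inverse and symmetric relations imply can be sketched with plain tuples. The relation names and concepts below are hypothetical, and the function only illustrates the inference; it does not show how PoolParty stores or evaluates custom relations.

```python
def materialize(triples, inverse_of, symmetric):
    """Add the statements implied by inverse and symmetric relation definitions.

    triples:    set of (subject, relation, object) tuples
    inverse_of: dict mapping a relation to its inverse (list both directions)
    symmetric:  set of relations that hold in both directions
    """
    inferred = set(triples)
    for subject, relation, obj in triples:
        if relation in symmetric:
            inferred.add((obj, relation, subject))
        if relation in inverse_of:
            inferred.add((obj, inverse_of[relation], subject))
    return inferred

# Hypothetical custom relations.
INVERSE_OF = {"ex:developedBy": "ex:develops", "ex:develops": "ex:developedBy"}
SYMMETRIC = {"ex:competesWith"}

ASSERTED = {
    ("ex:Confluence", "ex:developedBy", "ex:Atlassian"),
    ("ex:ProductA", "ex:competesWith", "ex:ProductB"),
}
ALL_STATEMENTS = materialize(ASSERTED, INVERSE_OF, SYMMETRIC)
```

Editors then only need to state a relation in one direction; the other direction follows from the relation's definition.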

Import / export / reporting
- import & export of SKOS using various RDF serializations
- import of CSV
- import of Zthes
- import/export of sub-trees
- custom reports and XML exports based on PoolParty's template engine

Quality checks and validation service
Check that thesauri…
- are complete
- are non-cyclic (e.g. no circularity in the broader/narrower hierarchy)
- have no clashes between related and hierarchical paths (in SKOS, skos:related is disjoint with skos:broaderTransitive)
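The non-cyclic check can be sketched as a depth-first search over the broader relation. This is a generic cycle detection under an assumed data layout (a dict from concept to its broader concepts), not PoolParty's validation service, and it covers only the circularity check from the list above.

```python
def find_cycle(broader):
    """Return one cycle in the broader hierarchy as a list, or None if acyclic.

    broader: dict mapping each concept to a list of its broader concepts.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current path, fully explored
    colour = {}

    def visit(node, path):
        colour[node] = GREY
        path.append(node)
        for parent in broader.get(node, ()):
            state = colour.get(parent, WHITE)
            if state == GREY:
                # parent is already on the current path: a cycle is closed
                return path[path.index(parent):] + [parent]
            if state == WHITE:
                cycle = visit(parent, path)
                if cycle:
                    return cycle
        path.pop()
        colour[node] = BLACK
        return None

    for node in list(broader):
        if colour.get(node, WHITE) == WHITE:
            cycle = visit(node, [])
            if cycle:
                return cycle
    return None

# Hypothetical hierarchies: one valid, one with a circular broader chain.
VALID = {"Kandinsky": ["Painters"], "Painters": ["Artists"], "Artists": []}
BROKEN = {"A": ["B"], "B": ["C"], "C": ["A"]}
```

Returning the offending path, rather than a plain yes/no, lets an editor see which broader link to remove.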

Visual browsing

Use your favourite theme!

Contact
Apply for a PoolParty demo account
Thomas Schandl
punkt. netServices GmbH
Lerchenfelder Guertel 43
A-1160 Wien / Austria