12/03/2013 Second International Workshop on New Generation Enterprise and Business Innovation (NGEBIS 2013): Cross Domain Crawling for Innovation

Presentation transcript:

Second International Workshop on New Generation Enterprise and Business Innovation (NGEBIS 2013)
Cross Domain Crawling for Innovation
Pierluigi Assogna, Francesco Taglino, CNR-IASI (Italy)

Outline
Motivations & Objectives
Methodological approach
Technological approach
Conclusions

Motivations and Objectives
In any kind of organization, creativity and innovation come from people.
Tools that aim to support creativity need to be based on the most accredited theories of how people use their knowledge to act on the environment, adapt to new situations, and invent.
The method proposed here aims to provide knowledge “raw material” capable of triggering out-of-the-box ideas.

Constructivism
According to Constructivism, a person’s culture is an integrated network of concepts and models.
This network guides the person’s activity, and is consolidated, enriched, and modified by each new experience.
Apart from pathological situations (schizophrenia), each person’s structure is in any case connected.

New Paths
The connections between concepts create paths that, with time, our mind travels more or less automatically.
In new situations we have to “take the lead” and try new paths, possibly linking different and distant clusters.
This is, for instance, what “lateral thinking” methods favor.

Knowledge Base
In general, a domain Knowledge Base (KB) is a tool for maintaining and enriching its users’ focused knowledge.
In particular, the KB’s ontology mimics their focused conceptual structure.
When the users are confronted with new issues, a search on the KB or on the Net (on the basis of the domain ontology) typically keeps them within this focused ground.

The Methodology
We propose a way to extend a focused knowledge domain to support diversions from usual thinking paths.
We use the domain ontology to search the Net for documents that address key topics of the domain together with topics belonging to different domains.
These documents have a good probability of containing considerations, theories, and metaphors that link the person’s knowledge clusters with “exotic” ones, able to trigger out-of-the-box ideas.

Semantics-based cross-domain crawling

Document Resources
The space where we search for interesting documents: websites (e.g., the MIT website on innovations), RSS feeds, and public document repositories (e.g., BBC News).
In our example we focus on the Robotics and Machine Vision (R&MV) domain.

Linked Data
A set of principles that allow:
a standard description of data (RDF-based);
a standard way of accessing data (HTTP);
linking resources/data to one another.
Linking Open Data is a project for publishing datasets (e.g., DBpedia) in a Linked Data fashion.

The Linking Open Data cloud
[Figure: Linking Open Data cloud diagram; label: DBpedia]

Reference ontology and bridge to the LOD cloud
Within the BIVEE project we have built a glossary of 600 concepts on R&MV.
We enriched these concepts with DBpedia entries (owl:sameAs).
[Figure: concepts of the R&MV reference ontology (Photodiodes, Camera) linked by owl:sameAs to DBpedia entries (Photodiode)]
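The bridge described above can be pictured as a small mapping from reference-ontology concepts to DBpedia entries. The following is only a minimal sketch: the `REF_NS` namespace is a hypothetical placeholder, and the `Camera` target on the DBpedia side is an assumption, since the slide names only the Photodiodes/Photodiode pair explicitly.

```python
# Sketch of the owl:sameAs bridge between a reference ontology and DBpedia.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
DBPEDIA_NS = "http://dbpedia.org/resource/"
REF_NS = "http://example.org/rmv#"  # hypothetical namespace, not from the slides

# Reference concept -> DBpedia entry name.
# "Camera" -> "Camera" is an assumed pairing for illustration.
SAME_AS = {
    "Photodiodes": "Photodiode",
    "Camera": "Camera",
}


def dbpedia_uri(concept, links=SAME_AS):
    """Return the DBpedia URI linked to a reference concept, or None."""
    entry = links.get(concept)
    return DBPEDIA_NS + entry if entry is not None else None


def same_as_triples(links=SAME_AS):
    """Serialize the links as N-Triples-style statements, sorted by concept."""
    return [
        f"<{REF_NS}{concept}> <{OWL_SAME_AS}> <{DBPEDIA_NS}{entry}> ."
        for concept, entry in sorted(links.items())
    ]


if __name__ == "__main__":
    for triple in same_as_triples():
        print(triple)
```

Keeping the links in a plain dictionary makes the later filtering step a simple lookup; a real deployment would more likely store them in the ontology itself.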

Term extraction from the analyzed document
Extracted terms/concepts are representative of the document and, to some extent, summarize its content.
We analyzed different tools for extracting knowledge from documents: Zemanta, Alchemy, OpenCalais, FISE.
AlchemyAPI:
extracts concepts from a text;
assigns a relevance value to each concept;
links each concept to DBpedia and other LOD datasets.

Semantic Filter over a doc
Two steps:
1. Identify the extracted concepts related to our domain of interest.
2. Identify good candidates and discard uninteresting documents.

Semantic Filter over a doc: step 1
Identify the extracted concepts related to our domain of interest (e.g., R&MV).
Let r1 be the reference to the DBpedia entry of an extracted concept (ec), and r2 the reference to the DBpedia entry of a reference ontology concept (rc).
An extracted concept ec is related to the domain if there exists at least one reference concept rc such that
r1 = r2 OR ((r1 dc:subject r) AND (r2 dc:subject r))
where r is a resource.
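The step 1 test can be sketched as a set check. This is an illustration under stated assumptions, not the authors' implementation: the `dc:subject` values are held in a local dictionary instead of being fetched from DBpedia, and every identifier in the toy data (`dbpedia:...`, `category:...`) is made up for the example.

```python
# Sketch of step 1: is an extracted concept related to the reference domain?
# All identifiers below are illustrative toy data, not real DBpedia content.

# DBpedia entries (r2) linked from the reference ontology concepts.
REFERENCE_ENTRIES = {"dbpedia:Photodiode", "dbpedia:Camera"}

# Resource -> set of dc:subject values (assumed to be looked up beforehand).
SUBJECTS = {
    "dbpedia:Photodiode": {"category:Photodetectors"},
    "dbpedia:Phototransistor": {"category:Photodetectors"},
    "dbpedia:Camera": {"category:Cameras"},
    "dbpedia:Clothing": {"category:Clothing"},
}


def is_domain_related(r1, reference_entries, subjects):
    """True if r1 equals some r2, or shares a dc:subject value r with it."""
    for r2 in reference_entries:
        if r1 == r2:
            return True
        # Non-empty intersection: some r with (r1 dc:subject r) and (r2 dc:subject r).
        if subjects.get(r1, set()) & subjects.get(r2, set()):
            return True
    return False


if __name__ == "__main__":
    for entry in ("dbpedia:Photodiode", "dbpedia:Phototransistor", "dbpedia:Clothing"):
        print(entry, is_domain_related(entry, REFERENCE_ENTRIES, SUBJECTS))
```

Concepts that pass this test would go into the related set S1 used in step 2; the rest go into S2.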

Semantic Filter over a doc: step 2
Let S1 be the set of extracted concepts related to our domain.
Let S2 be the set of extracted concepts NOT related to our domain.
A document is a good candidate if
(a) t1 < Sum(relVal(S1)) < t2, with t1 = 0.1 and t2 = 0.4, AND
(b) Sum(relVal(S2)) > t3, with t3 = 0.4.
Constraint (a) ensures that the analyzed document deals with our reference domain, but only to a small extent; constraint (b) ensures that it deals with other topics to a considerable extent.
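The two constraints translate directly into a short predicate. The sketch below uses the thresholds given on the slide (t1 = 0.1, t2 = 0.4, t3 = 0.4); the function name, the dictionary representation of S1 and S2, and the relevance values in the usage lines are assumptions made for illustration.

```python
# Sketch of step 2: decide whether a document is a good candidate.
T1, T2, T3 = 0.1, 0.4, 0.4  # thresholds as stated on the slide


def is_good_candidate(related, unrelated, t1=T1, t2=T2, t3=T3):
    """related / unrelated map concept -> relevance value (S1 and S2)."""
    s1 = sum(related.values())    # Sum(relVal(S1))
    s2 = sum(unrelated.values())  # Sum(relVal(S2))
    # (a) the domain is present, but only slightly; (b) other topics dominate.
    return t1 < s1 < t2 and s2 > t3


if __name__ == "__main__":
    # Relevance values below are invented for illustration only.
    print(is_good_candidate({"Sensor": 0.2}, {"Insect": 0.5, "Energy": 0.3}))
    print(is_good_candidate({}, {"Fashion": 0.9}))
    print(is_good_candidate({"Robot": 0.6, "Robotics": 0.3}, {"Factory": 0.5}))
```

The first call satisfies both constraints; the second fails (a) because the domain is absent, and the third fails (a) because the document is too centered on the domain.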

Filtering: example 1
[Table: extracted concepts and relevance]
The document is about extracting energy from insects.
SUGGESTED AS INTERESTING

Filtering: example 2
[Table: extracted concepts and relevance]
The document is about helping shoppers get the right fit when buying clothes online.
SUGGESTED AS INTERESTING

Filtering: example 3
[Table: extracted concepts and relevance]
The document does not consider Robotics and Machine Vision at all.
NOT INTERESTING document

Filtering: example 4
[Table: extracted concepts and relevance]
The document is too Robotics-oriented: it can surely be useful for experts in the Robotics field, but it does not appear inspiring for lateral thinking.
NOT INTERESTING document

Conclusions and Outlook
Very preliminary work on supporting lateral thinking activities
More experimentation
Using the LOD cloud as much as possible

Questions & Answers