The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi.

Slides:



Advertisements
Similar presentations
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Advertisements

Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
PubMed and its search options Jan Emmerich, Sonja Jacobi, Kerstin Müller (5th Semester Library Management)
NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
Introduction to PubMed® (pubmed.gov)
Searching Pubmed Database استخدام قاعدة المعلومات Pubmed د. سيناء عبد المحسن العقيل قسم الصيدلة الإكلينيكية برنامج مهارات البحث العلمي.
Co-ordinated by aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT The Entity Name System (ENS): A technical infrastructure for implementing.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi.
Interfaces for Selecting and Understanding Collections.
The Semantic Web Week 1 Module Content + Assessment Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
CBioC: Massive Collaborative Curation of Biomedical Literature Future Directions.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education, Inc. publishing as Prentice Hall 4-1.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
“Health Insurance Providers - Improving Customer Service through Access of Information & How to Take Advantage of each Platform” Alain Grijseels (INAMI-RIZIV,
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Distributed Computing COEN 317 DC2: Naming, part 1.
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE KIEV, 31 JANUARY.
An Intelligent Broker Architecture for Context-Aware Systems A PhD. Dissertation Proposal in Computer Science at the University of Maryland Baltimore County.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
4th project meeting 27-29/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA agINFRA A data infrastructure for agriculture.
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
In The Name Of God. Jhaleh Narimisaei By Guide: Dr. Shadgar Implementation of Web Ontology and Semantic Application for Electronic Journal Citation System.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
1 The BT Digital Library A case study in intelligent content management Paul Warren
OKKAM – Enabling the Web of Entities A SCALABLE AND SUSTAINABLE SOLUTION FOR SYSTEMATIC AND GLOBAL IDENTIFIER REUSE IN DECENTRALIZED INFORMATION ENVIRONMENTS.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE LVIV, 11 SEPTEMBER.
Distributed Computing COEN 317 DC2: Naming, part 1.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Component Based SW Development and Domain Engineering 1 Component Based Software Development and Domain Engineering.
Okalo Daniel Ikhena Dr. V. Z. Këpuska December 7, 2007.
STASIS Technical Innovations - Simplifying e-Business Collaboration by providing a Semantic Mapping Platform - Dr. Sven Abels - TIE -
Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick.
Semantic based P2P System for local e-Government Fernando Ortiz-Rodriguez 1, Raúl Palma de León 2 and Boris Villazón-Terrazas 2 1 1Universidad Tamaulipeca.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Working with Ontologies Introduction to DOGMA and related research.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Module 9 User Profiles and Social Networking. Module Overview Configuring User Profiles Implementing SharePoint 2010 Social Networking Features.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Architectural Considerations for Semantic Support Group Name: WG5 Source: Martin Bauer (NEC), Joerg Swetina (NEC) Meeting Date: Agenda Item:
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Social Information Processing March 26-28, 2008 AAAI Spring Symposium Stanford University
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
An Overview of Data-PASS Shared Catalog
CCNT Lab of Zhejiang University
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Doron Goldfarb & Yann LE FRANC
Objectives, activities, and results of the database Lituanistika
Database Design Hacettepe University
Aggregating Online Resources: Grolier Online as an Educational Portal
SDMX IT Tools SDMX Registry
Presentation transcript:

The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi

Outline  What is OKKAM?  The case study  Open challenges  Possible connections

Overall Goal  Two years IP European Project (  “Enable the Web of Entities, a global digital space for publishing and managing information about entities, where every entity is uniquely identified, and links between entities can be explicitly specified and exploited in a variety of scenarios.”  Entity Name Server (like DNS)  Grounding of the Semantic Web

The 3 Pillars  Infrastructure Distributed, large-scale repository Matching and ranking algorithms, entity lifecycle Privacy and Security  Okkamized Content “Okkamizers” and OKKAM-empowered tools  Entity Centric Applications Authoring tools Search engine Product-centered knowledge management solution

Research areas  Identity management OKKAM is horizontal, not vertical. Integration of existing ID systems (DOI, OpenID)  Entity Identity Data-level & schema level matching Adaptation  Information Integration & Grounding of the Semantic Web  Large scale repository management Queries, ranking  Models of security, privacy and trust Some info private to third parties

Node Architecture

Entity Centric Authoring Environment  Editor (e.g. Word) with an OKKAM plug-in  Entities are recognized in documents, giving the possibility to provide additional information  Fields of application: FEBS Letters, journal of molecular biosciences, focus on proteins and their interactions ANSA, Italian news agency, focus on people, events, political parties, places, etc.

Entity Centric Authoring Environment Natural Language Processing Determining something is an entity Providing context info to query the OKKAM repository  Information integration From external sources via the OKKAM id Creation of new OKKAM ids Updating profile information  Architecture web-service based to reuse functionality

Entities  Individuals, particulars, instances Products, organizations, associations, countries, events, publications, hotels, people Fictional objects (e.g. Pegasus), from the past (e.g. Plato), abstract (e.g. the Gödel Theorem)  No universal objects, like classes or properties “forcing” the use of the same URIs for logical resources is in principle likely to fail, as people tend to have different views even about the same domain  No schema to store info (loss of generality)

Open issues about entities  ANSA case: event “Microsoft acquires Yahoo!” I need to retrieve exactly that, and compare the same news from Reuters  Is it an entity? Or a combination of entities?  Do we want to say something about acquisition as a class?  Any class is an instance at some conceptualization level (and vice versa)?

Open issues/thoughts  Can there be such a thing as a private entity?  Trust, authority, the SW never cared  Separation between entities and knowledge about entities  TF-IDF under the cap…Sweeping the problem under the rug?  No enforcement of a schema or hierarchy, BUT good P&R and distributed databases

Connections  Envisioned collaborations The Large Knowledge Collider (platform for massive distributed incomplete reasoning)  What would you need to use it? Need for particular entities to be modelled?  Can your research (potentially) contribute to OKKAM? Do you see potentials/pitfalls?

Questions? Thank you!!

Online sources  Online articles databases Science Direct PubMed is a service of the U.S. National Library of Medicine that includes over 17 million citations from MEDLINE and other life science journals for biomedical articles back to the 1950sU.S. National Library of Medicine MEDLINE source of life sciences and biomedical bibliographic information, with nearly eleven million records  Databases of proteins MINT, the Molecular INTeraction database UniProt (Universal Protein Resource) catalog of information on proteins  Controlled vocabularies EMTREE Elsevier’s Life Science Thesaurus. It is a hierarchically structured, controlled vocabulary, for Biomedicine and related Life Sciences.

Strong Points  Very clear and understandable presentation, well presented, lot of discussion  Good question answering: listen to questions, appropriate answers: good! Very good talk, stimulates discussion  Good presentation organization  Interesting presentation, well explained. Good interaction will audience. Slides about entities and issues interesting!

Weak Points  Not clear what timing/scope of the project is very ambitious project!  What about decentralized & autonomous principles of the Web?  Did not mention other systems that tag for examples Web pages based on ontologies, like GATE-based web services and tools (KIM, Melita, SHOW, Annotea..)  Introduction about Web, IDs, ontologies was too vague for people not familiar with these issues  Too much of a “sales” talk. After 15 min still no in depth problems/solutions: only arguments of use and OKKAM specific overview. I would like to know more insight in how to solve the problem since we all understand the problem very well.

Suggestions  Some info on the current status/starting date would be nice  Before “Research areas” add a figure to explain the mapping performed (one id->resource), would allow easier comparison with DNS systems.  The architecture looks to be centralized. Why not using a totally distributed one instead? There exist some P2P DNS systems you could take inspiration from.  Skip “Research areas” slide in such a short presentation. The goal is clear, focus on your solution and mention the problem from “research areas”, when they are applicable  Don’t’ go into implementation details. Focus on the high level concepts, methods and solutions. The problem and solutions are also valid outside OKKAM: talk about these.  General: Good presentation, but talk more about your work and issues instead of about OKKAM in general