Presentation is loading. Please wait.

Presentation is loading. Please wait.

The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi.

Similar presentations


Presentation on theme: "The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi."— Presentation transcript:

1 The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi

2 Outline  What is OKKAM?  The case study  Open challenges  Possible connections

3 Overall Goal  Two years IP European Project (http://fp7.okkam.org/)  “Enable the Web of Entities, a global digital space for publishing and managing information about entities, where every entity is uniquely identified, and links between entities can be explicitly specified and exploited in a variety of scenarios.”  Entity Name Server (like DNS)  Grounding of the Semantic Web

4 The 3 Pillars  Infrastructure Distributed, large-scale repository Matching and ranking algorithms, entity lifecycle Privacy and Security  Okkamized Content “Okkamizers” and OKKAM-empowered tools  Entity Centric Applications Authoring tools Search engine Product-centered knowledge management solution

5 Research areas  Identity management OKKAM is horizontal, not vertical. Integration of existing ID systems (DOI, OpenID)  Entity Identity Data-level & schema level matching Adaptation  Information Integration & Grounding of the Semantic Web  Large scale repository management Queries, ranking  Models of security, privacy and trust Some info private to third parties

6 Node Architecture

7 Entity Centric Authoring Environment  Editor (e.g. Word) with an OKKAM plug-in  Entities are recognized in documents, giving the possibility to provide additional information  Fields of application: FEBS Letters, journal of molecular biosciences, focus on proteins and their interactions ANSA, Italian news agency, focus on people, events, political parties, places, etc.

8 Entity Centric Authoring Environment Natural Language Processing Determining something is an entity Providing context info to query the OKKAM repository  Information integration From external sources via the OKKAM id Creation of new OKKAM ids Updating profile information  Architecture web-service based to reuse functionality

9 Entities  Individuals, particulars, instances Products, organizations, associations, countries, events, publications, hotels, people Fictional objects (e.g. Pegasus), from the past (e.g. Plato), abstract (e.g. the Gödel Theorem)  No universal objects, like classes or properties “forcing” the use of the same URIs for logical resources is in principle likely to fail, as people tend to have different views even about the same domain  No schema to store info (loss of generality)

10 Open issues about entities  ANSA case: event “Microsoft acquires Yahoo!” I need to retrieve exactly that, and compare the same news from Reuters  Is it an entity? Or a combination of entities?  Do we want to say something about acquisition as a class?  Any class is an instance at some conceptualization level (and vice versa)?

11 Open issues/thoughts  Can there be such a thing as a private entity?  Trust, authority, the SW never cared  Separation between entities and knowledge about entities  TF-IDF under the cap…Sweeping the problem under the rug?  No enforcement of a schema or hierarchy, BUT good P&R and distributed databases

12 Connections  Envisioned collaborations The Large Knowledge Collider (platform for massive distributed incomplete reasoning)  What would you need to use it? Need for particular entities to be modelled?  Can your research (potentially) contribute to OKKAM? Do you see potentials/pitfalls?

13 Questions? Thank you!!

14 Online sources  Online articles databases Science Direct PubMed is a service of the U.S. National Library of Medicine that includes over 17 million citations from MEDLINE and other life science journals for biomedical articles back to the 1950sU.S. National Library of Medicine MEDLINE source of life sciences and biomedical bibliographic information, with nearly eleven million records  Databases of proteins MINT, the Molecular INTeraction database UniProt (Universal Protein Resource) catalog of information on proteins  Controlled vocabularies EMTREE Elsevier’s Life Science Thesaurus. It is a hierarchically structured, controlled vocabulary, for Biomedicine and related Life Sciences.

15 Strong Points  Very clear and understandable presentation, well presented, lot of discussion  Good question answering: listen to questions, appropriate answers: good! Very good talk, stimulates discussion  Good presentation organization  Interesting presentation, well explained. Good interaction will audience. Slides about entities and issues interesting!

16 Weak Points  Not clear what timing/scope of the project is very ambitious project!  What about decentralized & autonomous principles of the Web?  Did not mention other systems that tag for examples Web pages based on ontologies, like GATE-based web services and tools (KIM, Melita, SHOW, Annotea..)  Introduction about Web, IDs, ontologies was too vague for people not familiar with these issues  Too much of a “sales” talk. After 15 min still no in depth problems/solutions: only arguments of use and OKKAM specific overview. I would like to know more insight in how to solve the problem since we all understand the problem very well.

17 Suggestions  Some info on the current status/starting date would be nice  Before “Research areas” add a figure to explain the mapping performed (one id->resource), would allow easier comparison with DNS systems.  The architecture looks to be centralized. Why not using a totally distributed one instead? There exist some P2P DNS systems you could take inspiration from.  Skip “Research areas” slide in such a short presentation. The goal is clear, focus on your solution and mention the problem from “research areas”, when they are applicable  Don’t’ go into implementation details. Focus on the high level concepts, methods and solutions. The problem and solutions are also valid outside OKKAM: talk about these.  General: Good presentation, but talk more about your work and issues instead of about OKKAM in general


Download ppt "The OKKAM project the quest for a web of uniquely identified entities Stefano Bocconi."

Similar presentations


Ads by Google