EDM Martin Doerr TPDL 2011 Berlin, Germany September 25, 2011

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
1 ICS –FORTH, Oct.30-Nov.4,2006, Cyprus Documenting Events in Metadata Martin Doerr, Athina Kritsotaki Center for Cultural Informatics Institute of Computer.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
ICS-FORTH May 23, An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute.
1 CIDOC CRM + FRBR ER = FRBR OO … an equation for a harmonised view of museum information and bibliographic information Martin Doerr First CASPAR Seminar.
By Ahmet Can Babaoğlu Abdurrahman Beşinci.  Suppose you want to buy a Star wars DVD having such properties;  wide-screen ( not full-screen )  the extra.
1 Modelling Intellectual Processes: The object-orient FRBR Model Martin Doerr Center for Cultural Informatics Institute of Computer Science Foundation.
Data modeling at Europeana Antoine Isaac METS Workshop at the Digital Libraries 2014 Conference London, Sept. 11, 2014.
P12 occurred in the presence of (was present at) P11 had participant P16 used specific object P25 moved P31 has modified P92 brought into existence P33.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
EMu and Archives NA EMu Users Conference – Oct Slide 1 EMu and Archives Experiences from the Canada Science and Technology Museum Corporation.
ICS – FORTH, August 31, 2000 Why do we need an “Object Oriented Model” ? Martin Doerr Atlanta, August 31, 2000 Foundation for Research and Technology -
Idea-garden.org SOCIAL SEMANTIC INFORMATION SPACE An Interactive Learning Environment Fostering Creativity Grant agreement no: nd CIDOC CRM-SIG.
Harmonising without Harm: towards an object-oriented formulation of FRBR aligned on the CIDOC CRM ontology Maja Žumer (University of Ljubljana) & Patrick.
Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013.
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
The Europeana Data Model: Constraints and Opportunities Prof. Dr. Stefan Gradmann Based on work done with M. Doerr, S. Hennicke, A. Isaac, C. Meghini,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Metadata, the CARARE Aggregation service and 3D ICONS Kate Fernie, MDR Partners, UK.
METADATA QUALITY IN EUROPEANA , Den Haag.
P12 occurred in the presence of (was present at) P11 had participant P16 used specific object P25 moved P31 has modified P92 brought into existence P33.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
EConnect WP1 & semantic issues VU members –Guus Schreiber, Antoine Isaac, Jacco van Ossenbruggen, Jan Wielemaker.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
EDM Europeana Data Model Guus Schreiber with input from Carlo Meghini, Antoine Isaac, Stefan Gradmann, Maxx Dekkers et al. from Europeana V1.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
CASEY A. MULLIN WITH: LALA HAJIBAYOVA SCOTT MCCAULAY DECEMBER 8, 2008 FRBR in RDF: a proof-of-concept model 1 ©2008 Casey A. Mullin.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
1 The Europeana Data Model (EDM): Object Representations, Context and Semantics Prof. Dr. Stefan Gradmann Humboldt-Universität zu Berlin / School of Library.
Introduction to multimedia
RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee
The Semantic Web By: Maulik Parikh.
Harmonized EDM-CRM-FRBRoo
Modelling Intellectual Processes: The object-orient FRBR Model
Harmonized EDM-CRM-FRBRoo-CRMdig
Multiple approaches to archival description
Harmonized EDM-CRM-FRBRoo
From FRBR to FRBROO through CIDOC CRM…
Resource Description Framework
Harmonized EDM-CRM-FRBRoo
Harmonized EDM-CRM-FRBRoo
FRBRoo and performing arts
RECORDKEEPING METADATA STANDARDS: THE INTERNATIONAL CONTEXT
Telling tails: metadata standards and the digital humanities
IFLA FRBR-Library Reference Model and RDA
Outline Pursue Interoperability: Digital Libraries
RDA, linked data, and update on development
Attributes and Values Describing Entities.
Metadata for research outputs management
Metadata - Catalogues and Digitised works
NSDL Data Repository (NDR)
Harmonized EDM-CRM-FRBRoo
RDA and semantic data Gordon Dunsire
Harmonized EDM-CRM-FRBRoo
LOD reference architecture
MUMT611: Music Information Acquisition, Preservation, and Retrieval
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
RDA in a non-MARC environment
Antoine Isaac SEMIC conference
The new RDA: resource description in libraries and beyond
Modelling Intellectual Processes: The object-orient FRBR Model
Attributes and Values Describing Entities.
FRBR and FRAD as Implemented in RDA
Modelling Intellectual Processes: The object-orient FRBR Model
Future directions for RDA
Presentation transcript:

EDM Martin Doerr TPDL 2011 Berlin, Germany September 25, 2011 Center for Cultural Informatics Institute of Computer Science Foundation for Research and Technology - Hellas Berlin, Germany September 25, 2011

Europeana Data Model A data model (schema) for the next Europeana release (“Danube”) A collaborative effort of core experts A model to index rather than to document digital material A great challenge to find a minimal but powerful and extensible global model Satisfying many standards and particularly their promoters….

Rationale of EDM Precursor: ESE (Europeana Semantic Elements) by Antoine Isaac Precursor: ESE (Europeana Semantic Elements) used in 2008 version of Europeana represents lowest common denominator for object metadata convert datasets to Dublin-Core like standard forces interoperability major drawback: original metadata is lost EDM goals preserve original data while still allowing for interoperability Semantic Web representation

EDM requirements & principles by Antoine Isaac Distinction between “provided object” (painting, book, program) and digital representation Distinction between object and metadata record describing an object Allow for multiple records for same object, containing potentially contradictory statements about an object Support for objects that are composed of other objects A standard metadata format that can be specialized Standard vocabulary format that can be specialized EDM should be based on existing standards

EDM basics OAI ORE for organization of metadata about an object Requirements 1-4 Dublin Core for metadata representation Requirements 5 SKOS for vocabulary representation Requirements 6 key classes from CRM, some DC classes key relationships from FRBRoo

The class taxonomy in version 5 by Antoine Isaac rdfs:Resource NonInformation Resource Information Resource Web Resource Event Agent Place Physical Thing skos: Concept Time-span Europeana Object ore: Aggregation Information Realization Europeana Aggregation 6

Property taxonomy (without ESE properties) by Antoine Isaac dc:relation wasPresentAt happenedAt occuredAt isRelatedTo ore: proxyFor ore: proxyIn isNext InSequence isSimilarTo realizes dcterms: hasPart hasType dcterms: references hasMet is DerivativeOf is SuccessorOf incorporates ore: aggregates isAbout hasView landingPage is AnnotationOf is RepresentationOf 7

EDM Major Components A “flat” model of finding aids relationships generalizes over Dublin Core, CRM, OPM…etc. closes the recall-precision gap between keyword search and “advanced search” major innovation: “has met” relationship – a formal deduction from events - the historical links in contrast to the aboutness ! A minimal event model ensures representation of historical contexts, spatio-temporal queries and CIDOC CRM compatibility FRBR(OO) reduced to 3 relationships: derivation, continuation, incorporation Reuses the epistemic ORE Aggregation and “proxy” model to solve reification, “who said what” (could also be solved by Named Graphs!)

Europeana EDM, a new indexing standard? integrated with a minimal event model 5 core relationships to query “my thing” by other entities ens:isRelatedTo dcterms:has part Resource ens:hasType dcterms:references Agent ens:was present at EuropeanaObject/ “my” Thing ens:was present at has part Thing ens:hasMet ens:happenedAt Place ens:occurredAt TimeSpan Event ens:hasType Concept

Example: the necessity of event metadata Preserving and exploiting original data also means being compatible with descriptions beyond simple object level Also crucial for semantic enrichment by Antoine Isaac

A flexible model: object and events by Antoine Isaac

EDM, a new indexing standard? crm: shows features of “My” Information Object isDerivativeOf isSuccessorOf incorporates 3 additional core relations for information content (from FRBRoo !)

The Utitility of FRBR relations: “Blade Runner” Excerpts from Wikipedia (http://en.wikipedia.org/wiki/Blade_Runner) Blade Runner is a 1982 American science fiction film, directed by Ridley Scott and starring Harrison Ford, Rutger Hauer, and Sean Young. The screenplay, written by Hampton Fancher and David Peoples, is based on the novel Do Androids Dream of Electric Sheep? by Philip K. Dick. Seven versions of the film have been shown, for various markets, and as a result of controversial changes made by film executives. A rushed Director's cut was released in 1992 after a strong response to workprint screenings. This, in conjunction with its popularity as a video rental, made it one of the first films released on DVD, resulting in a basic disc with mediocre video and audio quality. In 2007, Warner Bros. released in select theaters and DVD/HD DVD/Blu-ray, the 25th anniversary digitally remastered definitive Final Cut by Scott. Fancher found a cinema treatment by William S. Burroughs for Alan E. Nourse's novel The Bladerunner (1974), entitled Blade Runner (a movie). Blade Runner has numerous and deep similarities to Fritz Lang's Metropolis, including a built up urban environment, in which the wealthy literally live above the workers, dominated by a huge building—the Stadtkrone Tower in Metropolis and the Tyrell Building in Blade Runner. The Blade Runner soundtrack by Vangelis is a dark melodic combination of classic composition and futuristic synthesizers which mirrors the film-noir retro-future envisioned by Ridley Scott. Despite being well received by fans and critically acclaimed and nominated in 1983 for a BAFTA and Golden Globe as best original score, and the promise of a soundtrack album from Polydor Records in the end titles of the film, the release of the official soundtrack recording was delayed for over a decade. There are two official releases of the music from Blade Runner. In light of the lack of a release of an album, the New American Orchestra recorded an orchestral adaptation in 1982 which bore little resemblance to the original. Some of the film tracks would in 1989 surface on the compilation Vangelis: Themes, but not until the 1992 release of the Director's Cut version would a substantial amount of the film's score see commercial release. These delays and poor reproductions led to the production of many bootleg recordings over the years. A bootleg tape surfaced in 1982 at science fiction conventions and became popular given the delay of an official release of the original recordings, and in 1993 "Off World Music, Ltd." created a bootleg CD that would prove more comprehensive than Vangelis' official CD in 1994. A disc from "Gongo Records" features most of the same material, but with slightly better sound quality. In 2003, two other bootlegs surfaced, the "Esper Edition," closely preceded by "Los Angeles: November 2019". The double disc "Esper Edition" combined tracks from the official release, the Gongo boot and the film itself. Finally "2019" provided a single disc compilation almost wholly consisting of ambient sound from the film, padded out with some sounds from the Westwood game Blade Runner. A set with 3 CDs of Blade Runner-related Vangelis music was released on December 10, 2007. Titled Blade Runner Trilogy, the first CD contains the same tracks as the 1994 official soundtrack release, the 2nd CD contains previously unreleased music from the movie, and the 3rd CD is all newly composed music from Vangelis, inspired by, and in the spirit of the movie

The complete picture under EDM Book: “Do Androids Dream of Electric Sheep?” P.Dick 1968 RIO Appellation Book: “The Blade Runner” A.Nourse 1974 RIO is identified by Title: “The Blade Runner” Appellation Title: “Blade Runner (The movie)” is derivative of is derivative of Book: “Blade Runner (The movie)” W.Burroughs 1979 RIO Appellation is derivative of is identified by Title: “Blade Runner” is derivative of Book: “Blade Runner screenplay” H.Fancher&D.Peoples 1982 RIO Film: “Metropolis” F.Lang 1927 RIO unreleased Film Score: “Blade Runner” Vangelis 1982 RIO is identified by is similar to incorporates Film: “Blade Runner” R.Scott 1982 RIO incorporates incorporates is derivative of is derivative of is derivative of Comic: A Marvel.. Blade Runner” A.Goodwin 1982 RIO Soundtrack: “Blade Runner” Vangelis 1982 RIO Film score: “Blade Runner” New American Orchestra 1982 RIO Reprint: “Blade Runner (Do androids..)”P.Dick RIO Film: “San Diego Sneak Preview” 1982 RIO is derivative of (is version) is similar to Film: “International Cut” 1982 RIO Videogame: “Blade Runner” CRL Group P LC 1985 DPO incorporates Film: “US Theatrical Version”1982 RIO bootleg cd: “bootleg Blade Runner recording” Off World Music Ltd 1993 DRO incorporates is successor of Videogame: “Blade Runner” Westwood Studios 1997 DPO Film: “US Broadcast Version”1986 RIO Film score: “Blade Runner” Vangelis 1994 RIO Book: “Blade Runner 2: The edge..” K.Jeter 1995 RIO Film: “Director’s Cut” 1992 RIO incorporates Film: “The Final Cut” 2007 RIO incorporates is successor of Book: “Blade Runner 3: Replicant Night”K.Jeter 1996 RIO Film score: “Blade RunnerTrilogy”Vangelis 2007 DPO Anti gia thing ebala book, film, etc. is derivative of (is translation of) is about Film: (in Portugal):”Perigo Iminente” RIO is similar to has part has part is successor of Tv series: “Total Recall 2070” Art Monterastelli 1999 RIO cd: “unreleased Blade Runner music”1982 DRO is similar to RIO Film: “(in Venezuela):”El cazador implacable” RIO cd: “new Vangelis music” Vangelis 2007 DRO Book: “Blade Runner 4: Eye and Talon” K.Jeter 2000 incorporates Documentary: “On the edge..”NoblesGate Ltd 2000 RIO is about DVD: “Blade Runner” (The Final Cut) 2007 DRO Documentary: “Future Shocks” TV Ontario 2003 RIO Documentary: “Dangerous Days” C.Lauzirika 2007 DPO Documentary: “All our Variant Futures” P.Prischman 2007 RIO

The Chaos of documenting and relating only products: Title: “…” Appellation Title: “…” Appellation The Chaos of documenting and relating only products: even without the book and sound products! Title: “…” Appellation Title: “…” Appellation Appellation Title: “Blade Runner International Cut” Book: “Blade Runner screenplay” H.Fancher&D.Peoples 1982 RIO Film: “Metropolis” F.Lang 1927 RIO has part is version Soundtrack: “Blade Runner” Vangelis 1982 RIO has part Film: “San Diego Sneak Preview” 1982 RIO Comic: A Marvel.. Blade Runner” A.Goodwin 1982 RIO is version Film: “International Cut” 1982 RIO Film: “US Theatrical Version”1982 RIO Videogame: “Blade Runner” CRL Group P LC 1985 DPO is version is version Film: “US Broadcast Version”1986 RIO is version Videogame: “Blade Runner” Westwood Studios 1997 DPO Film: “Director’s Cut” 1992 RIO Film: “The Final Cut” 2007 RIO Anti gia thing ebala book, film, etc. is version is about Film: (in Portugal):”Perigo Iminente” RIO is version Tv series: “Total Recall 2070” Art Monterastelli 1999 RIO Film: “(in Venezuela):”El cazador implacable” RIO Documentary: “On the edge..”NoblesGate Ltd 2000 RIO Documentary: “Future Shocks” TV Ontario 2003 RIO Documentary: “All our Variant Futures” P.Prischman 2007 RIO

ORE Model ore:Proxy : “description” semantics (OIO ontology!) rdfs:Resource ore:proxyFor ore:AggregatedResource ore:Proxy ore:aggregates ore:Aggregation ore:proxyIn ore:Proxy : “description” semantics (OIO ontology!) Good for archival descriptions: One thing in multiple hierarchies

Multiple aggregations = multiple providers by Antoine Isaac

Further Properties Berlin, Jan. 25-26, 2010 Europeana V1.0 WP3 Meeting Object domain is AnnotationOf range rdfs:Resource Europeana Aggregation domain landing Page range WebResource ore: Aggregation domain has View range rdfs:Resource Berlin, Jan. 25-26, 2010 Europeana V1.0 WP3 Meeting 18

Source view of Europeana EDM classes ore:Proxy dcmitype:Collection ore:Aggregation ens:EuropeanaAggregation E89 Propositional Object ens:InformationResource rdfs:Resource ens:WebResource ens:EuropeanaObject E52 Time-Span ens:Time-Span Exx Class : abstract CRM class E55 Type SKOS:Concept ens:Class : abstract EDM class E18 Physical Thing ens:Pysical Thing Exx Class : concrete CRM class ens:Class E53 Place ens:Place : concrete EDM class ens:NonInformationResource Exx Class ens:Class : concrete CRM&EDM class E39 Actor ens:Agent ore:Class : concrete ORE class E4 Period ens:Event

How to Integrate ORE? Bad model!! AggregatedResource: rdfs:Resource Bad model!! ore:proxyFor ore:AggregatedResource ore:Proxy ore:aggregates ore:Aggregation ore:proxyIn ens:NonInformationResource E89 Propositional Object ens:InformationResource AggregatedResource: Anything that is related by my aggregation, and not by a “nature” ??? OR Only AggregatedResources can be aggregated??? ens:WebResource ens:EuropeanaObject ens:EuropeanaAggregation

A better ORE Model? Aggregations and Proxies can be aggregated, rdfs:Resource ore:proxyFor ens:NonInformationResource E89 Propositional Object ens:InformationResource ore:AggregatedResource ore:aggregates ens:WebResource ore:Aggregation ore:Proxy ens:EuropeanaObject ore:proxyIn Aggregations and Proxies can be aggregated, Proxies point to anything? ens:EuropeanaAggregation

A relaxed ORE Model? AggregatedResource a term for anything? rdfs:Resource ore:proxyFor ore:AggregatedResource ens:NonInformationResource E89 Propositional Object ens:InformationResource ore:aggregates ens:WebResource ore:Aggregation ore:Proxy ens:EuropeanaObject ore:proxyIn AggregatedResource a term for anything? ens:EuropeanaAggregation

Integrated Europeana EDM Class Diagram Exx Class : abstract CRM class E4 Period ens:Event E5 Event E2 Temporal Entity ens:Class : abstract EDM class E3 Condition State Exx Class : concrete CRM class E52 Time-Span ens:Time-Span ens:Class : concrete EDM class Exx Class ens:Class E53 Place ens:Place : concrete CRM&EDM class ens:NonInformationResource E21 Person Fxx Class : concrete FRBRoo class ore:AggregatedResource E39 Actor ens:Agent E74 Group ore:Class : concrete ORE class E19Physical Object new IsA links for integration E26 Physical Feature E1 CRM Entity E18 Physical Thing ens:Pysical Thing E77 Persistent Item E72 Legal Object E24 Physical M-M Thing E55 Type SKOS:Concept E70 Thing E90 Symbolic Object F2 Expression E71 Man-Made Thing E28 Conceptual Object rdf:Resource E89 Propositional Object ens:InformationResource E73 Information Object ore:Proxy

Europeana EDM Class Diagram cont’d Exx Class : abstract CRM class ens:Class : abstract EDM class Exx Class : concrete CRM class ens:Class : concrete EDM class Exx Class ens:Class : concrete CRM&EDM class Fxx Class : concrete FRBRoo class dcmitype:Collection ore:Aggregation ens:EuropeanaAggregation ore:Proxy F23 Expression Fragment F24 Publication Expression ens:WebResource ens:EuropeanaObject F2 Expression F22 Self Contained Expression F25 Performance Plan F26 Recording

Current status EDM Definitions v5.2 EDM Primer – 05/08/10 (fitting v5.2) http://group.europeana.eu/web/europeanaproject/technicaldocuments/

Conclusions The Europeana EDM model is a great generalization over virtually all existing metadata formats It satisfies particularly poor (as most!) metadata and automatic metadata enrichment It is not suited for documentation! It is extensible: Do not manually reduce your metadata! Map your metadata formats to adequate richer standards and create automatically EDM views for global indexing and querying!