EDM Martin Doerr TPDL 2011 Berlin, Germany September 25, 2011


1 EDM Martin Doerr TPDL 2011 Berlin, Germany September 25, 2011
Center for Cultural Informatics, Institute of Computer Science, Foundation for Research and Technology - Hellas

2 Europeana Data Model
A data model (schema) for the next Europeana release (“Danube”)
A collaborative effort of core experts
A model to index rather than to document digital material
A great challenge: to find a minimal but powerful and extensible global model
Satisfying many standards and particularly their promoters…

3 Rationale of EDM
by Antoine Isaac
Precursor: ESE (Europeana Semantic Elements)
  used in the 2008 version of Europeana
  represents the lowest common denominator for object metadata
  converts datasets to a Dublin Core-like standard
  forces interoperability
  major drawback: original metadata is lost
EDM goals
  preserve original data while still allowing for interoperability
  Semantic Web representation

4 EDM requirements & principles
by Antoine Isaac
1. Distinction between “provided object” (painting, book, program) and digital representation
2. Distinction between object and metadata record describing an object
3. Allow for multiple records for the same object, containing potentially contradictory statements about an object
4. Support for objects that are composed of other objects
5. A standard metadata format that can be specialized
6. A standard vocabulary format that can be specialized
7. EDM should be based on existing standards

5 EDM basics
OAI ORE for organization of metadata about an object (Requirements 1-4)
Dublin Core for metadata representation (Requirement 5)
SKOS for vocabulary representation (Requirement 6)
key classes from CRM, some DC classes
key relationships from FRBRoo

6 The class taxonomy in version 5
by Antoine Isaac
[Diagram: class taxonomy rooted at rdfs:Resource. Classes shown: NonInformationResource, InformationResource, WebResource, Event, Agent, Place, Physical Thing, skos:Concept, Time-span, Europeana Object, ore:Aggregation, Information Realization, Europeana Aggregation]

7 Property taxonomy (without ESE properties)
by Antoine Isaac
[Diagram: property taxonomy. Properties shown: dc:relation, wasPresentAt, happenedAt, occurredAt, isRelatedTo, ore:proxyFor, ore:proxyIn, isNextInSequence, isSimilarTo, realizes, dcterms:hasPart, hasType, dcterms:references, hasMet, isDerivativeOf, isSuccessorOf, incorporates, ore:aggregates, isAbout, hasView, landingPage, isAnnotationOf, isRepresentationOf]

8 EDM Major Components
A “flat” model of finding-aid relationships
  generalizes over Dublin Core, CRM, OPM, etc.
  closes the recall-precision gap between keyword search and “advanced search”
  major innovation: the “has met” relationship, a formal deduction from events; it captures the historical links, in contrast to aboutness!
A minimal event model
  ensures representation of historical contexts, spatio-temporal queries and CIDOC CRM compatibility
FRBR(oo) reduced to 3 relationships: derivation, continuation, incorporation
Reuses the epistemic ORE Aggregation and “proxy” model to solve reification, i.e. “who said what” (could also be solved by Named Graphs!)
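The idea that “has met” is deduced from events can be illustrated with a minimal Python sketch. It assumes a simple rule: anything present at the same event has met everything else present there, and also the place where the event happened. The property names follow the slides' ens: prefix; the identifiers, the helper deduce_has_met and the exact rule are assumptions made for illustration, not the normative EDM definition.

```python
from itertools import permutations

# Hypothetical statements as (subject, property, object) tuples.
# Property names follow the slides' ens: prefix; all identifiers are invented.
statements = [
    ("object:manuscript", "ens:wasPresentAt", "event:copying"),
    ("agent:scribe", "ens:wasPresentAt", "event:copying"),
    ("event:copying", "ens:happenedAt", "place:monastery"),
]


def deduce_has_met(statements):
    """Derive ens:hasMet statements from shared events (assumed rule)."""
    participants = {}  # event -> set of things/agents present
    places = {}        # event -> set of places where it happened
    for subject, prop, obj in statements:
        if prop == "ens:wasPresentAt":
            participants.setdefault(obj, set()).add(subject)
        elif prop == "ens:happenedAt":
            places.setdefault(subject, set()).add(obj)

    derived = set()
    for event, present in participants.items():
        # Everything present at the same event has met everything else there.
        for a, b in permutations(sorted(present), 2):
            derived.add((a, "ens:hasMet", b))
        # Everything present has also met the place of the event.
        for a in present:
            for place in places.get(event, ()):
                derived.add((a, "ens:hasMet", place))
    return derived


has_met = deduce_has_met(statements)
for triple in sorted(has_met):
    print(triple)
```

The derived statements are flat, so an index can answer “which objects have met this agent or place?” without traversing the event layer, while the event statements remain available for richer queries.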

9 Europeana EDM, a new indexing standard?
5 core relationships to query “my thing” by other entities, integrated with a minimal event model
[Diagram: EuropeanaObject / “my” Thing linked to Resource, Thing, Agent, Place, TimeSpan, Event and Concept. Properties shown: ens:isRelatedTo, dcterms:hasPart, ens:hasType, dcterms:references, ens:hasMet, ens:wasPresentAt, ens:happenedAt, ens:occurredAt]

10 Example: the necessity of event metadata
by Antoine Isaac
Preserving and exploiting original data also means being compatible with descriptions beyond simple object level
Also crucial for semantic enrichment

11 A flexible model: object and events
by Antoine Isaac

12 EDM, a new indexing standard?
3 additional core relations for information content (from FRBRoo!)
[Diagram: “My” Information Object with the properties isDerivativeOf, isSuccessorOf, incorporates and crm:shows features of]

13 The Utility of FRBR relations: “Blade Runner”
Excerpts from Wikipedia:
Blade Runner is a 1982 American science fiction film, directed by Ridley Scott and starring Harrison Ford, Rutger Hauer, and Sean Young. The screenplay, written by Hampton Fancher and David Peoples, is based on the novel Do Androids Dream of Electric Sheep? by Philip K. Dick.
Seven versions of the film have been shown, for various markets, and as a result of controversial changes made by film executives. A rushed Director's cut was released in 1992 after a strong response to workprint screenings. This, in conjunction with its popularity as a video rental, made it one of the first films released on DVD, resulting in a basic disc with mediocre video and audio quality. In 2007, Warner Bros. released in select theaters and DVD/HD DVD/Blu-ray, the 25th anniversary digitally remastered definitive Final Cut by Scott.
Fancher found a cinema treatment by William S. Burroughs for Alan E. Nourse's novel The Bladerunner (1974), entitled Blade Runner (a movie).
Blade Runner has numerous and deep similarities to Fritz Lang's Metropolis, including a built up urban environment, in which the wealthy literally live above the workers, dominated by a huge building: the Stadtkrone Tower in Metropolis and the Tyrell Building in Blade Runner.
The Blade Runner soundtrack by Vangelis is a dark melodic combination of classic composition and futuristic synthesizers which mirrors the film-noir retro-future envisioned by Ridley Scott. Despite being well received by fans and critically acclaimed and nominated in 1983 for a BAFTA and Golden Globe as best original score, and the promise of a soundtrack album from Polydor Records in the end titles of the film, the release of the official soundtrack recording was delayed for over a decade. There are two official releases of the music from Blade Runner. In light of the lack of a release of an album, the New American Orchestra recorded an orchestral adaptation in 1982 which bore little resemblance to the original. Some of the film tracks would in 1989 surface on the compilation Vangelis: Themes, but not until the 1992 release of the Director's Cut version would a substantial amount of the film's score see commercial release.
These delays and poor reproductions led to the production of many bootleg recordings over the years. A bootleg tape surfaced in 1982 at science fiction conventions and became popular given the delay of an official release of the original recordings, and in 1993 "Off World Music, Ltd." created a bootleg CD that would prove more comprehensive than Vangelis' official CD in 1994. A disc from "Gongo Records" features most of the same material, but with slightly better sound quality. In 2003, two other bootlegs surfaced, the "Esper Edition," closely preceded by "Los Angeles: November 2019". The double disc "Esper Edition" combined tracks from the official release, the Gongo boot and the film itself. Finally "2019" provided a single disc compilation almost wholly consisting of ambient sound from the film, padded out with some sounds from the Westwood game Blade Runner.
A set with 3 CDs of Blade Runner-related Vangelis music was released on December 10, 2007. Titled Blade Runner Trilogy, the first CD contains the same tracks as the 1994 official soundtrack release, the 2nd CD contains previously unreleased music from the movie, and the 3rd CD is all newly composed music from Vangelis, inspired by, and in the spirit of the movie.

14 The complete picture under EDM
[Diagram: graph of Blade Runner related works. Relations shown: is identified by, is derivative of (including “is version” and “is translation of”), is similar to, incorporates, is successor of, has part, is about]
Note: “book”, “film”, etc. are used in place of “thing”.
Appellations: Title: “The Blade Runner”; Title: “Blade Runner (The movie)”; Title: “Blade Runner”
Works shown:
Book: “Do Androids Dream of Electric Sheep?” P.Dick 1968 RIO
Book: “The Blade Runner” A.Nourse 1974 RIO
Book: “Blade Runner (The movie)” W.Burroughs 1979 RIO
Book: “Blade Runner screenplay” H.Fancher & D.Peoples 1982 RIO
Film: “Metropolis” F.Lang 1927 RIO
unreleased Film Score: “Blade Runner” Vangelis 1982 RIO
Film: “Blade Runner” R.Scott 1982 RIO
Comic: “A Marvel.. Blade Runner” A.Goodwin 1982 RIO
Soundtrack: “Blade Runner” Vangelis 1982 RIO
Film score: “Blade Runner” New American Orchestra 1982 RIO
Reprint: “Blade Runner (Do androids..)” P.Dick RIO
Film: “San Diego Sneak Preview” 1982 RIO
Film: “International Cut” 1982 RIO
Film: “US Theatrical Version” 1982 RIO
Videogame: “Blade Runner” CRL Group PLC 1985 DPO
Film: “US Broadcast Version” 1986 RIO
Film: “Director’s Cut” 1992 RIO
bootleg cd: “bootleg Blade Runner recording” Off World Music Ltd 1993 DRO
Film score: “Blade Runner” Vangelis 1994 RIO
Book: “Blade Runner 2: The edge..” K.Jeter 1995 RIO
Book: “Blade Runner 3: Replicant Night” K.Jeter 1996 RIO
Videogame: “Blade Runner” Westwood Studios 1997 DPO
Tv series: “Total Recall 2070” Art Monterastelli 1999 RIO
Book: “Blade Runner 4: Eye and Talon” K.Jeter 2000
Documentary: “On the edge..” NoblesGate Ltd 2000 RIO
Documentary: “Future Shocks” TV Ontario 2003 RIO
Film: “The Final Cut” 2007 RIO
Film score: “Blade Runner Trilogy” Vangelis 2007 DPO
cd: “unreleased Blade Runner music” 1982 DRO
cd: “new Vangelis music” Vangelis 2007 DRO
DVD: “Blade Runner” (The Final Cut) 2007 DRO
Documentary: “Dangerous Days” C.Lauzirika 2007 DPO
Documentary: “All our Variant Futures” P.Prischman 2007 RIO
Film (in Portugal): “Perigo Iminente” RIO
Film (in Venezuela): “El cazador implacable” RIO
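To make the use of the FRBR-style relations concrete, here is a small Python sketch that follows “is derivative of” links back to the works something ultimately draws on. The edges are one illustrative reading of the Wikipedia excerpt on the previous slide, not a transcription of the diagram, and following the relation transitively is an assumption of this sketch. The helper name upstream and the work labels are hypothetical.

```python
# Illustrative edges as (subject, property, object) tuples.
# They are one reading of the Wikipedia excerpt, not the slide's actual diagram.
edges = [
    ("screenplay: Blade Runner", "isDerivativeOf",
     "novel: Do Androids Dream of Electric Sheep?"),
    ("film: Blade Runner", "isDerivativeOf", "screenplay: Blade Runner"),
    ("film: Blade Runner", "incorporates", "score: Blade Runner (Vangelis)"),
    ("film: Director's Cut", "isDerivativeOf", "film: Blade Runner"),
    ("film: The Final Cut", "isDerivativeOf", "film: Director's Cut"),
]


def upstream(work, edges, prop="isDerivativeOf"):
    """Return every work reachable from `work` by following `prop` links."""
    found = []
    stack = [work]
    while stack:
        current = stack.pop()
        for subject, p, obj in edges:
            if subject == current and p == prop and obj not in found:
                found.append(obj)
                stack.append(obj)
    return found


sources = upstream("film: The Final Cut", edges)
for source in sources:
    print(source)
```

Because derivation, continuation and incorporation are kept as separate properties, the same traversal can be run per relation: following incorporates from the film reaches the score, which the derivation chain does not.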

15 The Chaos of documenting and relating only products: even without the book and sound products!
[Diagram: the same works related only by has part, is version and is about]
Note: “book”, “film”, etc. are used in place of “thing”.
Appellations: four Titles shown as “…”; Title: “Blade Runner International Cut”
Works shown:
Book: “Blade Runner screenplay” H.Fancher & D.Peoples 1982 RIO
Film: “Metropolis” F.Lang 1927 RIO
Soundtrack: “Blade Runner” Vangelis 1982 RIO
Comic: “A Marvel.. Blade Runner” A.Goodwin 1982 RIO
Film: “San Diego Sneak Preview” 1982 RIO
Film: “International Cut” 1982 RIO
Film: “US Theatrical Version” 1982 RIO
Videogame: “Blade Runner” CRL Group PLC 1985 DPO
Film: “US Broadcast Version” 1986 RIO
Film: “Director’s Cut” 1992 RIO
Videogame: “Blade Runner” Westwood Studios 1997 DPO
Tv series: “Total Recall 2070” Art Monterastelli 1999 RIO
Documentary: “On the edge..” NoblesGate Ltd 2000 RIO
Documentary: “Future Shocks” TV Ontario 2003 RIO
Film: “The Final Cut” 2007 RIO
Documentary: “All our Variant Futures” P.Prischman 2007 RIO
Film (in Portugal): “Perigo Iminente” RIO
Film (in Venezuela): “El cazador implacable” RIO

16 ORE Model
[Diagram: ore:Aggregation ore:aggregates ore:AggregatedResource; ore:Proxy ore:proxyFor ore:AggregatedResource; ore:Proxy ore:proxyIn ore:Aggregation; all under rdfs:Resource]
ore:Proxy: “description” semantics (OIO ontology!)
Good for archival descriptions: one thing in multiple hierarchies
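The proxy mechanism can be sketched in a few lines of Python: each provider's statements hang off its own proxy, so contradictory claims about the same object stay attributable to the aggregation they came from. The ore: property names are those on the slide; the identifiers, the dc:date values and the helper who_said_what are hypothetical and only serve the illustration.

```python
from collections import defaultdict

# Hypothetical data: two providers describe the same object and disagree on its date.
triples = [
    ("aggregation:museumA", "ore:aggregates", "object:painting"),
    ("proxy:A", "ore:proxyFor", "object:painting"),
    ("proxy:A", "ore:proxyIn", "aggregation:museumA"),
    ("proxy:A", "dc:date", "1642"),
    ("aggregation:libraryB", "ore:aggregates", "object:painting"),
    ("proxy:B", "ore:proxyFor", "object:painting"),
    ("proxy:B", "ore:proxyIn", "aggregation:libraryB"),
    ("proxy:B", "dc:date", "1640"),
]

STRUCTURAL = ("ore:proxyFor", "ore:proxyIn")


def who_said_what(triples, obj):
    """Group descriptive statements about `obj` by the aggregation that made them."""
    proxies = [s for s, p, o in triples if p == "ore:proxyFor" and o == obj]
    views = {}
    for proxy in proxies:
        # Each proxy is assumed to belong to exactly one aggregation.
        aggregation = next(
            o for s, p, o in triples if s == proxy and p == "ore:proxyIn"
        )
        claims = defaultdict(list)
        for s, p, o in triples:
            if s == proxy and p not in STRUCTURAL:
                claims[p].append(o)
        views[aggregation] = dict(claims)
    return views


views = who_said_what(triples, "object:painting")
for aggregation, claims in views.items():
    print(aggregation, claims)
```

Both dates survive side by side; nothing is asserted about the painting itself, only about what each proxy says in the context of its aggregation.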

17 Multiple aggregations = multiple providers
by Antoine Isaac

18 Further Properties
isAnnotationOf: domain Object, range rdfs:Resource
landingPage: domain Europeana Aggregation, range WebResource
hasView: domain ore:Aggregation, range rdfs:Resource
Berlin, Jan. 25-26, 2010 Europeana V1.0 WP3 Meeting
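A rough Python sketch of how these domain and range declarations could be used to check a statement follows. It treats domain and range as constraints to test, which is a simplification: in RDFS they license inference rather than validation. The subclass table is an assumed fragment of the class taxonomy, and reading the slide's “Object” as ens:EuropeanaObject is likewise an assumption.

```python
# Assumed fragment of the class hierarchy (child -> parent); not taken verbatim from the slides.
SUBCLASS_OF = {
    "ens:EuropeanaAggregation": "ore:Aggregation",
    "ore:Aggregation": "rdfs:Resource",
    "ens:WebResource": "ens:InformationResource",
    "ens:EuropeanaObject": "ens:InformationResource",
    "ens:InformationResource": "rdfs:Resource",
}

# (domain, range) per property, following the slide; "Object" is read as ens:EuropeanaObject.
PROPERTIES = {
    "ens:isAnnotationOf": ("ens:EuropeanaObject", "rdfs:Resource"),
    "ens:landingPage": ("ens:EuropeanaAggregation", "ens:WebResource"),
    "ens:hasView": ("ore:Aggregation", "rdfs:Resource"),
}


def is_a(cls, ancestor):
    """True if `cls` equals `ancestor` or is one of its (assumed) subclasses."""
    if ancestor == "rdfs:Resource":
        return True  # everything is a resource
    while cls is not None:
        if cls == ancestor:
            return True
        cls = SUBCLASS_OF.get(cls)
    return False


def fits(subject_cls, prop, object_cls):
    """Check a statement's subject and object classes against the property's domain and range."""
    domain, rng = PROPERTIES[prop]
    return is_a(subject_cls, domain) and is_a(object_cls, rng)


print(fits("ens:EuropeanaAggregation", "ens:hasView", "ens:WebResource"))
print(fits("ens:WebResource", "ens:landingPage", "ens:WebResource"))
```

The first call passes because a Europeana Aggregation is an ore:Aggregation; the second fails because a WebResource is not in the domain of landingPage.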

19 Source view of Europeana EDM classes
Legend: Exx Class = CRM class (abstract or concrete); ens:Class = EDM class (abstract or concrete); Exx Class ens:Class = concrete CRM&EDM class; ore:Class = concrete ORE class
[Diagram: classes under rdfs:Resource]
ORE and DCMI classes: ore:Proxy, dcmitype:Collection, ore:Aggregation
EDM classes: ens:EuropeanaAggregation, ens:InformationResource, ens:WebResource, ens:EuropeanaObject, ens:NonInformationResource
CRM classes shown alongside EDM classes: E89 Propositional Object / ens:InformationResource; E52 Time-Span / ens:Time-Span; E55 Type / SKOS:Concept; E18 Physical Thing / ens:Physical Thing; E53 Place / ens:Place; E39 Actor / ens:Agent; E4 Period / ens:Event

20 How to Integrate ORE?
Bad model!!
AggregatedResource: anything that is related by my aggregation, and not by a “nature”??? Or can only AggregatedResources be aggregated???
[Diagram: classes rdfs:Resource, ore:AggregatedResource, ore:Proxy, ore:Aggregation, ens:NonInformationResource, E89 Propositional Object, ens:InformationResource, ens:WebResource, ens:EuropeanaObject, ens:EuropeanaAggregation; properties ore:proxyFor, ore:aggregates, ore:proxyIn]

21 A better ORE Model?
Aggregations and Proxies can be aggregated; Proxies point to anything?
[Diagram: classes rdfs:Resource, ens:NonInformationResource, E89 Propositional Object, ens:InformationResource, ore:AggregatedResource, ens:WebResource, ore:Aggregation, ore:Proxy, ens:EuropeanaObject, ens:EuropeanaAggregation; properties ore:proxyFor, ore:aggregates, ore:proxyIn]

22 A relaxed ORE Model?
AggregatedResource a term for anything?
[Diagram: classes rdfs:Resource, ore:AggregatedResource, ens:NonInformationResource, E89 Propositional Object, ens:InformationResource, ens:WebResource, ore:Aggregation, ore:Proxy, ens:EuropeanaObject, ens:EuropeanaAggregation; properties ore:proxyFor, ore:aggregates, ore:proxyIn]

23 Integrated Europeana EDM Class Diagram
Legend: Exx Class = CRM class (abstract or concrete); ens:Class = EDM class (abstract or concrete); Exx Class ens:Class = concrete CRM&EDM class; Fxx Class = concrete FRBRoo class; ore:Class = concrete ORE class; new IsA links for integration
[Diagram: integrated class hierarchy under rdfs:Resource. Classes shown: E1 CRM Entity, E2 Temporal Entity, E3 Condition State, E4 Period / ens:Event, E5 Event, E52 Time-Span / ens:Time-Span, E53 Place / ens:Place, ens:NonInformationResource, ore:AggregatedResource, E39 Actor / ens:Agent, E21 Person, E74 Group, E77 Persistent Item, E70 Thing, E72 Legal Object, E18 Physical Thing / ens:Physical Thing, E19 Physical Object, E26 Physical Feature, E24 Physical M-M Thing, E71 Man-Made Thing, E28 Conceptual Object, E55 Type / SKOS:Concept, E90 Symbolic Object, E89 Propositional Object / ens:InformationResource, E73 Information Object, F2 Expression, ore:Proxy]

24 Europeana EDM Class Diagram cont’d
Legend: Exx Class = CRM class (abstract or concrete); ens:Class = EDM class (abstract or concrete); Exx Class ens:Class = concrete CRM&EDM class; Fxx Class = concrete FRBRoo class
[Diagram: classes shown: dcmitype:Collection, ore:Aggregation, ens:EuropeanaAggregation, ore:Proxy, ens:WebResource, ens:EuropeanaObject, F2 Expression, F22 Self Contained Expression, F23 Expression Fragment, F24 Publication Expression, F25 Performance Plan, F26 Recording]

25 Current status EDM Definitions v5.2
EDM Primer – 05/08/10 (fitting v5.2)

26 Conclusions
The Europeana EDM model is a great generalization over virtually all existing metadata formats
It suits particularly poor metadata (as most is!) and automatic metadata enrichment
It is not suited for documentation!
It is extensible: do not manually reduce your metadata!
Map your metadata formats to adequate richer standards and automatically create EDM views for global indexing and querying!

