Download presentation
Presentation is loading. Please wait.
Published byIngeborg Rønningen Modified over 6 years ago
1
eSciDoc –Object and content modelling experiences
Natasa Bulatovic, MPDL
2
eSciDoc Development teams
FIZ Team: Fachinformationszentrum Karlsruhe MPDL Team: Max Planck Digital Library, Munich Service Management (SvM) User interface engineering (GUI) Software Developement (DEV)
3
Max-Planck Society: Subject domains
Biology and Medicine Section Developmental and Evolutionary Biology/Genetics Immunobiology and Infection Biology/Medicine Cognition Research Microbiology/Ecology Neurosciences Plant Research Structural and Cell Biology Chemistry, Physics and Technology Section Astronomy/Astrophysics Chemistry Solid State Research/Material Sciences Earth Sciences and Climate Research High Energy and Plasma Physics/Quantum Optics Computer Science/Mathematics/Complex Systems Humanities Section Cultural Studies Jurisprudence Social and Behavioral Sciences 20000 potential users (Scientists, Librarians, Research groups, Project groups) in the 80 research institutes of the Max Planck Society
4
eSciDoc initial focus Main usage scenarios
Publication Data (Institutional repository, metadata management, quality assurance workflows) Research Data (Digital collections of images, multimedia content, cooperative authoring and annotating) Common functions that have to be supported Generalized and unified data model for all content resources Content modeling of resources as a specialization instrument Versioning PID User management, authentication and authorization Service orientation means: Reusability — regardless of whether immediate reuse opportunities exist, services are designed to support potential reuse. Formalized contracts— For services to interact, they need not share anything but a formal contract that describes each service and defines the terms of information exchange. Loose coupling - services are loosely coupled— Services must be designed to interact without the need for tight, cross-service dependencies. Abstraction - of logic-the only part of a service that is visible to the outside world is what is exposed via the service contract. Underlying logic, beyond what is expressed in the descriptions that comprise the contract, is invisible and irrelevant to service requestors. Composability- services may compose other services. This allows logic to be represented at different levels of granularity and promotes reusability and the creation of abstraction layers. Autonomous - the logic governed by a service resides within an explicit boundary. The service has control within this boundary and is not dependent on other services for it to execute its governance. Statelessness - services should not be required to manage state information, as that can impede their ability to remain loosely coupled. Services should be designed to maximize statelessness even if that means deferring state management elsewhere. Discoverability - services should allow their descriptions to be discovered and understood by humans and service requestors that may be able to make use of their logic
5
eSciDoc core services Context handler Item handler Container handler
“We need high level of abstraction and some common functions …” Context handler Item handler Container handler Organizational units handler Role handler Content model handler Semantic store handler .. Data-centric (CRUD), logic-centric (versioning, release, withdraw, etc.).
6
eSciDoc intermediate services
“.. but we also need some more added functionality …” Duplicate detection Image handling Metadata handler Validation of data Retrieval/download statistics Workflow management Functionality adding services, Technology gateways, adapters, façades
7
eSciDoc application services
“… we still need to create additional services…” Depositing Publishing Quality assurance Citation manager Export manager SearchAndOutput Controlled vocabularies Process-centric or simply public services shared with partner instutitions
8
Towards specialization
“.. and in addition end user interfaces to manage different content …” Publication items (at least one author must exist) Face images (metadata describe object on the image) Digitized Book (law books from Holly Roman Empire) Language resources (description of languages, features, other resources) “… different metadata schema and controlled vocabularies…” metadata profiles management Metadata schema/ context / content model specific validation rules “… with different workflows …” Publication management workflow (“depositing” service enriches core workflow)
9
eSciDoc solutions Publication management Image collections
Scanned books
10
eSciDoc project landscape
11
Solution development: towards data specialization
Services work with generic object patterns such as Items, Containers, Metadata Solutions work with specific content models such as Publication Items, Albums, Law Books Solutions require different metadata sets such as Publication metadata, Face image metadata, Law Book and Page metadata
12
eSciDoc generic data model
13
Publication data model
14
Publication item (styled XML)
Please use to check on latest demo data available
15
Publication Item (View)
16
Faces data model Faces Album Faces Item
17
Faces Album and Item (XML)
Faces Item: NOTE: Links may not be functional all time, as data are for DEMO purposes Please use to check on latest demo data available
18
Faces Album (View)
19
Faces Item (View)
20
VIRR data model CM1: Multivolume CM2: Volume CM1: Scanned Page
CM2: Table of Contents
21
Virr multvolume, volume (XML)
Multivolume: Volumes: (has Toc Item) NOTE: Links may not be functional all time, as these data are for development purposes Check to check if demo data are available or for latest version of VIRR Solution Scanned page: TOC:
22
VIRR Multivolume, volume (View)
23
Virr volume (View)
24
Virr Page (View)
25
eSciDoc services and solutions: What next?
Extending existing solutions and services Setting up productive environment (performance, service authorization, data ingestion) More data diversity Building more services and solutions (component reusability, tools, enhance SOA) Improve data interoperability, data dissemination, data repurposing Community-based open source development
26
eSciDoc - Content model definition
See also: For more details
27
eSciDoc project resources
eSciDoc project web pages, Infrastructure download MPDL collaboratory network MPDL software download The project webpage is a useful information resource for developers or any other interested people. The project webpage contains useful information on the build environment, source code access, javadocs, code analysis etc.
28
Thank you!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.