The mapping process – some observations Robina Clayphan EDLF.


Local schemas > ESE Data Flow

Management of the process
- Sheer complexity of managing the hundreds of files going through the steps in the process
- Keeping track of the status of the files
  - straightforward ones: in the right place for the next step
  - problem ones: refer back to provider or a developer
- Use of SharePoint document libraries and rapid establishment of procedures that all must adhere to
- The management of the process evolved during implementation: a very steep learning curve
- Maintenance of authority files
  - getting meta-metadata from the providers (types etc.)
  - collection IDs

(sort of) Policy issues
- Inclusion criterion: must have a link giving direct access to the digital object
  - check if URLs in data actually resolve to the object described
- Often:
  - resolve to metadata page with e.g. pdf icon: how many clicks are acceptable? Need for policy decision
  - granularity mismatch: link at title level only
- Sometimes:
  - 404 page not found: refer to provider
  - persistence of URLs
  - need a plug-in (e.g. DjVu): is that OK?
- Occasionally: a log-in required for restricted access resources
- Need for providers to ensure they only provide links to resources that can be accessed
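The link check described on this slide could be partly automated. The following is a minimal sketch, not the procedure the Europeana team used: the category names, the `classify` and `check_url` helpers, and the rule that an HTML response probably means a metadata landing page are all assumptions made for illustration. An HTML page can itself be the digital object, so the "landing_page" result is only a flag for human review.

```python
from urllib.error import HTTPError, URLError
from urllib.request import Request, urlopen

# Content types treated as "probably a web page about the object" (an assumption).
HTML_TYPES = ("text/html", "application/xhtml+xml")


def classify(status, content_type):
    """Map an HTTP status and Content-Type to one of the cases on the slide."""
    if status in (401, 403):
        return "restricted"       # log-in required for restricted resources
    if status == 404:
        return "not_found"        # refer back to the provider
    if status != 200:
        return "error"
    media_type = content_type.split(";")[0].strip().lower()
    if media_type in HTML_TYPES:
        return "landing_page"     # may be a metadata page; needs a policy decision
    return "direct_object"        # e.g. application/pdf, image/jpeg


def check_url(url, timeout=10):
    """Send a HEAD request and classify the response (redirects are followed)."""
    request = Request(url, method="HEAD")
    try:
        with urlopen(request, timeout=timeout) as response:
            return classify(response.status, response.headers.get("Content-Type", ""))
    except HTTPError as err:
        return classify(err.code, "")
    except URLError:
        return "error"            # DNS failure, refused connection, timeout
```

Some servers reject HEAD requests, so a real check might fall back to GET. Plug-in requirements and the number of clicks needed to reach the object cannot be detected this way and would still need manual inspection.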

Data level problems 1
- Trying to understand the decision-making process of the original metadata creators
  - What they meant by e.g. dc:date, dc:source
- Trying to discern the (implicit) data model of the original metadata creators
  - What is the dc:relation referring to?
- Understanding data in a foreign language or foreign script
  - Is negyedévenként really Hungarian for terminally? And, if so, why is it in dc:format?

Data level problems 2
- Questions to developers that arose from examining the data
  - All records have two instances of dc:identifier: the first a URL, the second (possibly) a shelfmark. Need to map each instance to a different ESE element. Can it be done?
  - All records have two instances of dc:rights, the first appropriate, the second not. Is it possible to just display the first and ignore the second?
  - Where values had been divided between multiple instances of the same element, could they be concatenated with punctuation for a better display? E.g. spatial1, spatial2, spatial3 used for a geographic hierarchy. Another with up to 14 instances of dc:subject.
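The questions above can be sketched in code. This is an illustrative sketch only, using Python's standard `xml.etree.ElementTree`. The sample record, the helper names, the choice of `europeana:isShownAt` as the target for URL identifiers, and the "; " separator are assumptions, not the mapping the development team actually applied.

```python
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"

# Hypothetical record with repeated elements, as described on the slide.
SAMPLE = f"""<record xmlns:dc="{DC}">
  <dc:identifier>http://example.org/object/1</dc:identifier>
  <dc:identifier>MS 123</dc:identifier>
  <dc:rights>Public domain</dc:rights>
  <dc:rights>internal note</dc:rights>
  <dc:subject>maps</dc:subject>
  <dc:subject>rivers</dc:subject>
</record>"""


def values(record, name):
    """Return the non-empty text values of every dc:<name> instance, in order."""
    texts = [(el.text or "").strip() for el in record.findall(f"{{{DC}}}{name}")]
    return [t for t in texts if t]


def map_identifiers(record):
    """Route each dc:identifier by what it looks like, not by its position."""
    mapped = []
    for value in values(record, "identifier"):
        if value.lower().startswith(("http://", "https://")):
            mapped.append(("europeana:isShownAt", value))  # assumed target element
        else:
            mapped.append(("dc:identifier", value))        # e.g. a shelfmark
    return mapped


def first_only(record, name):
    """Keep the first instance of a repeated element and ignore the rest."""
    found = values(record, name)
    return found[0] if found else None


def join_repeated(record, name, sep="; "):
    """Concatenate repeated instances with punctuation for display."""
    return sep.join(values(record, name))


record = ET.fromstring(SAMPLE)
print(map_identifiers(record))
print(first_only(record, "rights"))
print(join_repeated(record, "subject"))
```

Routing identifiers by pattern rather than by position is a design choice: it still works if a provider emits the shelfmark before the URL, but it would need extending for identifiers that are neither.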

Normalisation level
- At the normalisation stage you can see if your interpretation of the record actually makes sense when it has been processed against the source data.
- Apply the Quality Control Checklist
- Edit mapping and repeat!

(my) Conclusion
- All indicates that it is easier if the mapping and normalising are done as close to source as possible, ideally by the providers
  - they are the ones who understand what the data means and can make sensible mapping decisions
  - they understand the language and script
- Tools would be nice!

Local schemas > ESE Data Flow
[Diagram. Labels: Transform data to populate local repository (#0); Export data to Europeana (#5); Aggregator?; EuropeanaLocal; Aggregator with provider?]

EuropeanaLocal Content Provider Model - to illustrate movement of metadata only
[Diagram. Labels: Content provider local systems; Customised transformations to e.g. OAI-DC; Content provider repositories; Harvesting of e.g. OAI-DC; Aggregator; No metadata transformations; EuropeanaLocal Parallel Test Environment; Mapping and transformation to ESE, including elements; Europeana]

Issues for EuropeanaLocal
- Currently a great deal of manual effort goes into metadata transformation
  - at provider sites: local format to repository format
  - by the Europeana development team: harvested format to ESE
  - normalisation by Europeana development team
- Where will this work happen in EuropeanaLocal?
  - feasibility of central Europeana staff handling hundreds more collections?
- Can we minimise the current manual overhead?
- What are the possibilities for automating all or some of the transformation work?