Download presentation
Presentation is loading. Please wait.
Published byGerard Stevenson Modified over 9 years ago
1
Merritt Repository Depositing Content and Providing Access University of California Curation Center Team California Digital Library July 28, 2011 UC3 Summer Webinar Series
2
Merritt summary Curation repository – Supporting long-term preservation and access – Publish, share, preserve, discover, (re-)use “Model free” – There are no prescriptive requirements for content genre, format, structure, or accompanying metadata No service fee (for UC affiliates) – Contributors are billed only for storage, $1.04/GB/year Cost of a physical book in offsite storage$4.62/year Cost of a digital book in HathiTrust$0.15/year Cost of a digital book in Merritt$0.06/year Cost of a dataset in Merritt$1.00/year Cost of a physical book in offsite storage$4.62/year Cost of a digital book in HathiTrust$0.15/year Cost of a digital book in Merritt$0.06/year Cost of a dataset in Merritt$1.00/year For more information, review the June 9 webinar http://www.cdlib.org/uc3/uc3webinars.html For more information, review the June 9 webinar http://www.cdlib.org/uc3/uc3webinars.html
3
Master recipe Registration (one time) [contributor → UC3, uc3@ucop.edu ] uc3@ucop.edu Submission [contributor → Merritt] Ingest [Merritt] Notification [Merritt → contributor] Discovery/delivery [consumer → Merritt → consumer]
4
Registration Contact Perry Willett, Merritt service manager uc3@ucop.edu uc3@ucop.edu
5
Submission User interface METS feeder API manual deposits existing DPR workflows automated deposits
6
UI submission The submission package is always a single file An opportunity to supply descriptive metadata
7
UI submission The submission package is always a single file, which may be: – For a single object The complete object A multi-file object in a container (zip, gzip, tar.gz) A multi-file object defined by a manifest – For a batch of objects A manifest referring to single file objects A manifest referring to objects in containers A manifest referring to objects defined by manifests
8
Manifest A “packing slip” for an object, providing URLs for all object’s file components – Object manifest Algorithm = a dler32, crc32, md2, md5, sha1, sha256, sha384, sha256 See User’s Guide and online help for more information http://merritt.cdlib.org/http://merritt.cdlib.org/ fileURL | hashAlgorithm | hashValue | fileSize | fileName | mimeType... fileURL | hashAlgorithm | hashValue | fileSize | fileName | mimeType... #%checkm_0.7 #%profile| http://uc3.cdlib.org/registry/ingest/manifest/mrt-ingest-manifest #%prefix | mrt: | http://merritt.cdlib.org/terms# #%prefix | nfo: | http://www.semanticdesktop.org/ontologies/2007/03/22/nfo# #%fields | nfo:fileUrl | nfo:hashAlgorithm | nfo:hashValue | nfo:fileSize | nfo:fileLastModified | nfo:fileName | mrt:mimeType http://merritt.cdlib.org/samples/call911.jpg | md5 | 47d321056e60944a06973... http://merritt.cdlib.org/samples/call911.txt | md5 | 77fe42b1055bbabe51648... #%eof #%checkm_0.7 #%profile| http://uc3.cdlib.org/registry/ingest/manifest/mrt-ingest-manifest #%prefix | mrt: | http://merritt.cdlib.org/terms# #%prefix | nfo: | http://www.semanticdesktop.org/ontologies/2007/03/22/nfo# #%fields | nfo:fileUrl | nfo:hashAlgorithm | nfo:hashValue | nfo:fileSize | nfo:fileLastModified | nfo:fileName | mrt:mimeType http://merritt.cdlib.org/samples/call911.jpg | md5 | 47d321056e60944a06973... http://merritt.cdlib.org/samples/call911.txt | md5 | 77fe42b1055bbabe51648... #%eof
9
Manifest A “packing slip” for a batch, providing URLs for all object’s file components – Batch manifest Batch of single file objects Batch of container objects Batch of manifest objects An Excel macro is available for automatically generating manifests from spreadsheets http://merritt.cdlib.org/docs/merrittManifest.xls http://merritt.cdlib.org/docs/merrittManifest.xls See User’s Guide and online help for more information http://merritt.cdlib.org/http://merritt.cdlib.org/ fileURL | hashAlgorithm | hashValue | fileSize | fileName | primaryID | localID | creator | title | date... fileURL | hashAlgorithm | hashValue | fileSize | fileName | primaryID | localID | creator | title | date...
10
Metadata Submission form Batch manifest Object component: mrt-erc.txt erc: who: Blaine, Tegan Woodward what: Continuous measurements of atmospheric argon/nitrogen... when: 2005 where: ark:/20775/bb21509964 erc: who: Blaine, Tegan Woodward what: Continuous measurements of atmospheric argon/nitrogen... when: 2005 where: ark:/20775/bb21509964 Dublin KernelDublin CoreElement whocreatorResponsible person or party whattitleContent description whendateLifecycle-meaningful date whereidentifierLocally-meaningful identifier http://dublincore.org/groups/kernel/spec/
11
METS feeder METS must conform to a profile documented in the CDL Guidelines for Digital Objects http://www.cdlib.org/services/dsc/contribute/docs/GDO.pdf – METS, all referenced file components, and manifest must be web accessible – The Merritt IP address can be provided for configuring firewall rules Feeder manifest Submission http://url/path/mets.xml... http://url/path/mets.xml... http://feeder.cdlib.org/?userID=id&authCode=passwd& accessGroupID=collection&manifestURL=manifest http://feeder.cdlib.org/?userID=id&authCode=passwd& accessGroupID=collection&manifestURL=manifest
12
API submission FieldValue filenameoptionalFile name filerequiredFile contents typeoptional File type: file batch-manifest container container-batch-manifest object-manifest single-file-batch-manifest profilerequiredProfile (supplied by UC3) primaryIdentifieroptionalPrimary identifier (ARK) localIdentifieroptionalLocal identifier digestTypeoptional Message digest type: adler-32 sha-1 crc-32 sha-256 md2 sha-384 md5 sha-512
13
API submission FieldoptionalValue digestValueoptionalMessage digest value (hexadecimal encoded) creatoroptionalCreator titleoptionalTitle dateoptionalDate noteoptionalDescriptive note responseFormoptionalResponse form: anvl json xhtml xml
14
API submission POST /object/ingest HTTP/1.1 Host: merritt.cdlib.org Content-type: multipart/form-data; boundary=boundary --boundary Content-disposition: form-data; name=“file”; filename=“filename” file --boundary Content-disposition: form-data; name=“type” type --boundary Content-disposition: forma-data; name=“profile” profile --boundary... POST /object/ingest HTTP/1.1 Host: merritt.cdlib.org Content-type: multipart/form-data; boundary=boundary --boundary Content-disposition: form-data; name=“file”; filename=“filename” file --boundary Content-disposition: form-data; name=“type” type --boundary Content-disposition: forma-data; name=“profile” profile --boundary...
15
API submission cURL http://curl.haxx.se/ % curl –s –u user password –F “file=@manifest” -F “type=manifest-type” -F “profile=profile” -F “localIdentifier=identifier” -F “creator=creator” -F title=title” http://merritt.cdlib.org/object/ingest % curl –s –u user password –F “file=@manifest” -F “type=manifest-type” -F “profile=profile” -F “localIdentifier=identifier” -F “creator=creator” -F title=title” http://merritt.cdlib.org/object/ingest
16
Ingest Primary identifier – ARK (required; auto-generated by if not supplied) – DOI (can be optionally requested from ) Validation Characterization SIP → AIP ISO 1472, Open Archival Information System (OAIS)
17
Notification You will receive two email separate notifications – Initial notification that we have received your submission, and that it is queued for subsequent processing – Final notification that we have fully processed your submission UC3’s preservation commitment starts at the time of final notification
18
Initial notification From: UC3 Merritt Support [mailto:uc3@ucop.edu] Sent: Thursday, July 14, 2011 3:28 PM To: Stephen Abrams Subject: Completion of submission Completion of submission - Notification - Submission ID: bid-4ed4bf45-aa78-4da7-bb65- 63b125d88150 - Job(s): Number of pending job(s): 1 Number of completed job(s): 0 Number of failed job(s): 0 - User agent: slabrams - Submission date: 2011-07-14T15:27:41-07:00 - Status: QUEUED From: UC3 Merritt Support [mailto:uc3@ucop.edu] Sent: Thursday, July 14, 2011 3:28 PM To: Stephen Abrams Subject: Completion of submission Completion of submission - Notification - Submission ID: bid-4ed4bf45-aa78-4da7-bb65- 63b125d88150 - Job(s): Number of pending job(s): 1 Number of completed job(s): 0 Number of failed job(s): 0 - User agent: slabrams - Submission date: 2011-07-14T15:27:41-07:00 - Status: QUEUED Completion of submission - Notification Report - Submission ID: bid-4ed4bf45-aa78-4da7-bb65-63b125d88150 - Job(s): - Job ID: jid-3498bef6-e296-429d-b652-da1f35f8bc04 - Primary ID: ark:/20775/bb21509964 - Local ID: http://libraries.ucsd.edu/ark:/20775/bb21509964;b4946677;umi-ucsd-1040 - Filename: manifest2.txt - Object title: Continuous measurements of atmospheric argon/nitrogen as a tracer of air-sea heat flux : models, methods, and data - Object creator: Blaine, Tegan Woodward - Object date: 2005 - Status: PENDING - User agent: slabrams - Submission date: 2011-07-14T15:27:41-07:00 - Status: QUEUED Completion of submission - Notification Report - Submission ID: bid-4ed4bf45-aa78-4da7-bb65-63b125d88150 - Job(s): - Job ID: jid-3498bef6-e296-429d-b652-da1f35f8bc04 - Primary ID: ark:/20775/bb21509964 - Local ID: http://libraries.ucsd.edu/ark:/20775/bb21509964;b4946677;umi-ucsd-1040 - Filename: manifest2.txt - Object title: Continuous measurements of atmospheric argon/nitrogen as a tracer of air-sea heat flux : models, methods, and data - Object creator: Blaine, Tegan Woodward - Object date: 2005 - Status: PENDING - User agent: slabrams - Submission date: 2011-07-14T15:27:41-07:00 - Status: QUEUED With attachment, bid-4ed4bf45-aa78-4da7-bb65-63b125d88150.txt
19
Final notification From: UC3 Merritt Support [mailto:uc3@ucop.edu] Sent: Thursday, July 14, 2011 3:28 PM To: Stephen Abrams Subject: Completion of ingest Notification Summary - Submission ID: bid-4ed4bf45-aa78-4da7-bb65-63b125d88150 - Job(s): Number of pending job(s): 0 Number of completed job(s): 1 Number of failed job(s): 0 - User agent: slabrams - Queue Priority: 06 - Submission date: 2011-07-14T15:27:41-07:00 - Completion date: 2011-07-14T15:27:53-07:00 - Status: COMPLETED From: UC3 Merritt Support [mailto:uc3@ucop.edu] Sent: Thursday, July 14, 2011 3:28 PM To: Stephen Abrams Subject: Completion of ingest Notification Summary - Submission ID: bid-4ed4bf45-aa78-4da7-bb65-63b125d88150 - Job(s): Number of pending job(s): 0 Number of completed job(s): 1 Number of failed job(s): 0 - User agent: slabrams - Queue Priority: 06 - Submission date: 2011-07-14T15:27:41-07:00 - Completion date: 2011-07-14T15:27:53-07:00 - Status: COMPLETED With attachment, bid-4ed4bf45-aa78-4da7-bb65-63b125d88150.txt Completion of ingest - Notification Report - Submission ID: bid-4ed4bf45-aa78-4da7-bb65-63b125d88150 - Job(s): - Job ID: jid-3498bef6-e296-429d-b652-da1f35f8bc04 - Primary ID: ark:/99999/fk4vm4kg6 - Local ID: ark:/20775/bb21509964 - Version: 3 - Filename: manifest2.txt - Object title: Continuous measurements of atmospheric argon/nitrogen as a tracer of air-sea heat flux : models, methods, and data - Object creator: Blaine, Tegan Woodward - Object date: 2005 - Object state: http://store-stage.cdlib.org:35121/state/2111/ark%3A%2F99999%2Ffk4vm4kg6?t=xhtml - Submission date: 2011-07-14T15:27:46-07:00 - Completion date: 2011-07-14T15:27:53-07:00 - Status: COMPLETED - User agent: slabrams - Queue Priority: 06 - Submission date: 2011-07-14T15:27:41-07:00 - Completion date: 2011-07-14T15:27:53-07:00 - Status: COMPLETED Completion of ingest - Notification Report - Submission ID: bid-4ed4bf45-aa78-4da7-bb65-63b125d88150 - Job(s): - Job ID: jid-3498bef6-e296-429d-b652-da1f35f8bc04 - Primary ID: ark:/99999/fk4vm4kg6 - Local ID: ark:/20775/bb21509964 - Version: 3 - Filename: manifest2.txt - Object title: Continuous measurements of atmospheric argon/nitrogen as a tracer of air-sea heat flux : models, methods, and data - Object creator: Blaine, Tegan Woodward - Object date: 2005 - Object state: http://store-stage.cdlib.org:35121/state/2111/ark%3A%2F99999%2Ffk4vm4kg6?t=xhtml - Submission date: 2011-07-14T15:27:46-07:00 - Completion date: 2011-07-14T15:27:53-07:00 - Status: COMPLETED - User agent: slabrams - Queue Priority: 06 - Submission date: 2011-07-14T15:27:41-07:00 - Completion date: 2011-07-14T15:27:53-07:00 - Status: COMPLETED
20
Discovery/delivery Search
21
Discovery/delivery Search
22
Discovery/delivery Search
23
Discovery/delivery Browse
24
Discovery/delivery Browse
25
Coming soon … Enhanced characterization – JHOVE2 http://jhove2.org/ Faceted search/browse – XTF (the technology behind ) http://xtf.cdlib.org/ Investigation of CMS/DAMS-like function through integration with … – Islandora/Drupal (in cooperation with UCLA) – Alfresco (in cooperation with UCB) – Omeka (in cooperation with UCSC)
26
Questions?
27
Upcoming webinars Date/timeTopic Thursday, August 11 2:00 pm EZID: Create and Manage Persistent Identifiers Joan Starr, UC3/CDL Thursday, August 25 2:00 pm DCXL (Data Curation Excel) Carly Strasser, UC3/CDL Thursday, Sept. 22 2:00 pm Data Management Planning Tool Patricia Cruse/Tracy Seneca, UC3/CDL http://www.cdlib.org/uc3/uc3webinars.html
28
For more information UC Curation Center http://www.cdlib.org/uc3 http://www.cdlib.org/uc3/contact.html uc3@ucop.edu Stephen AbramsDavid Loy Lisa ColvinMark Reyes Patricia Cruse Abhishek Salve Scott Fisher Tracy Seneca Erik Hetzner Joan Starr Greg Janée Carly Strasser John Kunze Marisa Strong Margaret LowPerry Willett UC3 webinar series http://www.cdlib.org/uc3/uc3webinars.html Merritt repository http://merritt.cdlib.org/ http://merritt.cdlib.org/help http://merritt.cdlib.org/docs/merritt_handout.pdf http://merritt.cdlib.org/docs/merritt_user_guide.pdf
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.