Download presentation
Presentation is loading. Please wait.
Published byByron Caldwell Modified over 8 years ago
1
Using XSLT and PHP to Streamline Populating Proquest ETD metadata into CONTENTdm Yan Wang University of Alabama at Birmingham Libraries May 21 st, 2014
2
UAB Digital Collections Item Types in CONTENTdm
4
ProQuest Submission UAB Graduate School
5
Secure Data Transferring
6
Data receiving and unzipping VBscript Data receivedData unzipped
8
Prepare and Parse the data files PHP
9
Updating Finding Aids in Archon
11
Metadata Process Flow Chart
12
Documents and stylesheets
13
Layered architectures scale well
14
XML as representation of a hierarchy of elements
15
XSL Transform (stylesheet) as tree
16
CONTENTdmProQuestOPAC (bib) Author Main Author: Title Other Author(s): DescriptionAbstractTitle: Degree AwardedClassificationPublisher: LanguageIdentifier / keywordSubjects: TypeNumber of pagesSubject/s (Local): File TypePublication yearPhysical Description of Item: UseDegree dateBibliography: Physical DescriptionSchool codeSummary: Advisor/ChairAdvisorFull Text: Committee MembersPlace of publicationHeld at: DepartmentCountry of publication SchoolISBN DateSource Subject LC (Subject Mesh)University/institution KeywordsUniversity location AvailabilityDissertation/thesis number PublisherProQuest document ID CollectionDocument URL Distributor Copyright Copyright ProQuest, UMI Dissertations Publishing 2013 Signed Approval FormDatabase ProQuest Dissertations & Theses Full Text
17
Crosswalking Metadata For UAB ETD collection ProQuest (source)CONTENTdm (output)Format/ValueNotes xmlroot DISS_submissionitem DISS_description/DISS_title Title Outputs first letter of each word in caps DISS_authorship/DISS_author/DISS_name/DISS_surname DISS_authorship/DISS_author/DISS_name/DISS_fname DISS_authorship/DISS_author/DISS_name/DISS_middle Author Last name, First name Middle Initial. DISS_content/DISS_abstract/pDescription DISS_description/DISS_dates/DISS_co mp_date DateISO 8601 (yyyy-mm-dd) Completion date from ProQuest record. Only year is set to display in CONTENTdm DISS_description/DISS_categorization/DISS_keywordkeywordsConnected by “,” DISS_content/DISS_binary fulltext-url (http://www.mhsl.uab.edu/dt/http://www.mhsl.uab.edu/dt/ 2013(r)/ McAtee_uab_0005M_11130.pdf) Filename concatenated with local location on server staging area DISS_description/DISS_degreeDegree Awardedfields/field/@name=”degree_name”/value DISS_description/DISS_institution/DISS_inst_contactDepartmentfields/field/@name=”department”/value DISS_description/DISS_advisor[1]/DISS_name/DISS_fname DISS_description/DISS_advisor[1]/DISS_name/DISS_surname DISS_description/DISS_advisor[1]/DISS_name/DISS_middle Advisor/Chairfields/field/@name=”advisor[1]”/valueFirst three advisors captured
21
PHP
22
Sharing Metadata from UAB Digital Collections with WorldCat
23
Preservation Strategies and Systems Methods Refreshing (S) Migration (S) Technology Preservation Emulation Preservation Through Redundancy (L) Systems & Technologies for long-term preservation LOCKSS HathiTrust DPN
24
LOCKSS – (Lots Of Copies Keep Stuff Safe) ) Decentralized and distributed preservation Give libraries local custody and control of their assets Preserve the publisher’s original authoritative version Perpetual access – guaranteed and seamless Affordable and Sustainable
25
ADPNet –The Alabama Digital Preservation Network Participating Institutions Alabama Department of Archives and History Auburn University Birmingham Public Library Huntsville-Madison County Public Library Troy University University of Alabama University of Alabama at Birmingham
26
Usage of UAB LOCKSS Node
28
Summary Tools Used WinSCP Notepad++ WindowsCMD Oxygen Putty MarcEdit Excel Programming Vbscript XSLT PHP Regexpression HTML Systems CONTENTdm Voyager WorldCatgateway Archon LOCKSS
29
Thank You… Questions ? Phone: 205-934-6357 Email: yanwang3@uab.edu
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.