Presentation is loading. Please wait.

Presentation is loading. Please wait.

Using XSLT and PHP to Streamline Populating Proquest ETD metadata into CONTENTdm Yan Wang University of Alabama at Birmingham Libraries May 21 st, 2014.

Similar presentations


Presentation on theme: "Using XSLT and PHP to Streamline Populating Proquest ETD metadata into CONTENTdm Yan Wang University of Alabama at Birmingham Libraries May 21 st, 2014."— Presentation transcript:

1 Using XSLT and PHP to Streamline Populating Proquest ETD metadata into CONTENTdm Yan Wang University of Alabama at Birmingham Libraries May 21 st, 2014

2 UAB Digital Collections Item Types in CONTENTdm

3

4 ProQuest Submission UAB Graduate School

5 Secure Data Transferring

6 Data receiving and unzipping VBscript Data receivedData unzipped

7

8 Prepare and Parse the data files PHP

9 Updating Finding Aids in Archon

10

11 Metadata Process Flow Chart

12 Documents and stylesheets

13 Layered architectures scale well

14 XML as representation of a hierarchy of elements

15 XSL Transform (stylesheet) as tree

16 CONTENTdmProQuestOPAC (bib) Author Main Author: Title Other Author(s): DescriptionAbstractTitle: Degree AwardedClassificationPublisher: LanguageIdentifier / keywordSubjects: TypeNumber of pagesSubject/s (Local): File TypePublication yearPhysical Description of Item: UseDegree dateBibliography: Physical DescriptionSchool codeSummary: Advisor/ChairAdvisorFull Text: Committee MembersPlace of publicationHeld at: DepartmentCountry of publication SchoolISBN DateSource Subject LC (Subject Mesh)University/institution KeywordsUniversity location AvailabilityDissertation/thesis number PublisherProQuest document ID CollectionDocument URL Distributor Copyright Copyright ProQuest, UMI Dissertations Publishing 2013 Signed Approval FormDatabase ProQuest Dissertations & Theses Full Text

17 Crosswalking Metadata For UAB ETD collection ProQuest (source)CONTENTdm (output)Format/ValueNotes xmlroot DISS_submissionitem DISS_description/DISS_title Title Outputs first letter of each word in caps DISS_authorship/DISS_author/DISS_name/DISS_surname DISS_authorship/DISS_author/DISS_name/DISS_fname DISS_authorship/DISS_author/DISS_name/DISS_middle Author Last name, First name Middle Initial. DISS_content/DISS_abstract/pDescription DISS_description/DISS_dates/DISS_co mp_date DateISO 8601 (yyyy-mm-dd) Completion date from ProQuest record. Only year is set to display in CONTENTdm DISS_description/DISS_categorization/DISS_keywordkeywordsConnected by “,” DISS_content/DISS_binary fulltext-url (http://www.mhsl.uab.edu/dt/http://www.mhsl.uab.edu/dt/ 2013(r)/ McAtee_uab_0005M_11130.pdf) Filename concatenated with local location on server staging area DISS_description/DISS_degreeDegree Awardedfields/field/@name=”degree_name”/value DISS_description/DISS_institution/DISS_inst_contactDepartmentfields/field/@name=”department”/value DISS_description/DISS_advisor[1]/DISS_name/DISS_fname DISS_description/DISS_advisor[1]/DISS_name/DISS_surname DISS_description/DISS_advisor[1]/DISS_name/DISS_middle Advisor/Chairfields/field/@name=”advisor[1]”/valueFirst three advisors captured

18

19

20

21 PHP

22 Sharing Metadata from UAB Digital Collections with WorldCat

23 Preservation Strategies and Systems Methods  Refreshing (S)  Migration (S)  Technology Preservation  Emulation  Preservation Through Redundancy (L) Systems & Technologies for long-term preservation  LOCKSS  HathiTrust  DPN

24 LOCKSS – (Lots Of Copies Keep Stuff Safe) ) Decentralized and distributed preservation Give libraries local custody and control of their assets Preserve the publisher’s original authoritative version Perpetual access – guaranteed and seamless Affordable and Sustainable

25 ADPNet –The Alabama Digital Preservation Network Participating Institutions  Alabama Department of Archives and History  Auburn University  Birmingham Public Library  Huntsville-Madison County Public Library  Troy University  University of Alabama  University of Alabama at Birmingham

26 Usage of UAB LOCKSS Node

27

28 Summary Tools Used WinSCP Notepad++ WindowsCMD Oxygen Putty MarcEdit Excel Programming Vbscript XSLT PHP Regexpression HTML Systems CONTENTdm Voyager WorldCatgateway Archon LOCKSS

29 Thank You… Questions ? Phone: 205-934-6357 Email: yanwang3@uab.edu


Download ppt "Using XSLT and PHP to Streamline Populating Proquest ETD metadata into CONTENTdm Yan Wang University of Alabama at Birmingham Libraries May 21 st, 2014."

Similar presentations


Ads by Google