The PeDALS approach.  Pete Watters Arizona State Library, project coordinator  Richard Pearce-Moses Clayton State University, Georgia,


The PeDALS approach

• Pete Watters, Arizona State Library, project coordinator
• Richard Pearce-Moses, Clayton State University, Georgia, principal investigator
• Brian Schnackel, Arizona State Library, lead developer

• PeDALS strives for OAIS compliance
• Archivists focus on process, not individual records
• Business rules…
  ◦ generate normalized metadata
  ◦ transform SIPs into standardized AIPs
  ◦ create DIPs for each record
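The rule-driven flow above can be sketched as a chain of small functions applied to a package. This is only an illustration under stated assumptions: the `Package` class, the rule names, and the list of discovery fields are hypothetical and are not taken from the PeDALS code base.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Package:
    """Hypothetical information package: a payload plus its metadata."""
    kind: str                      # "SIP", "AIP", or "DIP"
    payload: bytes
    metadata: dict = field(default_factory=dict)


Rule = Callable[[Package], Package]


def normalize_metadata(pkg: Package) -> Package:
    # Lower-case keys and trim whitespace so every record has the same shape.
    cleaned = {k.strip().lower(): str(v).strip() for k, v in pkg.metadata.items()}
    return Package(pkg.kind, pkg.payload, cleaned)


def to_aip(pkg: Package) -> Package:
    # Relabel the package as an archival copy and record where it came from.
    return Package("AIP", pkg.payload, {**pkg.metadata, "source_kind": pkg.kind})


def to_dip(pkg: Package) -> Package:
    # Keep only the fields assumed to be useful for discovery.
    public = {k: v for k, v in pkg.metadata.items() if k in {"title", "creator", "date"}}
    return Package("DIP", pkg.payload, public)


def apply_rules(pkg: Package, rules: list[Rule]) -> Package:
    # The archivist maintains the rule list; records are never handled one by one.
    for rule in rules:
        pkg = rule(pkg)
    return pkg


sip = Package("SIP", b"...", {" Title ": " Budget memo ", "Creator": "J. Doe"})
aip = apply_rules(sip, [normalize_metadata, to_aip])
dip = to_dip(aip)
```

The point of the design is that changing archival behavior means editing the rule list, not touching individual records.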

• Suited to the PeDALS methodology
• Born digital
• Potential for historical value
• Message transmission information provides a rich source of metadata
• All partners had Outlook PST files

• Atomize individual messages
  ◦ To store as individual AIPs
  ◦ To disseminate as browser-friendly DIPs
• Create a database of rich metadata
  ◦ From the process: to support administration
  ◦ From the headers: to support discovery
  ◦ From BagIt, New Zealand Metadata Extractor, other sources: to support preservation
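As a rough illustration of pulling discovery metadata from message headers, the sketch below uses Python's standard `email` package. It assumes a message has already been exported as RFC 5322 text, which is not how PeDALS itself processed PST files; the sample message and the `header_metadata` function are invented for the example.

```python
from email import message_from_string
from email.utils import getaddresses, parsedate_to_datetime

# Invented sample message; addresses use the reserved example.org domain.
RAW = """\
From: "Doe, Jane" <jane.doe@example.org>
To: John Roe <john.roe@example.org>, records@example.org
Subject: Budget memo
Date: Mon, 01 Mar 2010 09:30:00 -0700

Body text.
"""


def header_metadata(raw: str) -> dict:
    """Return a small metadata record built from the message headers."""
    msg = message_from_string(raw)
    return {
        # getaddresses yields (display name, address) pairs; either part may be empty.
        "from": getaddresses(msg.get_all("From", [])),
        "to": getaddresses(msg.get_all("To", [])),
        "subject": msg.get("Subject", ""),
        "date": parsedate_to_datetime(msg["Date"]).isoformat(),
    }


meta = header_metadata(RAW)
```

Each record produced this way could become one row in the metadata database, keyed to the AIP for that message.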

• PeDALS is intended for permanent records
• PeDALS is not a records management system
• Deleting files is difficult at best

• When negotiating with the originating office, archivists encourage weeding PSTs of non-permanent records
• Archivists work with rules rather than records; they don’t have time to weed the collections
• If you give us junk, we’ll archive junk.
• PSTs plucked from hard drives can work, but are more likely to generate errors during processing.

• Metadata taken from headers was surprisingly messy
• One response is to learn to cope with a complete lack of authority control
• Another is to correct the data by “data wrangling” from within the database

• Senders and recipients can be an email address or a display name from one or more contact lists
  ◦ “Janet Napolitano” or “Napolitano, Janet” or “Janet” or “J Napolitano”?
• Subject line is not a reliable source for titles or abstracts: often blank, repetitive, or a remnant from an unrelated message
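A minimal sketch of the kind of "data wrangling" that can reduce this mess, assuming display names are the only input. The `name_key` function is hypothetical and deliberately simple: it reorders "Last, First" forms and folds case and spacing, nothing more.

```python
from collections import defaultdict


def name_key(display: str) -> str:
    """Build a crude grouping key from a display name."""
    name = display.strip().strip('"').strip()
    if "," in name:
        # Turn "Last, First" into "First Last".
        last, first = (part.strip() for part in name.split(",", 1))
        name = f"{first} {last}"
    # Fold case and collapse runs of whitespace.
    return " ".join(name.lower().split())


variants = ["Janet Napolitano", "Napolitano, Janet ", "Janet", "J Napolitano"]

groups = defaultdict(list)
for variant in variants:
    groups[name_key(variant)].append(variant)
```

Only the first two variants end up under one key. "Janet" and "J Napolitano" stay separate, which shows why the absence of authority control cannot be fixed by string cleanup alone and still needs a reviewer's judgment.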

• Email (and other records) may be open to the public by statute, but some content may be sensitive
  ◦ Personally identifying information
  ◦ Private information (intimate, of no public interest)
• Repositories must develop procedures and policies for aggregates that may have some records with sensitive information

• Boucher/Stearns draft legislation for online privacy would require “notice to and consent of an individual prior to the collection and disclosure of certain personal information” such as street and email addresses, phone numbers, aliases, and other common information.
• Excludes government agencies, but may include academic libraries.
• Possible chilling effect on archives: keeping such information confidential would effectively block access to email and many other records

• PST file structure was proprietary
• Considered third-party Outlook plug-ins
  ◦ Smithsonian Institution had done research
• Adopted open-source PST export utility
  ◦ No longer supported
  ◦ Written in Visual Basic

• Could generate human-readable XML of messages
• Was based on code open to the public
• Did not require understanding of PST structure

It’s more than just email
• What to do with tasks, calendar items, contacts?
• Need to give the archivist the ability to decide what to keep
• What about viruses, corrupt attachments?

What is the record? What are we authenticating?
• PST as database; messages are constructs of fields in tables tied together by keys and other tables
• XML is the best way to preserve these relations and dependencies
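To illustrate how XML nesting and attributes can carry the relations that a PST holds in keyed tables, here is a small sketch using Python's standard `xml.etree.ElementTree`. The element and attribute names are invented for the example and are not the PeDALS schema.

```python
import xml.etree.ElementTree as ET


def message_to_xml(msg: dict) -> str:
    """Serialize one message, keeping recipients and attachments tied to it."""
    root = ET.Element("message", id=msg["id"])
    ET.SubElement(root, "subject").text = msg["subject"]

    # Rows from a recipient table become child elements of their message.
    recipients = ET.SubElement(root, "recipients")
    for r in msg["recipients"]:
        ET.SubElement(recipients, "recipient", type=r["type"]).text = r["name"]

    # Attachments keep their own identifier so the link back is explicit.
    attachments = ET.SubElement(root, "attachments")
    for a in msg["attachments"]:
        ET.SubElement(attachments, "attachment", ref=a["id"], filename=a["filename"])

    return ET.tostring(root, encoding="unicode")


xml_text = message_to_xml({
    "id": "m1",
    "subject": "Budget memo",
    "recipients": [{"type": "to", "name": "John Roe"}, {"type": "cc", "name": "Jane Doe"}],
    "attachments": [{"id": "a1", "filename": "budget.xls"}],
})
```

Because the dependencies are expressed by containment, the serialized message can be read and validated without the original database keys.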

• Did not use the full record
• Had almost no way to handle errors
• Tended to break when dealing with large PST files that had not been curated
• Required a copy of Outlook
• Ran very slowly

• In late February, Microsoft released the PST specification
• 203 pages of techspeak with some errors and inaccuracies
• Based on the spec, we’ve been developing a file-based tool that doesn’t require Outlook.

• Generates XML from the entire PST file
• Much improved exception handling
• Does not require Outlook
• Runs much more quickly

• File-based processor was slow to develop because of some errors in Microsoft’s documentation.
• Test on as many PST samples as possible. Don’t rely on small curated samples.
• Discovered differences between Unicode PST files and earlier ANSI-encoded files.
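One way a file-based tool might tell the two encodings apart is to read the version field in the file header. The sketch below is not the PeDALS implementation. It assumes the header layout as commonly described for the published PST format: a 4-byte `!BDN` magic value followed by a 2-byte little-endian version at offset 10, with 14 or 15 meaning ANSI and 23 or higher meaning Unicode. Those offsets and values should be checked against the specification before being relied on.

```python
import struct

PST_MAGIC = b"!BDN"        # assumed 4-byte signature at the start of the file
VERSION_OFFSET = 10        # assumed offset of the 2-byte version field


def pst_format(header: bytes) -> str:
    """Classify a PST header as ANSI or Unicode from its version field."""
    if len(header) < VERSION_OFFSET + 2 or header[:4] != PST_MAGIC:
        raise ValueError("not a recognizable PST header")
    (version,) = struct.unpack_from("<H", header, VERSION_OFFSET)
    if version in (14, 15):
        return "ANSI"
    if version >= 23:
        return "Unicode"
    return f"unknown (version {version})"


def pst_format_of_file(path: str) -> str:
    # Only the first few bytes are needed, so large files are cheap to check.
    with open(path, "rb") as handle:
        return pst_format(handle.read(VERSION_OFFSET + 2))
```

Running a check like this over every sample before processing makes it easy to confirm that a test set covers both encodings.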

• PSTs are not an automatic occurrence in Outlook 2010
• But they can be generated manually and can remain part of a scheduled retention routine