ELECTRONIC RECORDS PRESERVATION: ARCHIVES OF MICHIGAN


PAT PROJECT
- NHPRC grant to the San Diego Supercomputer Center (SDSC)
- Goal: Conduct case studies that test the ability to implement SDSC's Storage Resource Broker (SRB) data grid technology using a variety of archival collections.
- Test a community model for electronic records management, with archival and technological functions practically and appropriately allocated in a distributed network.

ARCHIVES OF MICHIGAN: PRESERVATION CASE STUDY
- Michigan Department of State, Bureau of Elections
- Precinct-level election results
- Paper version
- Database version
- 1972 – 2004
- Goal: Enable access to 30+ years' worth of electronic elections data and preserve it for the future.

MICHIGAN’S ELECTIONS STRUCTURE
- Decentralized administration
  - City Clerks (274)
  - Village Clerks (271)
  - Township Clerks (1,242)
- Precinct maps are created and maintained by local clerks
- Vote counts are sent to the Bureau of Elections

CURRENTLY, ELECTION RESULTS DATING BACK TO 1996 ARE AVAILABLE ONLINE. AT ONE POINT IN TIME, RESULTS WERE AVAILABLE ONLINE DATING BACK TO 1994.

DATA TRANSFER
- open reel magnetic tapes (1972 – 1992)
- 3 floppy disks (1992 – 1996)
- 2 compact discs (1998 – 2000)*
- compact disc (2002)
- compact disc (2004)
- compact disc (2006)
- compact disc (2008)
- compact disc (2010)
- compact disc (2012)
*Mainframe was used until

OTHER CUSTODIANS
- University of Michigan, Inter-University Consortium for Political and Social Research (ICPSR)
  - 1972 general election
  - 1974 general election
  - 1978 primary election
  - 1978 general election
- Michigan Center for Geographic Information
  - 1984 – present
- Michigan Manual (publication)
  - county level data

MICHIGAN’S APPRAISAL QUESTIONS
- What value do the electronic records (ER) have if the paper is already preserved?
- If the Department of State, Bureau of Elections makes the records available electronically, why does the Archives of Michigan need to do it?
- Will researchers use the data?

STEP #1: DATA RECOVERY
- Michigan Department of Community Health magnetic tape machine: broke
- Muller Media: ~$50 per tape (completed September 2005)
- Load data into the Storage Resource Broker (SRB)
- Replicate data on Michigan grid brick and 2 SDSC grid bricks
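The slide does not say how the three replicas were checked against the recovered data. As a minimal sketch of one common approach, the Python below compares a SHA-256 digest of the original with a digest of each copy. The function names, replica labels, and sample bytes are hypothetical, and the code does not use or imitate any SRB interface.

```python
import hashlib


def sha256_of(data: bytes) -> str:
    """Return the hex SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def verify_replicas(original: bytes, replicas: dict[str, bytes]) -> dict[str, bool]:
    """Report, per replica, whether its digest matches the original's digest."""
    expected = sha256_of(original)
    return {name: sha256_of(copy) == expected for name, copy in replicas.items()}


# Hypothetical stand-ins for a recovered tape file and its three copies.
original = b"hypothetical tape image contents"
replicas = {
    "michigan_brick": original,
    "sdsc_brick_1": bytes(original),
    "sdsc_brick_2": original + b"\x00",  # simulated corruption
}
report = verify_replicas(original, replicas)
```

A replica that reports False would be re-copied from a copy that still matches.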

STEP #2: CREATING USABLE DATA
- Work backwards: start with the most recent data first
- SDSC loaded a copy of the data into a search interface (mimics the Bureau of Elections)
- SDSC mapped the data to the codes in the metadata files to translate the data into meaningful information
- SDSC validated the data and recorded anomalies
- Data errors were compared with the paper records
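The mapping and validation steps above can be pictured as a lookup against a code table, with unmatched rows set aside as anomalies. The sketch below is illustrative only: the column names, code values, and sample rows are assumptions, not the Bureau of Elections' actual file layout or SDSC's actual procedure.

```python
import csv
import io

# Hypothetical code file and vote file; the real layouts are not shown in the slides.
CODES_CSV = "office_code,office_name\n01,President\n02,Governor\n"
VOTES_CSV = (
    "year,office_code,county,votes\n"
    "2004,01,County A,1200\n"
    "2004,09,County A,15\n"  # code 09 is missing from the code file
)


def load_codes(text: str) -> dict[str, str]:
    """Build a code -> label lookup from the code file."""
    return {row["office_code"]: row["office_name"]
            for row in csv.DictReader(io.StringIO(text))}


def translate(votes_text: str, codes: dict[str, str]):
    """Attach labels to vote rows; rows with unknown codes are recorded as anomalies."""
    translated, anomalies = [], []
    for row in csv.DictReader(io.StringIO(votes_text)):
        label = codes.get(row["office_code"])
        if label is None:
            anomalies.append(row)
        else:
            translated.append({**row, "office_name": label})
    return translated, anomalies


translated, anomalies = translate(VOTES_CSV, load_codes(CODES_CSV))
```

Rows that land in `anomalies` are the ones that would be checked against the paper records.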

DATA BEFORE

DATA AFTER

Clean the data
1. Delete duplicate data
2. Make the 1992, 1994, and 1996 data format consistent
Address anomalies
3. Solve duplicate key problem
4. Fix Detroit AVCB code problem
Map the data
5. Use codes and vote total files to produce 7 files like 1998 data
The databases had a different layout than the databases. SDSC had to resolve these differences so all of the databases could be loaded into the same search interface.
Original data / New data
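Steps 1 and 3 above (deleting duplicate data and finding duplicate keys) can be sketched as follows. This is a hedged illustration, not SDSC's procedure: the field names, the choice of key fields, and the sample records are all assumptions.

```python
from collections import defaultdict


def clean(records, key_fields):
    """Drop exact duplicate records, then group the rest by key to expose key collisions."""
    seen = set()
    unique = []
    for rec in records:
        fingerprint = tuple(sorted(rec.items()))  # whole-record identity
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique.append(rec)

    by_key = defaultdict(list)
    for rec in unique:
        by_key[tuple(rec[f] for f in key_fields)].append(rec)

    # Keys shared by records that differ elsewhere need manual resolution.
    conflicts = {key: recs for key, recs in by_key.items() if len(recs) > 1}
    return unique, conflicts


# Hypothetical precinct-level rows.
records = [
    {"year": 1996, "precinct": "001", "office": "01", "votes": 100},
    {"year": 1996, "precinct": "001", "office": "01", "votes": 100},  # exact duplicate
    {"year": 1996, "precinct": "001", "office": "01", "votes": 105},  # same key, different total
    {"year": 1996, "precinct": "002", "office": "01", "votes": 80},
]
unique, conflicts = clean(records, key_fields=("year", "precinct", "office"))
```

Exact duplicates can be dropped automatically, while key conflicts are only reported, since deciding which total is correct requires another source such as the paper records.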

DATA
- Code files that are needed to translate the data could not be located.
- Determined it was not cost-effective to manually translate the data.
- Decided to post the original code files online, in case a researcher wants to work with the raw data codes.

STEP #3: GIS INTERFACE
- Create statewide maps displaying the county level election results for each general election
- Identify blue and red counties for each race
- Identify voting trends
- SDSC mapped the data to geographic polygons supplied by CGI
- Races: President, Governor, US Senate, Secretary of State, Attorney General
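Identifying blue and red counties for a race comes down to totalling votes by county and party and comparing the two major-party totals. The sketch below assumes party labels "DEM" and "REP" and uses invented county names and vote counts. It shows only the classification step, not the GIS join to county polygons.

```python
from collections import defaultdict


def county_colors(rows):
    """Classify each county as 'blue', 'red', or 'tie' for a single race.

    rows: iterable of (county, party, votes) tuples.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for county, party, votes in rows:
        totals[county][party] += votes

    colors = {}
    for county, by_party in totals.items():
        dem = by_party.get("DEM", 0)
        rep = by_party.get("REP", 0)
        if dem > rep:
            colors[county] = "blue"
        elif rep > dem:
            colors[county] = "red"
        else:
            colors[county] = "tie"
    return colors


# Invented precinct-level rows for one race.
rows = [
    ("County A", "DEM", 500), ("County A", "REP", 450), ("County A", "DEM", 20),
    ("County B", "DEM", 300), ("County B", "REP", 700),
    ("County C", "DEM", 100), ("County C", "REP", 100),
]
colors = county_colors(rows)
```

Running the same function for each election year and comparing the resulting dictionaries is one simple way to surface voting trends.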

LESSONS LEARNED
- A formal submission agreement would have better documented the records upon accessioning
- Accessioning ER shortly after they are created protects the records from loss and media problems
- Avoid reactive appraisal; have a plan
- Data may be recoverable from obsolete media
- Metadata is essential for data translation

CARYN WOJCIK
State of Michigan
Records Management Services
(517)