Economic Data Time Travel Adrienne Brennecke September 30, 2011.

Presentation transcript:

Economic Data Time Travel Adrienne Brennecke September 30, 2011

New York Times Article

Quick demo

Value of this history
- Determine the accuracy of early estimates
- Evaluate policy decisions using information available at the time, not what is known in hindsight
- Let economists model the economy using data that were actually available

Value of this website
- Users can save data sets to their own account
- Share Published Data Lists
- Average about 3,000 unique visitors a month

History of the Project
- Why?
- How?
- Challenges?
- Technical details

Looking for revisions, and then solutions
- Former Research Director was looking for the economic data that were released originally, not the revised data
- We searched high and low...
  - Libraries removed news releases when the final version was published
  - Agencies historically wrote over the data, as the computing storage costs were high

Help from libraries
- Searched online catalogs for press releases
- Called documents librarians all over the country
- Contacted issuing agencies and the Library of Congress
- Depository libraries came through for us

Challenges
- How to design ALFRED to store revisions
  - See Developing Time-Oriented Database Applications in SQL
- Finding and verifying old data and release dates
- Early electronic information lost
- Underestimating amount of work involved
- Figuring out the best process, and dealing with changing workloads for staff

Technical details
- These data are saved only when there are revisions; each data value has three pieces of information
  - The time period it applies to (e.g., 2nd quarter 2011)
  - The time period it is true for (e.g., from July 30th to August 26th)
  - The date that the information was entered into the database, to allow for tracking of data entry errors
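A minimal sketch of the three-part record described on this slide, assuming an in-memory list rather than ALFRED's actual schema. The class name `Observation`, its field names, the helper `value_as_of`, and the numeric values are all hypothetical; only the three pieces of information and the example dates come from the slide.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Observation:
    period: str                # time period the value applies to, e.g. "2011-Q2"
    value: float               # illustrative number, not real data
    valid_from: date           # first day this value was the published figure
    valid_to: Optional[date]   # last day it was the published figure; None = still current
    entered_on: date           # when the row was entered, for tracing data entry errors


def value_as_of(rows, period, as_of):
    """Return the value for `period` as it stood on `as_of`, or None if not yet published."""
    for row in rows:
        if row.period != period:
            continue
        still_valid = row.valid_to is None or as_of <= row.valid_to
        if row.valid_from <= as_of and still_valid:
            return row.value
    return None


rows = [
    Observation("2011-Q2", 1.3, date(2011, 7, 30), date(2011, 8, 26), date(2011, 7, 30)),
    Observation("2011-Q2", 1.0, date(2011, 8, 27), None, date(2011, 8, 27)),
]

print(value_as_of(rows, "2011-Q2", date(2011, 8, 15)))  # the value in effect on August 15
print(value_as_of(rows, "2011-Q2", date(2011, 9, 30)))  # the later, revised value
```

Keeping the entry date separate from the validity period means a keying mistake can be traced without disturbing the record of what was published when.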

Technical details
- Underneath the hood, FRED and ALFRED are the same application.
  - ALFRED was populated by collecting historical data for series in FRED, and ALFRED continues to be extended by capturing "expiring" FRED values when new ones are published.
  - The coverage dates for data series are the same in both FRED and ALFRED
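The idea of capturing "expiring" values can be sketched as follows. This is an illustration under assumptions, not the FRED/ALFRED code: the function name `publish`, the dictionary keys, and the sample numbers are hypothetical. It closes the validity period of the old value on the day before the new release and stores a new row only when the value actually changed.

```python
from datetime import date, timedelta


def publish(history, period, new_value, release_date):
    """Record a newly published value while keeping the value it replaces.

    `history` is a list of dicts with keys period, value, valid_from, valid_to.
    valid_to is None while a value is still the current one.
    """
    for row in history:
        if row["period"] == period and row["valid_to"] is None:
            if row["value"] == new_value:
                return history  # no revision, so nothing new is saved
            # Expire the old value the day before the new one takes effect.
            row["valid_to"] = release_date - timedelta(days=1)
    history.append({
        "period": period,
        "value": new_value,
        "valid_from": release_date,
        "valid_to": None,
    })
    return history


history = []
publish(history, "2011-Q2", 1.3, date(2011, 7, 30))  # first release
publish(history, "2011-Q2", 1.3, date(2011, 8, 10))  # unchanged: ignored
publish(history, "2011-Q2", 1.0, date(2011, 8, 27))  # revision: old row expires
for row in history:
    print(row)
```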

Conclusion
- ALFRED shows revisions to a series and presents data as they were at a particular point in time
- Unique information, FREE and easily accessed
- Preserving important data for future research

FRASER: Federal Reserve Archival System for Economic Research

Technical Aspects of FRASER
- Variation on LAMP software bundle
  - Linux operating system
  - Apache web server
  - PostgreSQL database (rather than the more common MySQL)
  - PHP programming

Google search appliance
- Metadata plus full text (OCR)
- Basic and advanced search options available
- Standard Google search functions, plus a couple filters unique to FRASER

Topic Collections
Special/Archival Collections

Publications
- Originally, data publications
- Now include various types of serials and monographs
- Statistical releases
- Available issues, arranged by date
- Bibliographic information

Historical Documents
- Based on categories
- Originally “non-data” publications
- Documents
- Categories

Special Collections

Page Stacking
- Purpose:
  - View a single data series over time
- Solution:
  - Grouped page files
  - PDFlib+PDI
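The grouping step behind page stacking might look like the sketch below. It only shows how page files could be collected per table in issue-date order; the actual PDF assembly with PDFlib+PDI is not shown. The function name `stack_pages`, the tuple layout, and the sample table names and file names are hypothetical.

```python
from collections import defaultdict
from datetime import date


def stack_pages(pages):
    """Group page files by table name, ordered by issue date.

    `pages` is an iterable of (issue_date, table_name, page_file) tuples.
    Returns {table_name: [page_file, ...]} with the oldest issue first.
    """
    stacks = defaultdict(list)
    for issue_date, table_name, page_file in sorted(pages):
        stacks[table_name].append(page_file)
    return dict(stacks)


pages = [
    (date(1950, 2, 1), "Table A", "1950-02_p3.pdf"),
    (date(1950, 1, 1), "Table A", "1950-01_p3.pdf"),
    (date(1950, 1, 1), "Table B", "1950-01_p4.pdf"),
]

stacks = stack_pages(pages)
print(stacks["Table A"])  # the same table's page from each issue, oldest first
```

Each resulting list is the set of pages a reader would flip through to follow one table across issues.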

Personnel
- Center for Economic Documents Digitization (CEDD) consists of
  - 1 manager
  - 1 librarian
  - 5 part-time scanning clerks
- Additional support from
  - Web group
  - Library director

Digitization Process
- Selection and preparation
  - Review paper documents & establish scanning procedures
- Scan
  - Additional review, page by page
- Quality check (QC)
  - This is done by a person other than the scanner
- Clean scanned image
  - Process varies based on project
- Create PDF
- OCR
- Add metadata
- QC (brief)
- Transfer to server
  - This must be done by one of the two librarians
- Post to FRASER
  - Items can be posted as publications, historical documents, or special collections, each with its own interface and metadata options
- Add link to catalog record and OCLC record
  - This is done by the library’s cataloger, outside of the CEDD
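The two staffing rules in this process (QC by someone other than the scanner, transfer by a librarian) lend themselves to a simple check on a job log. The sketch below is an assumption about how such a check could be written, not a description of CEDD's tracking tools; the step identifiers, the function `check_log`, and the staff names are hypothetical.

```python
# Step order as listed on the slide; the identifiers themselves are made up.
STEPS = [
    "select_and_prepare", "scan", "quality_check", "clean", "create_pdf",
    "ocr", "add_metadata", "brief_qc", "transfer_to_server",
    "post_to_fraser", "link_catalog_record",
]


def check_log(log, librarians):
    """Return a list of problems found in a job log.

    `log` is a list of (step, person) pairs in the order the work was done.
    `librarians` is the set of people allowed to transfer files to the server.
    """
    problems = []
    done = [step for step, _ in log]
    if done != STEPS[:len(done)]:
        problems.append("steps are out of order or skipped")
    who = dict(log)
    if "quality_check" in who and who["quality_check"] == who.get("scan"):
        problems.append("quality check must be done by someone other than the scanner")
    if "transfer_to_server" in who and who["transfer_to_server"] not in librarians:
        problems.append("transfer to server must be done by a librarian")
    return problems


log = [
    ("select_and_prepare", "librarian_1"),
    ("scan", "clerk_a"),
    ("quality_check", "clerk_a"),  # same person as the scanner: flagged
]
print(check_log(log, librarians={"librarian_1", "librarian_2"}))
```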

Locating Paper Copies
- We scan documents from
  - Our own library collection, and other Fed libraries
  - FDLP Needs and Offers lists
  - Interlibrary loan
  - Partner institutions
- But…
  - As we digitize, libraries throw out paper copies

Copyright
- We focus on public domain materials
  - Federal Reserve Bank publications
    - Not technically public domain, but we have an agreement to digitize
  - Federal Government publications
  - Pre-1923 publications

Hardware and Software
Hardware
- Automatic Document Feeder (ADF)
  - 3 Fujitsu fi-5650C
  - 2 Fujitsu fi-6670 (newer model)
- Overhead/planetary scanner
  - 1 Indus Color Book Scanner 5002
- Flatbed scanner
  - 1 Epson Expressions Graphic Arts 10000XL
Software
- ImageWare BCS-2
  - Indus scanning
- Techsoft PixEdit7
  - Fujitsu and Epson scanning, and all cleaning
- ABBYY FineReader 10.0
  - OCR
- Adobe Acrobat 9 Pro
  - Metadata
Also:
- Microsoft Access 2007
  - Metadata and tracking purposes for some larger collections
- PDF Summary Maker
  - Embedding metadata from Access into PDFs

Image/text areas as recognized by OCR software
- Green = text
- Blue = table
- Red = picture
Text recognized by OCR software
- Blue = uncertain character(s)

Data Entry
- Web-based forms for data entry
- Here: setting up the overall publication (library catalog-level metadata)

Data Entry
- Issue-level metadata
  - Issue date
  - Issue title (text-formatted date, or other title)
  - Attach PDF
  - Enter table names and page titles for the page stacking described earlier

Data Entry
- Historical and Special Collection documents have both publication- and issue-level metadata
- Special Collection Document

Output
- 3 image files
  - Original multipage TIFF
  - Cleaned multipage TIFF
  - PDF
- 3 types of text/metadata
  - Underlying text in PDF (OCR)
  - Title and author embedded in PDF
  - Other metadata entered in database when posting

Contact Us
- Adrienne Brennecke, Data Acquisitions, Reference Librarian: alfred.stlouisfed.org/
- Pamela Campbell, Digital Projects Librarian: fraser.stlouisfed.org/