Web Archiving at the Innsbruck Newspaper Archive Innsbrucker Zeitungsarchiv / IZA Presentation by Renate Giacomuzzi, Elisabeth Sporer, Armin Schleicher.

Presentation transcript:

Web Archiving at the Innsbruck Newspaper Archive Innsbrucker Zeitungsarchiv / IZA Presentation by Renate Giacomuzzi, Elisabeth Sporer, Armin Schleicher Web Archiving Meeting, Innsbruck 2013

The Innsbruck Newspaper Archive and Research Center for Literary Criticism and Dissemination of Literature Over one million digitized articles (book and theater reviews, interviews, portraits etc.). URL:

Getting started with Web Archiving: Pilot Project URL:

Database and archive for 147 literary magazines; 70 permissions for open or restricted access. Technical tool: Web Curator

2011: A new collection and a new partner

Our collections on the Archive-It collection page

Our new homepage (coming soon)

Problems arising from collecting literary magazines: a huge number of documents

Keeping the material clean from external seeds (example: not allowed)

Blocked access to videos because of copyright problems

Our archive website for the collection of author homepages

Author metadata

Website metadata

Metadata Detail

Archive View

Our Archiving Process: Harvesting -> Processing -> Presenting

Our Archiving Process: Harvesting
- Currently done by the Internet Archive
- Managed using the Archive-It web application
Workflow: Add seed -> Configure -> Run test crawl -> Analyze reports -> OK? -> Harvest
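
The harvesting workflow on this slide reads as a loop: configure, test, check the report, and harvest only once the report looks acceptable. A minimal Java sketch of that loop follows. Every name in it (HarvestWorkflowSketch, CrawlReport, TestCrawler, the report fields) is hypothetical, and it does not call any real Archive-It interface; in practice these steps are carried out by hand in the Archive-It web application.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

/** Hypothetical sketch of the test-crawl loop; it does not call any real Archive-It API. */
final class HarvestWorkflowSketch {

    /** Summary of one test crawl report (illustrative fields only). */
    record CrawlReport(String seed, int documents, int outOfScopeHosts) {}

    /** Stand-in for a crawl run; assumed here purely for illustration. */
    interface TestCrawler {
        CrawlReport testCrawl(String seed, int attempt);
    }

    /**
     * Repeats configure -> test crawl -> analyze until the report is acceptable
     * or maxAttempts is reached. Returns a log of the steps taken.
     */
    static List<String> run(String seed, TestCrawler crawler,
                            Predicate<CrawlReport> acceptable, int maxAttempts) {
        List<String> log = new ArrayList<>();
        log.add("add seed " + seed);
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            log.add("configure (attempt " + attempt + ")");
            CrawlReport report = crawler.testCrawl(seed, attempt);
            log.add("analyze report: " + report.documents() + " docs, "
                    + report.outOfScopeHosts() + " out-of-scope hosts");
            if (acceptable.test(report)) {
                log.add("harvest");
                return log;
            }
        }
        log.add("give up: report still not acceptable");
        return log;
    }

    public static void main(String[] args) {
        // Fake crawler: the first attempt leaks to external hosts, the second is clean.
        TestCrawler fake = (seed, attempt) ->
                new CrawlReport(seed, 100, attempt == 1 ? 5 : 0);
        run("http://example.org/", fake, r -> r.outOfScopeHosts() == 0, 3)
                .forEach(System.out::println);
    }
}
```

The acceptance check is passed in as a predicate because what counts as "OK?" (document count, external hosts, blocked media) is a curatorial decision the slides do not specify.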

Our Archiving Process: Processing
- Custom importer application written in pure Java
- Highly configurable (filters for indexing etc.)
- Controlled via the archive web app (in development)
Steps: Copy new files -> Create index -> Update DB -> Update Wayback/Solr
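
The processing stage on this slide is a fixed sequence of steps driven by configurable filters. The sketch below shows one way such a pipeline might be organised in plain Java. It is not the IZA importer: the class and method names, the use of file names instead of real files, and the ".warc.gz" filter are all assumptions made for illustration, and the four steps are placeholders that do no real work.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;

/** Hypothetical importer sketch; all names and the file-name filter are assumptions. */
final class ImporterSketch {

    /** One pipeline step; a real step would touch the file system, index, DB, Wayback, or Solr. */
    interface Step {
        String name();
        void apply(List<String> files);
    }

    /** Selects harvested files that pass the configured filter and were not imported before. */
    static List<String> selectNewFiles(List<String> harvested, Set<String> alreadyImported,
                                       Predicate<String> filter) {
        List<String> selected = new ArrayList<>();
        for (String file : harvested) {
            if (filter.test(file) && !alreadyImported.contains(file)) {
                selected.add(file);
            }
        }
        return selected;
    }

    /** Runs the steps in order and returns the names of the steps that were executed. */
    static List<String> runPipeline(List<String> newFiles, List<Step> steps) {
        List<String> executed = new ArrayList<>();
        if (newFiles.isEmpty()) {
            return executed; // nothing new to import, so no step runs
        }
        for (Step step : steps) {
            step.apply(newFiles);
            executed.add(step.name());
        }
        return executed;
    }

    /** Builds a placeholder step that only carries a name. */
    static Step placeholder(String name) {
        return new Step() {
            @Override public String name() { return name; }
            @Override public void apply(List<String> files) { /* intentionally empty */ }
        };
    }

    public static void main(String[] args) {
        List<String> harvested = List.of("crawl-1.warc.gz", "crawl-2.warc.gz", "notes.txt");
        Set<String> alreadyImported = Set.of("crawl-1.warc.gz");
        // Assumed filter: only files whose name ends in .warc.gz are imported.
        List<String> newFiles = selectNewFiles(harvested, alreadyImported,
                f -> f.endsWith(".warc.gz"));
        List<Step> steps = List.of(placeholder("copy new files"), placeholder("create index"),
                placeholder("update DB"), placeholder("update Wayback/Solr"));
        System.out.println(newFiles + " -> " + runPipeline(newFiles, steps));
    }
}
```

Keeping the filter as a predicate and the steps as a list mirrors the "highly configurable" claim: either can be swapped without touching the loop that runs them.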

Our Archiving Process: Presentation
- Web application built using JSF 2
- Modular design for easy upgrading
- Integrates open-source projects such as IA Wayback and Apache Solr
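
Integrating Wayback into a web application largely comes down to linking to replay URLs. Wayback addresses a capture by a 14-digit timestamp (yyyyMMddHHmmss) followed by the original URL. A small Java sketch of building such a link is shown here; the class name, the prefix http://localhost:8080/wayback, and the example capture are assumptions, not details of the IZA installation.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

/** Hypothetical helper for building Wayback-style replay links. */
final class WaybackLinkSketch {

    // Wayback capture timestamps are 14 digits: year, month, day, hour, minute, second.
    private static final DateTimeFormatter TIMESTAMP =
            DateTimeFormatter.ofPattern("yyyyMMddHHmmss");

    /** Joins the replay prefix, the capture timestamp, and the original URL. */
    static String replayUrl(String waybackPrefix, LocalDateTime captureTime, String originalUrl) {
        String prefix = waybackPrefix.endsWith("/") ? waybackPrefix : waybackPrefix + "/";
        return prefix + TIMESTAMP.format(captureTime) + "/" + originalUrl;
    }

    public static void main(String[] args) {
        // The prefix is an assumed local deployment, not the IZA server.
        System.out.println(replayUrl("http://localhost:8080/wayback",
                LocalDateTime.of(2013, 5, 1, 12, 0, 0), "http://example.org/"));
    }
}
```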

Our Archiving Process: Presentation features
- Page-text search (full text)
- Metadata search (full text)
- Metadata detail view and chronological view of harvests
- Faceted search
- Role management
- Admin section providing tight content control
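
Full-text and faceted search of the kind listed on this slide can be expressed as a single request to Solr's select handler, using the standard q, facet, and facet.field parameters. The Java sketch below only builds the request URL. The core URL, the field names (text, author, harvest_year), and the class name are assumptions; the slides do not describe the actual schema.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.List;
import java.util.StringJoiner;

/** Hypothetical builder for a faceted Solr select request; field names are assumed. */
final class SolrFacetQuerySketch {

    /** Builds a select URL with a main query and one facet.field parameter per facet. */
    static String selectUrl(String coreUrl, String query, List<String> facetFields) {
        StringJoiner params = new StringJoiner("&");
        params.add("q=" + encode(query));
        params.add("facet=true");
        for (String field : facetFields) {
            params.add("facet.field=" + encode(field));
        }
        return coreUrl + "/select?" + params;
    }

    private static String encode(String value) {
        return URLEncoder.encode(value, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Assumed local core and schema fields, for illustration only.
        System.out.println(selectUrl("http://localhost:8983/solr/archive",
                "text:lyrik", List.of("author", "harvest_year")));
    }
}
```

Solr returns the facet counts alongside the matching documents, which is what lets a results page offer narrowing links (for example by author or harvest year) without a second query.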

Our Archiving Process: Current issues
- Access control
- Integration of multimedia content
- Social media
