MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.

Slides:



Advertisements
Similar presentations
Business Development Suit Presented by Thomas Mathews.
Advertisements

USING WORDPRESS. WEEK 1 1.Why WP? 2.Setting Up WP 3.Exploring the Admin screen 4.Page Organization 5.Posting 6.Polls.
1 Minerva The Web Preservation Project. 2 Team Members Library of Congress Roger Adkins Cassy Ammen Allene Hayes Melissa Levine Diane Kresh Jane Mandelbaum.
Web archiving at the NLA ‘ Archiving the music web’ Music Council of Australia Annual Assembly 28 September 2009 Paul Koerbin Manager Digital Archiving.
New School Websites Teacher Pages. Visit the SCUSD Website for videos tutorials: For more information.
The capture and preservation of websites at the National Library of New Zealand Gillian Lee Alexander Turnbull Library.
1 Archive-It Training University of Maryland July 12, 2007.
8/16/2015 Search Engine Optimization (SEO). Keyword Research After closely monitoring the competitors we have come up with the business keywords that.
Washington State Archives Presented by: Leslie Koziara Electronic Records Management Consultant Part 1: Managing Your Records.
The Digital Motion Picture Archive Framework Project © 2008 AMPAS Academy of Motion Picture Arts and Sciences Science and Technology Council Nancy Silver,
Bibliography in the Digital Age - IFLA Satellite Meeting Warsaw, 9 August Online materials published in Austria collecting, archiving and metadata.
The attic & the parlor CHM collections & exhibitions overview May 5, 2006 Kirsten Tashev VP Collections & Exhibitions.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
9/10/2015 What’s New? Edline at Valley View!! Joyce Potempa Technology Department presentation to Building Support Staff February 2, 2010 Institute Day.
The SAU Website Workshop. Using the site Website Management The Campus Directory Form Manager Other available resources.
1 © Netskills Quality Internet Training, University of Newcastle Metadata Explained © Netskills, Quality Internet Training.
System for Administration, Training, and Educational Resources for NASA SATERN Overview for Learners May 2006.
Reliable Sources Six questions to ask to determine the trustworthiness of an internet source.
WHS joined Archive-It in the fall of 2010 Began capturing state information with the capture of Governor Jim Doyle’s websites at the end of the administration.
Cataloging and Metadata at the University Library.
The web has revolutionized our access to information. Documents and publications that were once difficult to fin are now readily available to anyone. Government.
Do You Have a Web Site?. Everyone does, don’t they?
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
Chapter 5: Online Orchestration: Establishing an Effective Web Presence Presentation Given To You By: Julie Bertoni Erin Farmer Katelyn Maroney Cody Fish.
Research Data Management Victoria University Context Lyle Winton Adrian Gallagher Julie Gardner.
WAS to Archive-It Metadata Migration March 11, 2015.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
System for Administration, Training, and Educational Resources for NASA SATERN Overview for Users December 2009.
Library Orientation Review By: Mrs. Sanderman Databases There are four main Database Providers that Alter subscribes to: Facts on File INFOhio ProQuest.
Foxbright – Smarter Education Websiteswww.foxbright.com Foxbright Training Foxbright Teacher Pages
10 Reputation Management Tips for Your Local Business Presented by: Your Name
How do I search the Internet? Narrow your topic and its description; pull out key words and categories.
Metadata for the Web Andy Powell UKOLN University of Bath
Blogs, Wikis and Podcasting  By Zach, Andrew and Sam.
Washington State Archives Presented by: Russell Wood – State Records Manager Julie Woods – Local Government Records Specialist Managing Your Records Public.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
Jonathan H. Harwell Collection Development & Assessment Librarian Georgia Southern University
CharMeck.org Contributer Training SharePoint 2013 Orientation and Basic Training.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
ALA Annual Meeting Claire Cocco Global Product Manager CONTENTdm Users Group June 30th, 2008.
DESIGN AND DEVELOPMENT OF NOAA VIRTUAL LIBRARIES: THE INTERSECTION OF TRADITIONAL LIBRARY KNOWLEDGE AND CUTTING EDGE INFORMATION TECHNOLOGIES Dottie Anderson.
An Application Profile and Prototype Metadata Management System for Licensed Electronic Resources Adam Chandler Information Technology Librarian Central.
Parent Information Session Welcome! Field School Online Databases & Subscriptions.
The Big 6 Model for Effective Research While Researching specific topics and how they work you will be using the Big 6 Model for Effective Research to.
Lunchtime Byte Carys Morgan – Hazel Thomas. Carys Morgan Office Manager and People's Collection Wales Officer responsible for Editorial and Content.
Schoolwires How to modify your classroom webpage.
Financial Management of ECE Programs.  Go to “Tools”  Click on “Personal Information” to edit your personal information (including address) or.
HOW TO SET UP A WEBSITE. Why use WordPress? Nearly half of the websites on the Internet are running on the WordPress website platform It’s totally free.
Using Artstor Digital Library for Image-Based Research and Study.
Moshe Shechter | Alma Product Manager
Archiving & Preserving Digital Content
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Omeka Web-Publishing Platform
Databases vs the Internet
Best Practices for LTER Site Websites
Search Engine Optimization (SEO)
University Career Services Committee
After this course you will be able to:
Creating Web Collections with Archive-It
Challenges and Opportunities of Archiving the UK Web
Video Retention and Metadata Guidelines CIO Council Update
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Metadata to fit your needs... How much is too much?
Health On-Line Patient Education Web Site
Márton Németh – László Drótos How to catalogue a web archive?
Navigating the Thinkfinity.org
Brand Yourself and Promote Your Business in Play Therapy
Presentation transcript:

MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became MSU.  That was also our centennial year.  The entrance is unknown.

Web Archiving @ MSU Ed Busch March 14, 2014

Overview What We Did What We Learned What Are We Doing Now Suggestions

What We Did Our Goal: To “preserve and make accessible” MSU web sites of enduring historical and research value Almost every office and unit on campus has a web site with business information Content that isn’t preserved anywhere else Integral to mission of MSU This goal is what is driving our web archiving. Many of our campus publications are only on the web now as pdfs or html

What We Did Inventory of MSU related web sites (early 2011) Top level domains = approx. 1,300 sites External domains = approx. 190 sites e.g. coachizzo.com or spartancash.com Trial ran “snapshots” of msu.edu using Archive-It Huge number of pages Example, there were over 3.6 million PDF files just within msu.edu at that time Numbers from “host master” at ATS Network Management Services (Doug Nelson) Probably more domains and pages now. Many units have started blogs using site such as wordpress. Highlighted vocabulary differences between archivists and IT professionals Many MSU affiliated sites outside msu.edu domain Much of the content on web sites is new; not available in print or other media formats Many sites have password protected content Many sites have dynamic content and updated frequently

What We Did Used list of known MSU websites from IT Created 3 large collections and 2 smaller special collections Administration and Services; Colleges, Schools, Research Centers & Institutes; and Student Organizations and Groups Topical Events Web Sites; Decommissioned MSU Web Sites Added Landing Page to our web site Updated Retention Schedule to include web sites MSU Publications Created Web Site Collection Plan Added Metadata at collection level Identified crawl schedule Now have over 700 seeds assigned to collections. Because of subscription constraints, have to keep some inactive Our current retention schedule is online. A new retention schedule should be coming out in 2014 Draft of collection plan available online Always test crawls first

Archive-it.org

What We Learned Once you create a collection, you can’t split or combine easily What’s the best collection creation strategy- to lump or not? I’ve started splitting collections into smaller Collections by moving seeds Archive-It investigating adding a combine function Pluses and minuses to lump: leaning towards recommending smaller sized clumps What is useful metadata?

What We Learned Our New Collections Michigan State University Libraries Collection MSU Administration and Services Collection MSU Alumni and Fan Sites Collection MSU Athletics Collection MSU Colleges, Schools, Research Centers & Institutes Collection MSU Employee Unions Collection MSU Related News Publications Collection MSU Social Media Collection MSU Sponsored Projects Collection MSU Student Organizations and Groups Collection MSU Topical Events and Subjects Web Sites Collection MSU Arts and Culture Collection

What We Learned Some sites are just difficult to crawl – recursive issues Using regular expressions and constraints – Archive-It staff very helpful Lots of test runs – takes time Creating useful metadata Archive-It provides 15 Dublin Core fields Collection – title, creator, subjects, description, publisher, contributor, type, format, source, relation, coverage, rights, collector, language Seed – title

What We Learned Web Archiving requires more staff time than expected Websites are being created or modified every day New functionality often causes problems in next crawl Run Test crawls! Now have over 700 seeds assigned to collections. I have deactivated most of my scheduled runs so that I can do test crawls first. Maybe a feature wo9ld be to automatically do a test crawl x days before scheduled.

What Are We Doing Now Quality check Social Media sites Historical Collection sites Can an old dog learn regular expressions? On-demand sync can be done by their staff Get help from units to point out problems To find on Worldcat, search by collection name and institution. Not sure how useful this will be for lumped collections

Suggestions Plan Start small Get the word out to site creators What do you need to capture? How much time do you have? Can you afford Archive-It or need to use “free” tool? Start small Get the word out to site creators

Contact Ed Busch Electronic Records Archivist buschedw@msu.edu

Surveying class photo:  Taken in 1885, which was the beginning of the engineering course, the second major offered at the college.  The students are all juniors or seniors.  At this point in college history, there was no women’s course, so the women were taking the same courses as the men.