Introduction to Digital Libraries
Assignment #2

Old Dominion University
Department of Computer Science
CS 695 Fall 2003
Michael L. Nelson <mln@cs.odu.edu>
10/14/04

Writing A Robot

Build a web robot that harvests metadata from: http://trs.nis.nasa.gov/

Restrictions
- use your own code, not an existing robot
- it must work from the command line
- have it take N arguments, which represent the years to harvest from
  e.g.: % robot.pl 1984 1999 2003
- it must visually display progress of harvesting the metadata and storing it locally

In class demo, Oct 21 2004

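The command-line restrictions (year arguments, visible progress) might be sketched as below. This is only an illustration in Python rather than the Perl implied by `robot.pl`; the function names `parse_years`, `progress_line`, and `main`, the four-digit check, the bar layout, and the fixed record count are all assumptions, and the actual fetching and storing is left out.

```python
import sys

def parse_years(argv):
    """Turn command-line arguments into a sorted list of unique years."""
    years = []
    for arg in argv:
        # Assumption: a year is exactly four decimal digits, e.g. 1984.
        if not (arg.isdecimal() and len(arg) == 4):
            raise ValueError(f"not a four-digit year: {arg!r}")
        years.append(int(arg))
    return sorted(set(years))

def progress_line(year, done, total, width=20):
    """Format one text progress bar line for a given year."""
    filled = width * done // total if total else width
    bar = "#" * filled + "-" * (width - filled)
    return f"{year} [{bar}] {done}/{total} records"

def main(argv):
    """Hypothetical driver: loop over the years and redraw the progress bar."""
    try:
        years = parse_years(argv)
    except ValueError as err:
        print(f"usage: robot.py YEAR [YEAR ...] ({err})", file=sys.stderr)
        return 2
    for year in years:
        total = 5  # placeholder; a real robot would count the records it finds
        for done in range(1, total + 1):
            # A real robot would fetch and store one record here.
            print("\r" + progress_line(year, done, total), end="", flush=True)
        print()  # finish the line for this year
    return 0
```

A script built on this sketch would end with `sys.exit(main(sys.argv[1:]))` so that `% robot.py 1984 1999 2003` harvests those three years in order.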
Assumptions
- This is not a general robot; it only has to work on this one site
- You don’t have to harvest the images or PDFs; only the metadata
- Save the metadata as DC (Dublin Core) files

Might be of interest:
- www.robotstxt.org
- “lynx”
- libWWW
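Saving a harvested record as a Dublin Core file could look roughly like the sketch below. It assumes the robot has already parsed a record into a dictionary; the `metadata` wrapper element, the function names, and the sample field values are hypothetical, and only the Dublin Core element namespace is standard.

```python
import xml.etree.ElementTree as ET

# Namespace of the 15 simple Dublin Core elements (title, creator, date, ...).
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)

def to_dublin_core(record):
    """Build an XML tree of dc:* elements from a dict of field -> value(s)."""
    root = ET.Element("metadata")  # assumed wrapper element, not part of DC
    for field, values in record.items():
        if isinstance(values, str):
            values = [values]  # allow a single string or a list of strings
        for value in values:
            child = ET.SubElement(root, f"{{{DC_NS}}}{field}")
            child.text = value
    return root

def save_record(record, path):
    """Write one record to a local XML file."""
    tree = ET.ElementTree(to_dublin_core(record))
    tree.write(path, encoding="utf-8", xml_declaration=True)

# Hypothetical record; real values would come from the harvested pages.
sample = {
    "title": "Sample report",
    "creator": ["A. Author", "B. Author"],
    "date": "1984",
}
```

Calling `save_record(sample, "record-0001.xml")` would write one file per record; the file naming scheme is likewise an assumption.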