Introduction to Digital Libraries Assignment #3 Old Dominion University Department of Computer Science CS 695 Fall 2003 Michael L. Nelson <mln@cs.odu.edu> 10/23/03
Writing A Robot Build a web robot that harvests metadata from: http://naca.larc.nasa.gov/ Restrictions use your own code, not an existing robot have it work from the command line have it take N arguments, which represent years from 1917..1958 e.g.: % robot.pl 1923 1949 1958 it must visually display progress of harvesting the metadata and storing it locally In class demo, Nov 6 2003
Assumptions This is not a general robot; it only has to work on this 1 site You don’t have to harvest the images or PDFs; only the metadata maintain the year/report # information however Just save the metadata locally other: use this url as a base: http://naca.larc.nasa.gov/reports/YEAR where YEAR is 1917..1958 investigate the use of “lynx” and/or libWWW