GPO Plans to Mine Deep Web Data of Federal Agency Sites

by Sabrina I. Pacifici on January 3, 2005

On December 30, 2004, GPO issued an RFP to “procure Web Harvesting Services.” From the Statement of Work:

  • “The U.S. Government Printing Office (GPO) requires the services of a vendor that can provide a number of different products and/or services related to the discovery, harvesting, and assessment of documents and publications from Web sites using Web crawler and data mining technologies. GPO is involved in a project that is attempting to discover and retrieve publications from Federal agency Web sites in order to identify publications that have not been cataloged by GPO but fall within the scope of the Federal Depository Library Program (FDLP) and the Cataloging and Indexing Program.”
  • Previous post:

    Next post: