Heritrix

Last updated January 18, 2012. Created by scott on January 18, 2012.
Log in to edit this page.

Heritrix is the Internet Archive's open-source, extensible, web-scale,
archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or
misspelled or missaid as heratrix/heritix/heretix/heratix) is an archaic word
for heiress (woman who inherits). Since our crawler seeks to collect and
preserve the digital artifacts of our culture for the benefit of future
researchers and generations, this name seemed apt.

Technology
License: 
Development Status: 
Operating System: 
Programming Language: 
Database: