Last updated on August 1st, 2017 at 01:55 pm
For research purposes, we provide access to internal technical data produced during the web archiving process.
Examples of technical data generated during the experimental crawl of the .EU domain that are available to researchers:
- Heritrix original crawl log (19.6 GB);
- Heritrix generated reports (21.5 MB);
- Analysis sheet generated using the Notebook Python library.
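
As an illustration of how a crawl log like the one listed above might be explored, here is a minimal Python sketch. It assumes the standard Heritrix `crawl.log` layout, in which each line holds whitespace-separated fields beginning with the timestamp, the fetch status code, the document size, and the URI. The function name `summarize_crawl_log` and the sample lines are hypothetical and are not taken from the .EU crawl data.

```python
from collections import Counter


def summarize_crawl_log(lines):
    """Count fetch status codes and sum document sizes from crawl log lines.

    Assumes the standard Heritrix crawl.log field order:
    timestamp, status code, size, URI, ...
    """
    status_counts = Counter()
    total_bytes = 0
    for line in lines:
        fields = line.split()
        # Skip blank or truncated lines.
        if len(fields) < 4:
            continue
        status, size = fields[1], fields[2]
        status_counts[status] += 1
        # The size field is "-" when nothing was downloaded.
        if size.isdigit():
            total_bytes += int(size)
    return status_counts, total_bytes


# Made-up sample lines for illustration only (not real crawl data).
sample = [
    "2017-01-01T00:00:00.000Z 200 1234 http://example.eu/ - - text/html #001 20170101000000000+12 sha1:AAAA - -",
    "2017-01-01T00:00:01.000Z 404 512 http://example.eu/missing L http://example.eu/ text/html #002 20170101000001000+8 sha1:BBBB - -",
    "2017-01-01T00:00:02.000Z -2 - http://down.example.eu/ L http://example.eu/ unknown #003 - - - -",
]

counts, total = summarize_crawl_log(sample)
print(counts)
print(total)
```

Because the function reads one line at a time, it could also be given an open file handle (for example, `open("crawl.log", encoding="utf-8", errors="replace")`, where the file name is an assumption), so a log of many gigabytes would not need to fit in memory.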