Arquivo.pt services and source code of developed projects.
- Search the past: search the archive and access pages of the past.
- Arquivo.pt API: enables automatic access to services so that new applications can be developed autonomously by third-parties.
Open-source projects
Examples of projects
- Search system developed by the Portuguese Web Archive based on the Archive-access project.
- Full-text query suggestions mechanism.
- Software to use the Portuguese Web Archive WAIR test collection (by Zeynep Pehlivan)
- Httrack2Arc Tool that converts Httrack crawls to ARC format.
- Roteiro2Arc Tool used to convert to ARC format the files in the CD-ROM of the book “Novo Roteiro Prático da Internet” por José Magalhães.
- rARC: collaborative preservation distributed system (project suspended)
Datasets for research
Other
- Map of web archiving initiatives worldwide (9th of March, 2012)
- Rejection filters used in the crawls