Last updated on October 28th, 2024 at 06:31 pm
Arquivo.pt is the official initiative for archiving the Portuguese web which started in 2007.
However, the original idea came in 2001 with a project called tumba! (Portuguese initials of “we have an alternative search engine!”), developed by the research group XLDB of the University of Lisbon and supported by FCCN.
Tumba! enabled term search over the last crawl of the Portuguese web only. Following tumba!, came the Tomba Portuguese web archive prototype, named after the Tombo Tower which is the Portuguese National Archive established in 1378. Tomba enables access to different versions of contents collected by tumba! for 4 years (2002-2006).
The initial team that worked on the Portuguese web archive project was composed of 3 former researchers of tumba!.
The know-how and experience gained from these academic projects were crucial to the development at FCCN. However, the Arquivo.pt was implemented using different technology.
This is our history, now we have the responsibility to preserve History.
Main milestones
- 2024: Arquivo.pt received the award for Best Central Public Administration Digital Project
- 2024: Arquivo.pt is finalist for the DPC Awards 2024
- 2023: Arquivo.pt in the top 3 of government services in Portugal
- 2022: Arquivo.pt was considered the best Digital Service of 2022
- 2022: Arquivo.pt enters the honor roll for security in Portugal according to the Portuguese Observatory for Internet Technologies
- 2022: Publication of derived open data sets, launch of SavePageNow and Arquivo404.
- 2021: Launch of the largest search service over web-archived images
- 2020: Arquivo.pt officially preserves the websites of national scientific projects
- 2019: Launch of Arquivo.pt Memorial
- 2018: First edition of the Arquivo.pt Award
- 2017: Start of the training program
- 2016: Improved quality of reproduction of preserved pages
- 2015: Transition from project to public service Arquivo.pt
- 2014: Robustified service
- 2013: Decree-Law mandating the preservation of content available on the national Internet by FCT-FCCN
- 2012: Launch of the 1st API to access Arquivo.pt
- 2011: Open source available
- 2010: Launch of the prototype of the search and access system
- 2009: Publication of recommendations for publishing preservable web content
- 2008: First crawl of the Portuguese web
- 2007: Start of the Portuguese Web Archive (AWP) project at FCCN
- 2006: First prototype of a Portuguese web archive (FCUL)
- 2001: Research project TUMBA! at FCUL