History

Origin of the Portuguese Web Archive project.

This project is the official initiative for archiving the Portuguese web and started in January 2008. However, the original idea dates back to 2001, with a project called tumba! (Portuguese initials of “we have an alternative search engine!”), developed by the XLDB research group of the University of Lisbon and supported by FCCN.

Tumba! enabled term search over only the most recent crawl of the Portuguese web. It was followed by the Tomba Portuguese web archive prototype, named after the Tombo Tower, the Portuguese National Archive established in 1378. Tomba enables access to different versions of the contents collected by tumba! over 4 years (2002-2006).

The initial team that worked on the Portuguese Web Archive project was composed of three former researchers of tumba!.

The know-how and experience gained from these academic projects were crucial to the development of the Portuguese Web Archive at FCCN. However, the Portuguese Web Archive was implemented using different technology.

This is our history; now we have the responsibility to preserve History.

Main milestones

  • 2016: Improved replay quality of the preserved web pages
  • 2014, 2015: Service enhancement and establishment
  • 2012: Launch of the search and access public service
  • 2010: Launch of the experimental prototype of the search and access service
  • 2008: First crawl of the Portuguese web
  • 2007: Launch of the Portuguese Web Archive at FCCN
  • 2006: First prototype of a web archive for the Portuguese web
  • 2001: Research project tumba! at FCUL