Technical report analyses query suggestion for Web archives

Last updated on September 29th, 2017 at 02:20 pm

Misspelled queries are a common problem in search engines and Web archives. This work is the result of the development and integration of a spell checking and query suggestion feature in the Portuguese Web Archive.

The integration of the developed query suggestion feature in the Portuguese Web Archive’s interface enables an improved user experience.

I. P. Santarém, 7th and 8th Feb.: learn more about the Portuguese Web Archive

Last updated on August 4th, 2024 at 06:16 pm

Come and meet the Archive’s team.

The Portuguese Web Archive will be presented at Jornadas FCCN on 7th and 8th of February 2012, with the following activities (in Portuguese):

Scientific study analyses worldwide Web archiving initiatives

Last updated on August 9th, 2024 at 03:08 pm

This research presents a global and updated overview of Web archiving initiatives. The analysis of the initiatives provided several statistics, such as the volume of archived data or the number of people engaged.

The paper A survey on web archiving initiatives, by Daniel Gomes, João Miranda and Miguel Costa, was presented at the International Conference on Theory and Practice of Digital Libraries 2011, in Berlin, Germany.

rARC service suspended

Last updated on September 29th, 2017 at 02:46 pm

The rARc project is suspended since July, 2011.

The project for collaborative preservation named rARC started in 2007 within the Portuguese Web Archive.

Contributions:

We thank all the contributors for their collaboration and support.

93% of the searches are answered in less than 5 seconds

Last updated on August 4th, 2024 at 06:20 pm

Data from April to June, 2011

  • 93% of the full-text searches performed on the Portuguese Web Archive were responded in less than 5 seconds.
  • 95% of the URL searches were answered in less than 5 seconds.
  • 73% of the user clicks are on the first page of results.
  • We wrote 72 000 lines of code to improve the original search system based on the Archive-access project.
  • Try our search!