rARC Windows 7 version now available

Last updated on August 6th, 2024 at 01:08 pm

rARC version for Windows 7 is now available.

rARC is a pioneering system being developed within the Portuguese Web Archive project. Its main goal is to let Internet users contribute storage space on their computers to help preserve web content for the future. Anyone can help preserve the web by providing a small amount of space to keep a backup of a small part of the archived data.

Please contact us if you encounter any problems.

Search over the past Web is available

Last updated on August 6th, 2024 at 01:43 pm

The Portuguese Web Archive released a service that enables search and access to web contents that are no longer available online.


This beta version of the service includes 130 million contents of the Portuguese web archived between 1996 and 2007.

It offers advanced search options, such as date range restrictions.

Please send us your comments and critiques. They are most welcome.

Search the past now!

2005 contents provided by the National Library can now be searched

Last updated on December 20th, 2019 at 03:41 pm

The contents provided by the National Library of Portugal were successfully integrated and can be searched through our experimental search system.

In 2005, the National Library of Portugal, in collaboration with INESC, conducted a series of web crawls to gather information related to the national elections.

This project was named RECOLHA and the data collected comprised over 14 million contents (165 GB).

In 2009, these data were supplied to the Portuguese Web Archive.

The data were successfully integrated and can now be searched through our experimental search system.

Note that searches are performed over all the archived data, regardless of origin, so you will not be able to single out the RECOLHA contents in the results.

Scientific study about Web accessibility conducted in collaboration with the Archive

Last updated on December 20th, 2019 at 03:44 pm

This research, conducted by the HCIM group from the University of Lisbon in collaboration with the Portuguese Web Archive, measures the accessibility of the Portuguese web for people with disabilities.

Scientific study presents a search log analysis of a search engine

Last updated on August 4th, 2024 at 06:00 pm

This research presents a characterization of the information-seeking behaviour of the users of a Portuguese web search engine, based on the analysis of its logs.

The paper "A Search Log Analysis of a Portuguese Web Search Engine", by Miguel Costa and Mário J. Silva, was presented at INForum 2010 – Simpósio de Informática, in Braga, Portugal.

Scientific study presents a characterization of the information needs of Web Archive users

Last updated on December 20th, 2019 at 03:58 pm

This research focuses on what users' intents are and which topics interest them most. Three instruments were used to collect quantitative and qualitative data: search logs, an online questionnaire and a laboratory study.

The paper "Understanding the Information Needs of Web Archive Users", by Miguel Costa and Mário J. Silva, was presented at the 10th International Web Archiving Workshop, in Vienna, Austria.

Web archivists: please answer 3 quick questions regarding resources committed to web archiving

Last updated on September 29th, 2017 at 03:10 pm

We are conducting a preliminary study of the effort being committed worldwide to web archiving.

If you are a web archivist, please send us your answers to these 3 quick questions:

  • What is the name of your web archive initiative (please state if you want to remain anonymous)?
  • How much data have you archived (number of files, disk space occupied)?
  • How many people work at your web archive (in person-months)?

Any additional comments are welcome.

Thank you very much.

Characterizing Search Behavior in Web Archives: we have published a video about a study we performed

Last updated on December 20th, 2019 at 04:03 pm

This talk characterizes the search behavior of web archive users, based on a scientific study performed by the Portuguese Web Archive.

The presentation focuses on the following points:

  • Web archiving
  • Search log analysis
  • How do users search?
  • Time dimension in searches

Find out more: