Searching in the Portuguese Web Archive
- What can I find through the Portuguese Web Archive?
- How can I search in the Archive?
- What does the date displayed for each archived file mean?
- Are there only pages archived after 1996?
- Can I increase the number of results on the results page?
- Why are there pages that do not appear in the results?
- Why are some searches slow?
- What is the criterion to sort the search results?
1. What can I find through the Portuguese Web Archive?
You can find historical content in several languages archived from the web since 1996. We hold mainly Portuguese sites but also international sites (see examples of pages preserved by Arquivo.pt).
If you hold historical web content, please supply them to us.
2. How can I search in the Archive?
You can search:
- by term, entering terms or expressions you want to find in the archived contents (e.g.: expo 98)
- by address, entering the content URL to access its history (e.g.: http://www.fccn.pt)
- using the advanced search, where you can choose other options, for instance, format or web site (e.g.: PDF files at Público newspaper site)
3. What does the date displayed for each archived file mean?
It is the content archiving date.
Though it would be interesting to present the publication date, most of the contents on the Web do not provide this information. Therefore, it is only possible to present accurately the date on which a content was archived.
4. Are there only pages archived after 1996?
We perform periodic crawls since January 2008. The only way to preserve contents published prior to that, is gathering them from external entities that have saved them over time.
5. Can I increase the number of results on the results page?
In the advanced search page.
6. Why are there pages that do not appear in the results?
There are several reasons, for instance:
- they were published with errors
- they are under the embargo period
- they were not archived
7. Why are some searches slow?
The more terms the query expression has, the slower is the search. We are, however, continuously improving our system.
8. What is the criterion to sort the search results?
The ranking system uses several page features to present the most relevant results for a search.
In general, a preserved page that contains the search term in its title, text, address or anchor text of incoming links is considered to be more relevant for the search.
In the advanced search page you can choose to sort results by date.
You can learn more about the search results ranking through our publications.