We archived the Web pages of the Portuguese Parliamentary Elections of 2015!

Arquivo.pt performed 4 crawls of Web pages related to the Portuguese Parliamentary Elections of 2015.

We had asked the community to contribute by suggesting Web pages related to the Parliamentary Elections of 2015 so that we could archive them.

The crawls ran during and after the election campaign period, using the list of 127 Web pages suggested by the community. In total, they archived 2 802 407 Web resources, which occupy 274 GB.

The collected Web pages include those of the running political parties, news in the media about the elections, blogs, opinion articles, and satirical political Web pages.

Arquivo.pt respects an embargo period of 1 year, so the archived collection will only be available by the end of 2016.

However, you can already consult some archived Web pages from previous Portuguese Parliamentary Elections, such as:

We would like to thank all the volunteers who helped with this initiative.
Now we need your collaboration in suggesting Web pages about the Portuguese Presidential Elections.
Can we count on you?

Paper presented at EPIA 2009

An Updated Portrait of the Portuguese Web presented at EPIA 2009

The paper An Updated Portrait of the Portuguese Web, by João Miranda and Daniel Gomes, was presented at the 14th Portuguese Conference on Artificial Intelligence (EPIA 2009) in Aveiro.

This paper presents a characterization of the Portuguese Web derived from a crawl performed by the Portuguese Web Archive in March 2008, comprising 48 million documents and 2.5 TB of data.


Session at ISCTE “Archive.pt as an infrastructure for research in Social Sciences and Humanities”

Session at ISCTE (Lisbon) “Archive.pt as an infrastructure for research in Social Sciences and Humanities”

Did you miss it?

No problem. Here are all the presentations:


Technical report documents the creation of a searchable web archive

This report presents some of the work developed to create an efficient and effective web archive service, from data acquisition to user interface design.

The results of this research were applied to create the Portuguese Web Archive, which has been publicly available since January 2010. It supports full-text search across 1 billion content items archived from 1996 to 2010. The developed software is available as an open source project.
