Paper presented at EPIA 2009

An Updated Portrait of the Portuguese Web presented at EPIA 2009

The paper An Updated Portrait of the Portuguese Web, by João Miranda and Daniel Gomes, was presented at the 14th Portuguese Conference on Artificial Intelligence (EPIA 2009) in Aveiro.

This paper presents a characterization of the Portuguese Web derived from a crawl performed by the Portuguese Web Archive in March 2008, with 48 million documents in 2.5 TB of amount of data.

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Session at ISCTE “Archive.pt as an infrastructure for research in Social Sciences and Humanities

Session at ISCTE (Lisbon) “Archive.pt as an infrastructure for research in Social Sciences and Humanities”

You missed it?

No problem. Here are all the presentations:

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Technical report documents the creation of a searchable web archive

This report presents some of the work developed to create an efficient and effective web archive service, from data acquisition to user interface design.

The results of this research were applied to create the Portuguese Web Archive that is publicly available since January 2010. It supports full-text search over 1 billion contents archived from 1996 to 2010. The developed software is available as an open source project.

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone