Paper presented at EPIA 2009

Last updated on October 2nd, 2017 at 10:51 am

An Updated Portrait of the Portuguese Web presented at EPIA 2009

The paper An Updated Portrait of the Portuguese Web, by João Miranda and Daniel Gomes, was presented at the 14th Portuguese Conference on Artificial Intelligence (EPIA 2009) in Aveiro.

This paper presents a characterization of the Portuguese Web derived from a crawl performed by the Portuguese Web Archive in March 2008, with 48 million documents in 2.5 TB of amount of data.

Session at ISCTE “Archive.pt as an infrastructure for research in Social Sciences and Humanities

Last updated on September 28th, 2017 at 11:13 am

Session at ISCTE (Lisbon) “Archive.pt as an infrastructure for research in Social Sciences and Humanities”

You missed it?

No problem. Here are all the presentations:

Portuguese Web Archive – a Memory Infrastructure @DLM2014

Last updated on December 20th, 2019 at 05:18 pm

Presentation about the Archive.pt service and the importance of web archiving to preserve the memory of Humanity.

Presentation on Thursday 17:15 (13 November) in Lisbon at DLM Forum – Making the Information Governance Landscape in Europe

The Forum will be held at Instituto Superior Técnico.

@dlmforum2014 #DLM2014

WWW 2013: Search the Past with the Portuguese Web Archive

Last updated on September 28th, 2017 at 01:29 pm

The Portuguese Web Archive (PWA) is at the World Wide Web Conference (WWW 2013) in Rio de Janeiro, Brazil, with a demo session.

The demo at WWW 2013 presents the Portuguese Web Archive, which enables search over 1.6 billion files archived from 1996 to 2012.

New video: “The Portuguese Web Archive and the open access to scientific knowledge”

Last updated on December 20th, 2019 at 05:31 pm

Web archiving contributes to empower open-access to science.

There is a growing amount of open access scientific knowledge published on the Web.

This video debates the importance of web archiving to empower open access to science.

Technical report documents the creation of a searchable web archive

Last updated on September 29th, 2017 at 02:17 pm

This report presents some of the work developed to create an efficient and effective web archive service, from data acquisition to user interface design.

The results of this research were applied to create the Portuguese Web Archive that is publicly available since January 2010. It supports full-text search over 1 billion contents archived from 1996 to 2010. The developed software is available as an open source project.

I. P. Santarém, 7th and 8th Feb.: learn more about the Portuguese Web Archive

Last updated on September 29th, 2017 at 02:22 pm

Come and meet the Archive’s team.

The Portuguese Web Archive will be presented at Jornadas FCCN on 7th and 8th of February 2012, with the following activities (in Portuguese):

93% of the searches are answered in less than 5 seconds

Last updated on September 29th, 2017 at 02:40 pm

Data from April to June, 2011

  • 93% of the full-text searches performed on the Portuguese Web Archive were responded in less than 5 seconds.
  • 95% of the URL searches were answered in less than 5 seconds.
  • 73% of the user clicks are on the first page of results.
  • We wrote 72 000 lines of code to improve the original search system based on the Archive-access project.
  • Try our search!