Tutorial on using web archives for research

Last updated on December 29th, 2022 at 12:09 pm

Gathering professionals from all around the world, the 22nd International Conference on Theory and Practice of Digital Libraries, TPDL 2018, took place from September 10th to 13th at the Faculty of Engineering of the University of Porto.

Alongside the conference, two events discussed web archives and the preservation of the web.

Tutorial on using web archives for research

On its first day, TPDL 2018 held the tutorial “Research the Past Web using Web archives”, in which researchers, computer scientists, information professionals, and webmasters discussed new ways and tools to preserve information published online.

The tutorial presented ways to create and explore web archives, as well as methods and technologies to develop web applications that automatically access and process information preserved in web archives, such as the Wayback Machine, the Memento Time Travel protocol, or the Arquivo.pt API.
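
As a rough illustration of the kind of programmatic access mentioned above, the sketch below builds a Wayback-style replay URL and a Memento Accept-Datetime request header for a given date. It is a minimal sketch, not material from the tutorial: the https://arquivo.pt/wayback prefix, the function names and the example URL are assumptions made for illustration, and no network request is made.

```python
from datetime import datetime, timezone
from email.utils import format_datetime

# Assumed replay prefix, following the common Wayback-style pattern
# <prefix>/<14-digit UTC timestamp>/<original URL>.
WAYBACK_PREFIX = "https://arquivo.pt/wayback"


def wayback_url(original_url, when, prefix=WAYBACK_PREFIX):
    """Build a Wayback-style replay URL for a page near a given date."""
    stamp = when.astimezone(timezone.utc).strftime("%Y%m%d%H%M%S")
    return f"{prefix}/{stamp}/{original_url}"


def memento_headers(when):
    """Build the Accept-Datetime header used for Memento datetime negotiation."""
    utc_when = when.astimezone(timezone.utc)
    return {"Accept-Datetime": format_datetime(utc_when, usegmt=True)}


if __name__ == "__main__":
    # Hypothetical example: the morning of the RESAW@Porto2018 workshop.
    when = datetime(2018, 9, 13, 9, 0, 0, tzinfo=timezone.utc)
    print(wayback_url("http://example.com/", when))
    print(memento_headers(when))
```

A client would send the header in a request to a Memento TimeGate, which redirects to the capture closest to the requested date.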

The average satisfaction with the tutorial was 4 out of 5, and 80% of the participants would recommend it to their colleagues. See all the results from the evaluation questionnaire.


Workshop RESAW@Porto2018

On Thursday, the RESAW@Porto2018 workshop took place at TPDL 2018. It was organized by Jane Winters (School of Advanced Study) and Daniel Gomes (Arquivo.pt).

The event was aimed at researchers who explore web archives to search for information about the past. There were 24 participants from 11 countries.

Speakers from five countries presented a range of different approaches, from small-scale analyses of individual websites to large-scale investigations of entire domains, and innovative combinations of quantitative and qualitative research methods.

The third RESAW Conference will take place in Amsterdam, 19-21 June 2019. The call for papers is open.

Presentations and abstracts

  • Conta-me histórias – Temporal summarization and the Portuguese web archive, by Ricardo Campos, Arian Pasquali, Vítor Mangaravite, Alípio Jorge, Adam Jatowt – presentation, abstract
  • The Archived Web for Research in the humanities and social sciences, by Jane Winters – presentation
  • The ‘Arquivo de Opinião’ archive, by Miguel Won – presentation, abstract
  • The Memento Infrastructure, by Martin Klein, Herbert Van de Sompel – presentation, abstract
  • Web Archive Research and the role of (digital) academic libraries, by Thomas Risse – presentation, abstract




Tutorial and Workshop in Porto, September

Working session of the Investiga XXI group

Last updated on August 7th, 2018 at 10:32 am

Would you like to know more about web archives?

Then, do not miss the RESAW@Porto2018 workshop and the tutorial Using web archive tools to preserve and research the Past Web.

Workshop RESAW@Porto2018

The RESAW@Porto2018 workshop is aimed at everyone who wishes to explore web archives to search for information about the past. The detailed program is already available.

This workshop will be held on September 13, 2018 (9:00-18:30) in Porto (FEUP), Portugal, as part of the TPDL 2018 international conference.

Price and registration

The registration fee is 120 euros, or 90 euros for students. Lunch is included.

In the registration form, you must:

  1. send the following comment: “Special authorization for a reduced fee to the Web Archive workshop – An introduction to web archives for Humanities and Social Science research”;
  2. choose the “bank transfer” payment option.

You will then receive the details needed to make the payment.

You may register for the workshop only. Registration for the rest of the conference is optional.

Tutorial “Research the Past Web using Web archives”

The tutorial Research the Past Web using Web archives is suitable for researchers, computer scientists, information professionals and webmasters, who wish to gain new insights about preserving information published online.

This tutorial will be held on September 10, 2018 (9:00-12:30) in Porto (FEUP), Portugal, as part of the TPDL 2018 conference.

Price and registration

Registration for the tutorial is free, but registration for the rest of the conference is mandatory.

Spread the word!

Help us spread the word about these events among potential participants.

Arquivo.pt 2018 Awards Presentation

Last updated on April 3rd, 2019 at 11:22 am

The Arquivo.pt 2018 Award was presented to the winners by the European Commissioner for Research, Science and Innovation, Carlos Moedas, on July 3 at Ciência 2018 – Encontro com a Ciência e Tecnologia, at the Lisbon Congress Centre.




Arquivo.pt 2018 Award Winners

Last updated on July 24th, 2024 at 12:14 pm

The Arquivo.pt 2018 Award was presented to the winners by the European Commissioner for Research, Science and Innovation, Carlos Moedas, on July 3 at Ciência 2018 – Encontro com a Ciência e Tecnologia, at the Lisbon Congress Centre.

27 applications were received.

First prize – “Conta-me Histórias” (Tell me Stories)

“Conta-me Histórias” won first prize and 10 000 euros. It is a team effort by Ricardo Campos, Arian Pasquali, Vítor Mangaravite, Alípio Jorge and Adam Jatowt.

Conta-me Histórias is an online service that provides a temporal narrative about any subject using 24 electronic news platforms.

Second Prize – Framing the concept of “homosexuality” in 20 years of publication of the Expresso newspaper

The second prize of 3 000 euros was given to “Framing the concept of ‘homosexuality’ in 20 years of publication of the Expresso newspaper”, by João Teixeira Duarte and Zélia Teixeira.

This study aims to prompt reflection on how homosexuality was portrayed in a Portuguese newspaper over 20 years.

Third Prize – “Arquivo de Opinião”

The third prize of 2 000 euros was given to “Arquivo de Opinião”, by Miguel Won.

Arquivo de Opinião is a web application that offers users a digital repository of opinion articles published between 2008 and 2016 in the most prominent Portuguese media outlets.

About Arquivo.pt Award

Every year, the Arquivo.pt Award highlights innovative and original projects that demonstrate the usefulness of Arquivo.pt as a research data infrastructure.


Software Engineer Java/Linux needed at Arquivo.pt

Last updated on October 1st, 2018 at 04:40 pm

Arquivo.pt is looking for a Java/Linux Software Engineer to perform the following duties:

  • Maintenance and quality control of the service;
  • Development of large scale distributed computer systems;
  • Software maintenance and corrective evolution;
  • Interaction with the scientific community to establish collaborations.


  • Bachelor's, master's or PhD degree in computer science;
  • Experience with: Java J2EE, JavaServer Pages, JavaBeans, Python, Tomcat, Maven/Ant, Linux and Apache HTTPd servers;
  • Knowledge about architecture, development and operation of distributed computer systems;
  • Good level of English.


  • Management of source code repositories (e.g. Git);
  • Management of automated testing platforms (e.g. Jenkins, SeleniumHQ, SonarQube);
  • Participation in open source collaboration projects;
  • Experience managing Internet production systems;
  • Knowledge of Big Data/NoSQL (e.g. Hadoop, HBase, Lucene, Solr);
  • Knowledge of Information Retrieval or Machine Learning;
  • Experience in web archive technologies (e.g. Wayback Machine, Heritrix, NutchWAX);
  • Participation in Research & Development projects.


Arquivo.pt Award will be given at Ciência 2018

Last updated on October 26th, 2018 at 11:36 am

The Arquivo.pt 2018 Award will be presented at “Ciência 2018 – Encontro com a Ciência e Tecnologia”, which will take place from the 2nd to the 4th of July at the Lisbon Congress Centre.

The European Commissioner for Research, Science and Innovation, Carlos Moedas, will present the award at the plenary session “Novas fronteiras da era digital na Europa e no Mundo” (New frontiers of the digital era in Europe and the world), on July 3 at 17:30. The first prize is 10 000 euros, the second is 3 000 euros and the third is 2 000 euros.

This is the first edition of the Arquivo.pt Award. The competition closed on May 4, with 27 proposals received from fields such as media, technology, computer science, tourism, and cultural and historic heritage. The award recognises originality and innovation in the application of Arquivo.pt and the importance of web preservation.

Sites crawled in 2016 are now available through Arquivo.pt!

Last updated on October 1st, 2018 at 04:41 pm

The information collected from the Web during 2016 is now available at Arquivo.pt.

Remember events of 2016 such as the United States presidential election or UEFA Euro 2016.

You can now access 964 million preserved files (58 TB) from 2 million websites collected in 2016.

More examples of preserved pages

Web archives for research: Call for Proposals!

RESAW@Porto2018 workshop

An introduction to web archives for Humanities and Social Science research.

This one-day workshop will be held as part of TPDL 2018 on 13 September 2018, Porto, Portugal.

RESAW is a network of researchers and web archivists that promotes the development of Research infrastructures for the Study of Archived Web Material.

The organizing committee is composed of Jane Winters (Professor of Digital Humanities at the University of London) and Daniel Gomes (Arquivo.pt).

Call for Proposals

We are seeking proposals from potential contributors to the workshop in two main areas:

  • Work presenting the state of the field and discussing the opportunities and challenges of using this new kind of primary source for research. This may include demonstrations of existing web archives.
  • Presentations of ground-breaking Humanities and Social Science research drawing on web archives, from small-scale analyses of individual websites to large-scale investigations of entire domains.


To apply, just submit a 300-500 word abstract via our online form. Final presentations should be around 30 minutes in length.

Deadline for applications: 15 June 2018, 17:00 (GMT+1).

All details at: http://arquivo.pt/resawPorto2018

Spread the word!

Please disseminate among potentially interested authors and attendees.


Jane Winters & Daniel Gomes

Arquivo.pt at Jornadas de Computação Científica 2018

Last updated on October 2nd, 2018 at 02:12 pm

The Jornadas de Computação Científica 2018 took place in April in Braga, Portugal, at the International Iberian Nanotechnology Laboratory (INL).

At the event, the Portuguese Secretary of State for Science, Technology and Higher Education, Maria Fernanda Rollo, spoke about the Digital Future.

The Arquivo.pt team and some guests presented new services and applications.

Plenary session

Presentations about Arquivo.pt

Grants to collaborate with Arquivo.pt

Last updated on October 1st, 2018 at 04:41 pm

The ROSSIO Project – Ciências Sociais, Artes e Humanidades has an open call for three grants in collaboration with Arquivo.pt. The call is open until the 10th of May 2018.

Check the application details at the links below.

Digital Curator / Information Manager

Main tasks

  • Creation and management of themed collections in digital format.
  • Quality control and tests.
  • Training and user support.
  • Critical review of scientific and technical documents (e.g. scientific articles, operational documents).
  • Validation and information management in multiple digital formats (e.g. videos, web pages, documents).
  • Identification and documentation of user needs.

Application details

Community Manager

Main tasks

  • User support through the creation and management of community interaction.
  • Creation of strategic communication and training.
  • Communication quality control support.
  • Content creation and social media dissemination.
  • Assessment of service needs based on users' actual performance.
  • Networking management.

Application details

Social Sciences and Humanities Researcher

Main tasks

  • Research to demonstrate the scientific utility of digital platforms.
  • Training in community involvement for researchers.

Application details