H2020 projects preserved by Arquivo.pt

Thumbnail H2020 projects

Last updated on December 6th, 2021 at 05:03 pm

The main objective of Arquivo.pt is to preserve online information for research and education purposes.

Previously, Arquivo.pt identified and preserved Research & Development project websites funded by the European Union during the FP4, FP5, FP6 and FP7 programmes (1994-2013).

Now, Arquivo.pt contributed to preserve online information that documents R&D projects funded by the Horizon 2020 programme (2014-2021). It preserved 197 million web files (17 TB) related to science for future access.

H2020 projects publish valuable information online but are being lost

Websites about Research and Development (R&D) projects are increasingly being used to publish and disseminate important scientific information that complements published literature (e.g. data sets, documentation or software).

However, after projects ending, the corresponding websites usually disappear causing a permanent loss of unique and valuable scientific information.

Arquivo.pt automatically identified URLs that document H2020 Research and Development projects

The European Union’s Open Data Portal published a data set from the Community Research and Development Information Service (CORDIS) that documents H2020 research projects. However, from the 31 129 projects listed, only 46% presented a project URL.

Arquivo.pt developed a low-cost methodology that automatically identifies URLs related to R&D projects to be systematically preserved. This automatic identification is achieved through the combination of open data sets with web search services. This methodology is detailed on a scientific article published at the International Conference on Digital Preservation 2016.

In sum, we extracted 106 300 unique URLs from the following open data sets:

Then, we extracted the acronym and title of the projects from the data sets and automatically searched the web for additional URLs using the Bing Search API.

All the data sets and tools developed have been made publicly available in open access so that they can be reused and collaboratively enhanced. In particular, you can access the software developed to automatically identify additional URLs about H2020 projects.

197 million web files related to science were preserved

Arquivo.pt identified and preserved 197 million web files (17 TB) that document R&D projects funded by Horizon 2020.

In 2021, we can already witness project websites that are no longer available online, such as the Extended Model of Organic Semiconductors (EXTMOS) project (http://extmos.eu/). However, it was preserved and can be accessed at Arquivo.pt:

Archived version at Arquivo.pt (https://arquivo.pt/wayback/20170427182603/http://extmos.eu/) of the home page of the EXTMOS Research and Development project (http://extmos.eu/)funded by H2020.
Archived version at Arquivo.pt of the home page of the EXTMOS Research and Development project funded by H2020.

Contributions to complement the European Open Data Sets

All the resulting data sets were made publicly available so that they can be improved and reused by other organizations also interested on preserving this digital heritage:

If you want to know more information about this collection you can watch the video Preservation of web content related to Horizon 2020.

References

Are you a researcher?

Create automatic narratives about any topic!

thumbnail-narrative-q2

Arquivo.pt provides a new function that allows you to automatically create temporal narratives on any topic.

The “Narrative” functionality, integrated into Arquivo.pt in September 2021, is the result of the collaboration between “Conta-me Histórias”, winner of the Arquivo.pt Award 2018, and Arquivo.pt.

The Conta-me Histórias” (Tell me Stories) project was developed by researchers from the Laboratory of Artificial Intelligence and Decision Support (LIAAD – INESCTEC )  and affiliated to the institutions Instituto Politécnico de Tomar – Center for Research in Smart Cities (CI2) ; University of Porto and University of Innsbruck .

How it works?

When a user enters a set of words about a topic in the Arquivo.pt search box and clicks on the “Narrative” button, the user is directed to the “Conta-me Histórias” service, which automatically analyzes the news from 25 websites archived by Arquivo.pt over time and presents a chronology of news related to the topic.

For example, if we search for “Just Bieber” and click on the “Narrative” button (Figure 1), we will be directed to the “Conta-me Histórias” , where we will automatically obtain a narrative of archived news (Figure 2).

example-narrative-arquivopt

Figure 1: Search results for pages about “Justin Bieber”.

example-tell-me-stories-arquivopt

Figure 2: Narrative of news about “Justin Bieber” from Portuguese news sites preserved by Arquivo.pt generated by the “Conta-me Histórias” service.

Create your narrative now!

“Conta-me Histórias” researches, analyzes and aggregates thousands of results to generate each narrative about a topic. It is recommended to choose descriptive words about well-defined themes, personalities or events to obtain good narratives.

Creating a narrative is useful for researchers, journalists or citizens who want to quickly get an overview of the evolution of a topic along time, thus saving them a lot of time and effort.

Go to Arquivo.pt and try to create a narrative about a theme of your choice.

Tell us about your experience so we can improve the service!

Presentations in the IIPC Web Archiving Conference and RESAW 2021

Thumbnail IIPC WAC 2021

Last updated on August 17th, 2021 at 07:35 pm

During the week of 14 to 18 June, three international meetings were held by videoconference with the participation of the Arquivo.pt:

    • International Internet Preservation Consortium (IIPC) – General Assembly – general assembly of the consortium that gathers the Web archiving initiatives around the world
    • Web Archiving Conference 2021 – the most important meeting in the field of Web preservation, where experts share new knowledge and experiences
    • RESAW Conference – meeting of the European RESAW network (Research Infrastructure for the Study of Archived Web Materials) this year in its 4th edition, mainly addressed to the community of researchers from non-technological scientific areas, such as Social Sciences, Arts and Humanities.

Contributions of Arquivo.pt to the international community

Arquivo.pt presented some results of the work developed in the last year, with emphasis on the functionalities that improve the reproduction of the archived contents, such as the “Complete the page”.
Two historical collections were integrated on the Arquivo.pt: the Geocities and the Internet Memory Foundation. Arquivo.pt did special collections about the 2019 European Elections and Covid-19.
The contents of Arquivo.pt are accessible to any researcher regardless of the country they are in and therefore it is a useful service to the international community.

Presentations

  • Arquivo.pt updates 2021: presentation at the IIPC – General Assembly, by Daniel Gomes (Vídeo)
  • Complete the page. 1 minute drop in (presentation at the IIPC – General Assembly “complete the page”), by Daniel Gomes (Slide)
  • A transnational and cross-lingual crawl of the European Parliamentary Elections 2019, by Ivo Branco (Slides, Vídeo)
  • Enhancing access to research the Geocities historical collection, by Pedro Gomes (Slides, Vídeo)
Complete the page - demo
Complete the page – demo. Slide used in the IIPC 1 minute presentation, at the IIPC General Assembly 2021

Meet the winners of the Arquivo.pt Award 2021!

Last updated on February 18th, 2022 at 12:39 pm


The winners of the Arquivo.pt 2020 Award was announced by the Público newspaper, the official media partner of this year’s edition, which granted an honorable mention to the best work based on the contents of the newspaper. 26 candidate works were received.

The award ceremony toke place during Science 2021 – Meeting with Science and Technology, june 30, at the Lisbon Congress Center.

1st place – “Major Minors”

The winner of the 10,000 euros prize was the work “Major Minors” by Paulo Martins e Leandro Costa.

“Major Minors” is an Ontology of press clippings from Portuguese newspapers with reference to social minorities.

2nd place – “Politiquices”

The 2nd prize in the amount of 3,000 euros was awarded to the work “Politiquices” developed by David Batista.

“Politquices” is a Web application that allows searching support or opposition relations between political personalities and parties expressed in news headlines preserved at Arquivo.pt.

This interface makes it possible to analyse the relationship of support or opposition between two political personalities or organisations.

3rd place – “Primeiras páginas de jornais online portugueses”

The 3rd place winner received a prize of 2,000 euros and was awarded to the work “Primeiras páginas de jornais online portugueses”, developed by Susana Parreiraunder the supervision of Ana Sabino, Ana Boavida e Penousal Machado.

“Primeiras páginas de jornais online portugueses” (Front pages of Portuguese online newspapers) presents an interactive graphical analysis of the front pages of Portuguese online newspapers. For this study, specific items within the newspaper design were analysed, thus allowing trends to be observed over time.

As a result we have a Web interface that allows interactively visualising, for example, the space occupied by the images on the Público newspaper front page.

Menção Honrosa do Público

Público newspaper, official partner of the 4th edition of the Arquivo.pt Prize, awarded its Honorable Mention to the work “Primeiras páginas de jornais online portugueses”.

Videos

Gallery

Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021
Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021 Entrega do Prémio Arquivo.pt 2021

Photos by Valter Gouveia – FCT | FCCN | Arquivo.pt

Internet Memory Foundation collection available in Arquivo.pt

logo Internet Memory Foundation - website

Last updated on September 15th, 2021 at 09:29 am

The historical collection of web content generated during the Internet Memory Foundation’s (IMF) activity has been donated to Arquivo.pt and is now searchable!

The IMF was a European organization dedicated to preserving web content that was wound up in 2018.

The 1st web archiving project in Europe (2004-2010) was led by Julien Masanès (who was guest of honour at the celebration of 10 years of Arquivo.pt) and was called European Archive Foundation.

In 2010, Julien Masanès, the “father” of Web archives in Europe created the IMF.

Examples of pages from the collection donated by the IMF

The collection donated by the IMF has now been integrated in the Arquivo.pt collection to be preserved for posterity.

This collection is composed of 142 million files that total 6.3 TB of historical information whose texts or images can now be searched through Arquivo.pt.

webpage liteScience printscreen

Life Science Competence in Europe portal, 2009.

print homepage www.limes.fp6-limes.eu

LIMES project homepage (Land and Sea Monitoring for Environment and Security), 2009.

print homepage intelligence-territoriale.eu

Project Intelligence-territoriale homepage, 2009.

European Parliament news page in the 20th anniversary of the break of the Berlim Wall, 2009.

Le Figaro about French presidential election, 2012.

Reuters with a new about WikiLeaks, 2011.

Print da página do Internet Memory Research em 2014

Internet Memory Foundation homepage, 2014.

Search this new collection!

This new collection has been named “InternetMemory” in the Arquivo.pt collections list.

Searches can be made on this collection using the collection search parameter or through the custom search page available at arquivo.pt/InternetMemory.

custom-search-page of Internet Memory collection

 

Replay with old browser and export results with the new version of Arquivo.pt

Exported results into an Excel sheet of a search for the word "universidade", university, limited to 10 items

Arquivo.pt launched a new version of its service on July 1, 2020 named Responsive.

The purpose of this version was to improve the user experience between different devices and add new features.

Replay a past webpage using a browser from the past

We added an option to view the archived page using a browser from the past. In the Options choose Replay with old browser and you will be redirected to the oldweb.today service that emulates browsers such as Netscape NavigatorMicrosoft Internet Explorer or NSCA Mosaic.

This external service is useful for research use cases, in areas such as Web design, Art, Communication or History,where it is necessary to access the original visual aspect of a page from the past in the most reliable way possible.

Web page of the European Union in 1996 using the Oldweb.Today service
Web page of the European Map of WWW/NIR sites in 1996 using the Oldweb.Today service

Try this new option from Arquivo.pt to replay the European Map of WWW/NIR sites in 1996 using a contemporary browser or any other historical page using the Oldweb.Today service.

You may have to wait a while for your request to be processed but it is always faster than having to install a browser from the past on your computer.

Export search results to spreadsheet format

This new function enables users to save their search results for further treatment and analysis. This is specially useful to perform thorough research about a given topic.

After a search, in the Options, just choose one of the available formats to export the obtained results: XLSX, CSV or TXT.

Exported results into an Excel sheet of a search for the word "universidade", university, limited to 10 items
Exported results into an Excel sheet of a search for the word “universidade”, university, limited to 10 items

More on the Responsive milestone

New version of Arquivo.pt (Webapp release)

Webapp release on mobile version example

Last updated on October 12th, 2020 at 11:52 am

Arquivo.pt launched a new version of its service on April 15, 2020 named WebApp.

The purpose of this version was to standardize the user experience between different devices and reduce maintenance costs by removing components with redundant functions.

Its main novelty is the combination of the desktop and mobile interfaces in a single user interface.

The old desktop version has been disabled and the mobile version has evolved to work on various types of devices and screen sizes.

Webapp release desktop and mobile

New design of the homepage

 

Try the new image and page search

Webapp release search in english

New user interfaces for image or text search

Help us to improve!

To help us, just search the Arquivo.pt using any device (e.g. laptop, mobile phone, tablet).

If you encounter any problems, please contact us!

Remember to always send the address of the page where you detected the problem.

More information

Arquivo.pt Award 2020 launched at Público Newspaper

Lançamento do Prémio Arquivo.pt 2020

Last updated on March 24th, 2020 at 12:21 pm

The Arquivo.pt Award 2020 was officially launched on January 16th, at the Público Newspaper in Lisbon. Público is one of the most well-known newspapers in Portugal.

The event had talks by the Director of Público Newspaper Manuel Carvalho, the President of the Foundation for Science and Technology Helena Pereira and the manager of Arquivo.pt Daniel Gomes.

Participants of this open event were led into a guided visit to the newsroom. They saw a real scenario where contents of a newspaper are edited and produced.

Público’s website is daily crawled by the Arquivo.pt, which means an important contribution to the future access and use of contents.

In the 2020 edition of the Arquivo.pt Award, Público Newspaper will grant an Honorable Mention to works based on the newspaper’s content along its 20 years.

Find out how to apply, till 4th of May: arquivo.pt/award2020

Galeria de fotos

Lançamento do Prémio Arquivo.pt 2020

Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020
Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020 Lançamento do Prémio Arquivo.pt 2020

Photos by: Valter Gouveia, FCT

Applications open to the Arquivo.pt Award 2020!

Arquivo.pt Award 2020

Applications are open to the Arquivo.pt Award 2020!

In this 3rd edition of the annual Arquivo.pt Award, € 15,000 will be awarded to the 3 best works (1st place: € 10,000).

The deadline for submissions is May 4, 2020.

Works may be developed individually or in group about any topic, as long as they use the information provided by Arquivo.pt as the main source of information.

The Público Newspaper is the official media partner of the Arquivo.pt Award in 2020. It was one of the first newspapers to become available online.

Jornal Público celebrates its 30th anniversary on March 5, 2020 and will award an Honorable Mention to one of the works which focused on the historical web-archive of Público online.

Full details about the applications are available at:
https://arquivo.pt/award2020

The Arquivo.pt Award promotes the visibility of the applicants and their institutions.

Help us to spread the word!

Meet the winners of the Arquivo.pt Award 2019!

Last updated on February 18th, 2022 at 12:28 pm

The winners of the Arquivo.pt Award 2019 were announced by His Excellency the Prime Minister of Portugal António Costa during the session that took place on July 8 at 9:00 am in Auditorium 1 during the event “Science 2019 – Encounter with Science and Technology” at the Lisbon Congress Center.

Entregas do Prémio Arquivo.pt durante o Encontro Ciência 2019

1_GOU6417
_GOU6658
_GOU6570
_GOU6861
_GOU6647
_GOU6630
_GOU6610-2
_GOU6687
_GOU6753
_GOU6719
_GOU6713
1_GOU6417 _GOU6658 _GOU6570 _GOU6861 _GOU6647 _GOU6630 _GOU6610-2 _GOU6687 _GOU6753 _GOU6719 _GOU6713

Photos by Valter Gouveia, FCT

1st place: meuParlamento.pt

MeuParlamento.pt is a mobile application that simulates the Portuguese Parliament, calling on all citizens to play the role of a deputy.

Authors: Nuno Moniz, Arian Pasquali, Tomás Amaro

2nd place: Revisionista.pt: Un-cover the news

The Revisionista.pt is an online tool to reveal post-publication changes in the Portuguese news.

Authors: Flávio Martins and André Mourão

3rd place: Public speeches about violence in private

Analysis of 217 news collected in Arquivo.pt from the three main daily newspapers, on domestic violence.

Author: Zélia de Macedo Teixeira

Materials for dissemination about the winners

About the Arquivo.pt Award 2019

Arquivo.pt is a research infrastructure managed by the Foundation for Science and Technology (FCT) that enables search and access to web pages archived since 1996.

The Arquivo.pt Award aims to annually promote innovative works based on historical information preserved by Arquivo.pt. Submissions closed on May 3 and we received works in areas such as: media studies, education, design, information technology, health or cultural and historical heritage.

The Arquivo.pt Award 2019 received the High Sponsorship of His Excellency the President of the Republic Marcelo Rebelo de Sousa.

Arquivo.pt Award 2020

The Arquivo.pt Award was approved as an annual initiative of the Foundation for Science and Technology that will run from January to May.

Sign up for the Arquivo.pt international mailing list to be informed about future editions of the Arquivo.pt Award!