Arquivo.pt is a free public service that allows searching and accessing Web pages preserved since the 1990’s, such as viewing an old news or accessing an old version of a website.
The collaboration between the AMCC and Arquivo.pt is materialized in a training program entitled Arquivo.pt: Digital Skills for the Media, developed in four webinars, and in the attribution of the AMCC Honorable Mention to work done on Portuguese centenary newspapers in the Arquivo.pt Award 2023.
Webinar cycle: Arquivo.pt: digital skills for media
The webinar cycle aims to equip trainees with digital skills that enable them to solve problems caused by the disappearance of digital information and gain competitive advantage in the production of unique and exclusive content.
Webinar 1: A tool for quickly searching the past
Data: Mars 24, 2023 Time: 14h00-15h30 (in Portuguese)
Until May 4th, Arquivo.pt launches the challenge of creating a work based on historical information preserved from the Web.
In this 6th edition of the Arquivo.pt Award, 15 000 euros will be granted to the three best works (1st place: 10 000 euros).
Works about any subject may be submitted, done individually or in group. The only condition is that Arquivo.pt was the main source of information.
The Público newspaper will grant an Honorable Mention for works based on the web-archived content of Público online.
The Aveiro Media Competence Center (AMCC) will also grant an Honorable Mention to one of the submitted works that focuses on the archives of the online version of century-old newspapers.
Exame Informática, the oldest Portuguese magazine on Information and Communication Technology, distinguished Arquivo.pt with the award for the Best Digital Service of the year 2022.
Daniel Gomes, manager of Arquivo.pt, dedicated the award to the various teams that have worked on Arquivo.pt over the years. In the month in which Arquivo.pt marked 15 years of existence, this distinction is an excellent anniversary gift, he concluded.
On November 8, 2007, the Portuguese Web Archive was officially created and later named Arquivo.pt.
To celebrate this date, Wikimedia Portugal and Arquivo.pt have associated themselves in the organization of an online event dedicated to the preservation of the digital heritage.
Agenda
Introdução – André Barbosa, Wikimédia Portugal (Video)
15 anos de Arquivo.pt – Daniel Gomes, Arquivo.pt (Slides, Video)
Wikimedia na Universidade: Exploração e Projetos na NOVA FCSH – Rute Correia, Residência WMPT na NOVA FCSH, (Slides;Video)
GLAM Wiki. Uma introdução geral – Giovanna Fontenelle, Fundação Wikimédia, Brasil (Slides;Video)
Demo dos recursos em acesso livre no Arquivo.pt – Daniel Gomes (Video)
On August 15, 2021 the presidential palace in Kabul was taken over by the Taliban, consummating the fall of the regime that had been in place for 20 years, following the 9/11 attacks on the United States.
No time to lose when it comes to preserving the Web
Arquivo.pt reacted quickly, launching an automatic content search focused on .af domain sites and on international media news about the ongoing events.
On August 17, the websites began to be recorded.
1800 website addresses from Afghanistan (ending in .af) and 500 media news stories from around the world were used.
The addresses, URLs or “seeds” were obtained through automated search using the Bing Search API and immediately put into recording.
Content available to know Afghanistan’s history
As a result of the collection carried out, more than 400 Gigabytes of information became available at Arquivo.pt, which anyone can use for research in the most diverse areas.
The main contribution of Arquivo.pt to the community of Web archivists was the use of the automatic search that allows a quick reaction in the recording of Web contents in imminent risk of being lost.
One of them was the tutorial “Timeline summarization for large-scale past-web events with Python: the case of Arquivo.pt” developed by Daniel Gomes and Ricardo Campos.
Since 2008 the cryptocurrency market has revolutionised the world by innovating and expanding into other areas (e.g., finance and art). However, with this rapid expansion, many projects are created every day, giving rise to a wide and varied range of websites, technologies and scams. Markets follow financing stages and it is during an initial stage of euphoria that more projects are created.
We believe that as the cryptocurrency market stabilises, projects/websites are disappearing because funding diminishes or runs out.
Arquivo.pt initiated a new web archive collection that preserves web content that documents Cryptocurrency activities.
This work produced a new open dataset with information documenting each cryptocurrency project, including it is original URLs and links to the corresponding web-archived version in Arquivo.pt. The information sources selected to create this dataset were:
We believe that by creating this new dataset related to cryptocurrencies and by preserving all the corresponding web content, it has the potential to originate innovative scientific contributions in several areas such as Economy or Digital Humanities.
The winners of the Arquivo.pt Award 2022 were announced by the Público newspaper on 22th July 2022, the official communication partner of this edition, which awarded an honorable mention to the best work based on its historical web content.
This work developed a methodology for the automatic classification of stigmatizing mental illness articles, present in Portuguese online news newspapers, using Artificial Intelligence.
For example, a news article that uses the term schizophrenia associated with a news article about political life is classified as stigmatizing. Using automated processes, this work allows to identify thousands of news items and draw the attention of the media and society to the stigmatization of mental illnesses.
The 3rd place winner received a prize of 2 000 euros and was awarded to the work “Arquivo Público”, developed by Diogo Correia and Ricardo Campos.
“Arquivo Público” is a web application focused on the contents published on the Público newspaper website over time and preserved by Arquivo.pt.
As a result, we have a web interface that allows the visualization of archived news about a specific subject and also the representation of the number of news, most frequent terms and geographical reference.
The Público newspaper, official partner of the 5th edition of the Arquivo.pt Award, granted an Honorable Mention to the work “Arquivo Público”, carried out by Diogo Correia and Ricardo Campos.
Photos of the award cerimony
The award ceremony took place during the commemorative session of the National Day of Scientific Culture, on November 24th 2022, at the Teatro Thalia, in Lisbon.
The awards were presented by the Minister of Science, Technology and Higher Education, Elvira Fortunato, the President of the Board of Directors of FCT, Madalena Alves, and the representative of the media partner, the science editor of Público newspaper, Teresa Firmino.
Image Gallery
Créditos das fotos: Pedro Ferreira – FCT | FCCN | Arquivo.pt
The International Internet Preservation Consortium (IIPC), a consortium that brings together Web preservation initiatives from around the world, held its General Assembly with its members between May 17 and 19, 2022.
The following week, between May 24 and 25, held the IIPC Web Archiving Conference (IIPC WAC), online as in the previous year due to the contingencies of the Covid-19 pandemic.
Arquivo.pt resources and initiatives presented at the IIPC WAC 2022
The IIPC Web Archiving Conference is an initiative open to the community, where people or entities interested in the Web preservation domain may participate.
The Arquivo.pt contributed to the Ligthtning Talks sessions (session 5 and session 13).
The Arquivo.pt presentations focused on the resources and initiatives that this service has lately developed for the community.
Exhibiting Web memories from Arquivo.pt with free tools (abstract, slides, video)
The Portuguese Museums Network was the community invited to participate in the cycle of three webinars entitled “Cultural Heritage on the Web: online presence of museums”.
The aim is to raise awareness among museum managers and professionals about the importance of preserving content published on the Web and to make known the services and tools of Arquivo.pt.
This initiative is promoted by the Direção Geral do Património Cultural, through the Departamento de Museus, Conservação e Credenciação and Divisão de Museus e Credenciação, which welcomed and integrated in its training offer the proposal of Arquivo.pt (FCT, I.P.) .
Information and materials
June 21st, 2022 – The Arquivo.pt and the preservation of digital memory (1st webinar)
In this session Arquivo.pt is presented as a useful service to museums and institutions that the community can count on to preserve digital cultural heritage, specifically Web content.
Speaker: Ricardo Basílio, digital curator (in substitution of Daniel Gomes, manager of Arquivo.pt)
June 27, 2022 – Archiving the Web: DIY (3rd Webinar)
This session offers a tutorial for creating a local web archive, recording contentes in a standard format and using open tools that any person can use.