Exame Informática, the oldest Portuguese magazine on Information and Communication Technology, distinguished Arquivo.pt with the award for the Best Digital Service of the year 2022.
Daniel Gomes, manager of Arquivo.pt, dedicated the award to the various teams that have worked on Arquivo.pt over the years. In the month in which Arquivo.pt marked 15 years of existence, this distinction is an excellent anniversary gift, he concluded.
On November 8, 2007, the Portuguese Web Archive was officially created and later named Arquivo.pt.
To celebrate this date, Wikimedia Portugal and Arquivo.pt joined forces to organize an online event dedicated to the preservation of digital heritage.
Agenda
Introduction – André Barbosa, Wikimedia Portugal (Video)
15 years of Arquivo.pt – Daniel Gomes, Arquivo.pt (Slides, Video)
Wikimedia at the University: Exploration and Projects at NOVA FCSH – Rute Correia, WMPT Residency at NOVA FCSH (Slides; Video)
GLAM Wiki: A general introduction – Giovanna Fontenelle, Wikimedia Foundation, Brazil (Slides; Video)
Demo of the open-access resources at Arquivo.pt – Daniel Gomes (Video)
On August 15, 2021 the presidential palace in Kabul was taken over by the Taliban, consummating the fall of the regime that had been in place for 20 years, following the 9/11 attacks on the United States.
No time to lose when it comes to preserving the Web
Arquivo.pt reacted quickly, launching an automatic content search focused on .af domain sites and on international media news about the ongoing events.
On August 17, the websites began to be recorded.
The collection started from 1800 website addresses from Afghanistan (ending in .af) and 500 media news stories from around the world.
The addresses, URLs or “seeds” were obtained through automated search using the Bing Search API and immediately put into recording.
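The seed-selection step can be sketched as follows. This is only an illustration, not the code Arquivo.pt used: it assumes the search results arrive as JSON shaped like a Bing Web Search API v7 response (a `webPages.value` list whose items carry a `url` field), and the function name `extract_seeds` and the sample data are hypothetical.

```python
from typing import List
from urllib.parse import urlsplit


def extract_seeds(search_response: dict, tld: str = ".af") -> List[str]:
    """Return de-duplicated result URLs whose hostname ends with `tld`."""
    seeds = []
    seen = set()
    # Assumed response layout: {"webPages": {"value": [{"url": ...}, ...]}}
    results = search_response.get("webPages", {}).get("value", [])
    for item in results:
        url = item.get("url", "")
        host = (urlsplit(url).hostname or "").lower()
        if host.endswith(tld) and url not in seen:
            seen.add(url)
            seeds.append(url)
    return seeds


# Hypothetical sample response, for illustration only.
sample_response = {
    "webPages": {
        "value": [
            {"url": "https://news.example.af/story-1"},
            {"url": "https://www.example.com/af/story-2"},
            {"url": "https://news.example.af/story-1"},
            {"url": "http://ministry.example.af/"},
        ]
    }
}

seeds = extract_seeds(sample_response)
```

The resulting list keeps only addresses under the chosen top-level domain and drops repeats, so it can be handed to a crawler as a seed list.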
Content available to know Afghanistan’s history
As a result of the collection carried out, more than 400 Gigabytes of information became available at Arquivo.pt, which anyone can use for research in the most diverse areas.
The main contribution of Arquivo.pt to the community of Web archivists was the use of the automatic search that allows a quick reaction in the recording of Web contents in imminent risk of being lost.
One of them was the tutorial “Timeline summarization for large-scale past-web events with Python: the case of Arquivo.pt” developed by Daniel Gomes and Ricardo Campos.
Since 2008 the cryptocurrency market has revolutionised the world by innovating and expanding into other areas (e.g., finance and art). However, with this rapid expansion, many projects are created every day, giving rise to a wide and varied range of websites, technologies and scams. Markets follow financing stages and it is during an initial stage of euphoria that more projects are created.
We believe that as the cryptocurrency market stabilises, projects/websites are disappearing because funding diminishes or runs out.
Arquivo.pt initiated a new web archive collection that preserves web content documenting cryptocurrency activities.
This work produced a new open dataset with information documenting each cryptocurrency project, including its original URLs and links to the corresponding web-archived versions in Arquivo.pt. The information sources selected to create this dataset were:
We believe that this new dataset related to cryptocurrencies, together with the preservation of all the corresponding web content, has the potential to originate innovative scientific contributions in several areas such as Economics or Digital Humanities.
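A record in a dataset of this kind pairs a project's original URL with a link to its archived version. The sketch below shows one way to build such rows. The column names, the helper names and the sample project are hypothetical, and the replay address pattern (`https://arquivo.pt/wayback/<timestamp>/<original URL>`) is an assumption about how Arquivo.pt exposes archived pages, not the published dataset's schema.

```python
import csv
import io
from datetime import datetime

# Assumed replay prefix for archived pages on Arquivo.pt.
WAYBACK_PREFIX = "https://arquivo.pt/wayback"


def archived_link(original_url: str, captured_at: datetime) -> str:
    """Build a link to the archived version captured at `captured_at`."""
    timestamp = captured_at.strftime("%Y%m%d%H%M%S")  # 14-digit timestamp
    return f"{WAYBACK_PREFIX}/{timestamp}/{original_url}"


def to_csv(rows: list) -> str:
    """Serialize dataset rows (dicts) to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["project", "original_url", "archived_url"],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical project, for illustration only.
original = "https://examplecoin.example.org/"
row = {
    "project": "ExampleCoin",
    "original_url": original,
    "archived_url": archived_link(original, datetime(2022, 6, 1, 12, 0, 0)),
}
csv_text = to_csv([row])
```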
The winners of the Arquivo.pt Award 2022 were announced on 22nd July 2022 by the Público newspaper, the official communication partner of this edition, which also awarded an honorable mention to the best work based on its historical web content.
This work developed a methodology that uses Artificial Intelligence to automatically classify articles in Portuguese online newspapers that stigmatize mental illness.
For example, a news article about political life that uses the term schizophrenia is classified as stigmatizing. Using automated processes, this work makes it possible to identify thousands of such news items and to draw the attention of the media and society to the stigmatization of mental illnesses.
The 3rd place, with a prize of 2 000 euros, was awarded to the work “Arquivo Público”, developed by Diogo Correia and Ricardo Campos.
“Arquivo Público” is a web application focused on the contents published on the Público newspaper website over time and preserved by Arquivo.pt.
As a result, we have a web interface that allows the visualization of archived news about a specific subject, along with the number of news items, the most frequent terms and the geographical references.
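The two simplest of these views, the number of news items over time and the most frequent terms, can be approximated with a few lines of counting. This is a minimal sketch and not the application's actual implementation: the function names, the tiny stopword list and the sample titles and timestamps are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list.
STOPWORDS = {"the", "a", "of", "in", "and", "to", "on"}


def most_frequent_terms(titles, top_n=3):
    """Count words across titles, ignoring stopwords, and return the top ones."""
    counts = Counter()
    for title in titles:
        # [^\W\d_]+ matches runs of letters, including accented ones.
        for word in re.findall(r"[^\W\d_]+", title.lower()):
            if word not in STOPWORDS:
                counts[word] += 1
    return counts.most_common(top_n)


def news_per_year(timestamps):
    """Count items per year from 14-digit capture timestamps (YYYYMMDDhhmmss)."""
    return Counter(ts[:4] for ts in timestamps)


# Hypothetical archived headlines and capture timestamps.
titles = [
    "Election results in Lisbon",
    "Lisbon hosts election debate",
    "Lisbon weather",
]
timestamps = ["20150101120000", "20150301000000", "20200101000000"]

top_terms = most_frequent_terms(titles, top_n=2)
per_year = news_per_year(timestamps)
```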
The Público newspaper, official partner of the 5th edition of the Arquivo.pt Award, granted an Honorable Mention to the work “Arquivo Público”, carried out by Diogo Correia and Ricardo Campos.
Photos of the award ceremony
The award ceremony took place during the commemorative session of the National Day of Scientific Culture, on November 24th 2022, at the Teatro Thalia, in Lisbon.
The awards were presented by the Minister of Science, Technology and Higher Education, Elvira Fortunato, the President of the Board of Directors of FCT, Madalena Alves, and the representative of the media partner, the science editor of Público newspaper, Teresa Firmino.
Image Gallery
Photo credits: Pedro Ferreira – FCT | FCCN | Arquivo.pt
The International Internet Preservation Consortium (IIPC), a consortium that brings together Web preservation initiatives from around the world, held its General Assembly with its members between May 17 and 19, 2022.
The following week, on May 24 and 25, it held the IIPC Web Archiving Conference (IIPC WAC), online as in the previous year due to the Covid-19 pandemic.
Arquivo.pt resources and initiatives presented at the IIPC WAC 2022
The IIPC Web Archiving Conference is an initiative open to the community, where people or entities interested in the Web preservation domain may participate.
Arquivo.pt contributed to the Lightning Talks sessions (session 5 and session 13).
Its presentations focused on the resources and initiatives that the service has recently developed for the community.
Exhibiting Web memories from Arquivo.pt with free tools (abstract, slides, video)
The Portuguese Museums Network was the community invited to participate in the cycle of three webinars entitled “Cultural Heritage on the Web: online presence of museums”.
The aim is to raise awareness among museum managers and professionals about the importance of preserving content published on the Web and to make known the services and tools of Arquivo.pt.
This initiative is promoted by the Direção Geral do Património Cultural, through the Departamento de Museus, Conservação e Credenciação and the Divisão de Museus e Credenciação, which welcomed the proposal of Arquivo.pt (FCT, I.P.) and integrated it in its training offer.
Information and materials
June 21st, 2022 – Arquivo.pt and the preservation of digital memory (1st webinar)
In this session Arquivo.pt is presented as a useful service to museums and institutions that the community can count on to preserve digital cultural heritage, specifically Web content.
Speaker: Ricardo Basílio, digital curator (replacing Daniel Gomes, manager of Arquivo.pt)
June 27, 2022 – Archiving the Web: DIY (3rd Webinar)
This session offers a tutorial for creating a local web archive, recording content in a standard format and using open tools that anyone can use.
The meeting was broadcast online with the aim of sharing with the community of archivists what has been an experience of collaborative curation of Web content.
Collaboration between a municipal archive and a web archive
This meeting continued a collaboration between the two teams that developed during the pandemic period.
The Arquivo Municipal de Sines made a selective and systematic collection of Web content related to the Municipality of Sines, with the collaboration of local media, such as Rádio Miróbriga and Rádio Sines.
In turn, Arquivo.pt contributed training on tools such as Webrecorder.net, which records in a standardized format, and prepared useful services such as SavePageNow, which allows pages to be recorded on the fly directly on Arquivo.pt.
Local history is better with preserved Web pages
This collaboration resulted in the preservation of thousands of Web pages (about 200 Gigabytes of information) about the experience of the pandemic in the geographical area of Sines and Santiago do Cacém.
The copies of the Web ARChive (WARC) files sent to Arquivo.pt have been integrated so that they can become available.
Cryptocurrencies and web curation were the starting point for sharing the news of the service and talking about the work developed since the last edition of the Jornadas.
Zapping session remembered the 15 years of Arquivo.pt
Arquivo.pt was created in 2007 with the goal of collecting the Portuguese Web. After fifteen years it continues its mission, collecting, but mainly facilitating the access to preserved contents, both for the researcher and the common citizen.
In the Zapping session at the conference, in which each FCCN service presented its work, Arquivo.pt was highlighted for its long-standing activity in Web preservation.
Training with the Library of the Escola Superior de Tecnologia e Gestão
The Arquivo.pt team was in the Library of the School of Technology and Management (ESTGV) in an extra session of the conference dedicated to digital preservation, mainly to institutional content published on the Web.
The training was promoted by the Library team, especially Dr. Rosa Silva, Coordinator of the service, and had the participation of the community. Besides the presentations, there was an opportunity to share ideas and point out future collaborations.
Paulo Medeiros, responsible for the service of Culture, Communication and Documentation, presented the institutional channels of the Instituto Politécnico de Viseu. These channels are increasingly present on the Web, such as the magazine Polistécnica that went digital in 2012, the scientific journal Millenium and the video channel Politécnico TV.
Arquivo.pt showed how any person or institution can have their Web contents preserved in an adequate format. To save contents directly on Arquivo.pt you can use the new SavePageNow recording service. To make a local Web archive you can use ArchiveWeb.page – Webrecorder.net.
Arquivo.pt APIs presented to Internet technologies students
The Arquivo.pt team was in the classroom, thanks to the excellent welcome given by Prof. Dr. Valter Alves, director of the Design and Multimedia Technology course. Vasco Rato, Web developer at Arquivo.pt, presented the Arquivo.pt APIs (Application Programming Interfaces) for the automatic processing of preserved information.
By using the Arquivo.pt APIs, the students can complete assignments for the technology subjects and compete for the Arquivo.pt Award.
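As an illustration of what such an assignment might start from, the sketch below builds a full-text search request and summarizes a response. It is not taken from the class. The endpoint `https://arquivo.pt/textsearch`, the `q`, `from`, `to` and `maxItems` parameters, and the `response_items`, `tstamp`, `title` and `linkToArchive` fields are assumptions about the Arquivo.pt text search API that should be checked against its current documentation; the helper names and sample payload are hypothetical.

```python
from urllib.parse import urlencode

# Assumed full-text search endpoint.
TEXTSEARCH_ENDPOINT = "https://arquivo.pt/textsearch"


def build_query_url(query, start=None, end=None, max_items=10):
    """Build a search URL, optionally restricted to a capture-date range."""
    params = {"q": query, "maxItems": max_items}
    if start:
        params["from"] = start  # assumed format: YYYYMMDDhhmmss
    if end:
        params["to"] = end
    return f"{TEXTSEARCH_ENDPOINT}?{urlencode(params)}"


def summarize_items(payload):
    """Extract (timestamp, title, archive link) tuples from a JSON response."""
    return [
        (item.get("tstamp"), item.get("title"), item.get("linkToArchive"))
        for item in payload.get("response_items", [])
    ]


url = build_query_url("viseu", start="20100101000000", end="20151231235959", max_items=5)

# Hypothetical payload shaped like the assumed response.
sample_payload = {
    "response_items": [
        {
            "tstamp": "20120305101500",
            "title": "Example page",
            "linkToArchive": "https://arquivo.pt/wayback/20120305101500/http://example.pt/",
        }
    ]
}
summary = summarize_items(sample_payload)
```

In a real assignment, the URL would be fetched (for instance with `urllib.request.urlopen`) and the JSON body parsed with `json.load` before being passed to `summarize_items`.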
Daniel Gomes at the Arquivo.pt session at the Jornadas FCCN 2022 in Viseu
Arquivo.pt session at the Jornadas FCCN 2022 in Viseu
Pedro Gomes at the Arquivo.pt session at the Jornadas FCCN 2022 in Viseu
Ricardo Basílio at the Arquivo.pt session at the Jornadas FCCN 2022 in Viseu
Training session at the ESTGV Library
Class in the Design and Multimedia Technology course at ESTGV
Web pages for the history of the Instituto Politécnico de Viseu
In 2018, the library team developed a project with the participation of young students that resulted in a documentary short film where memories of old web pages, preserved by Arquivo.pt, were included.