Documents cite web content by referencing their URLs so that readers can later access them.
In the case of scientific articles, the importance of these citations is even greater to maintain the integrity of research works because they often reference essential information to enable the reproducibility of an experiment or analysis.
For example, links in a scientific article may cite the datasets, software or web news that supported the research, which are not included in the text of the article.
To respond to the need of preserving the integrity of documents, Arquivo.pt launched the CitationSaver.
CitationSaver automatically extracts cited links in a document and preserves their content (e.g. web pages cited in a book) so that they can be retrieved later from Arquivo.pt.
Use CitationSaver to preserve the integrity of your documents
Upload a document and CitationSaver will extract the cited URLs, archive their content and make it available on Arquivo.pt after a short notice. There are 3 methods to upload a document:
insert the address (URL) of the PDF or TXT file, if it is published online
upload the file in PDF or TXT format
paste the text containing the addresses you want to preserve (e.g. References section of an article or Bibliography of a book).
Organizations keep domains that referenced websites which are no longer used, to prevent them from being bought or because they were just forgotten.
The aim of project Renascer (Reborn) is to bring back historical websites whose content is no longer available online and whose domain continues to be held by their authors.
“Forgotten” domains can cause cybersecurity problems
In this situation, the original content of the website was inaccessible despite the fact that the domain continued to be owned by the author of the website.
Furthermore, since the domain was still pointing to an active web server, cybersecurity issues could occur if this server was not being properly maintained.
The domain owner only has to redirect it to Arquivo.pt, through the Memorial service.
For example, the mctes.pt domain started to reference back its original contents preserved by Arquivo.pt, thus making this website to be reborn.
Examples of Reborn domains
Project Renascer identified active domains managed by FCCN which were not referencing any content, and gave them a new life turning them to reference its historical contents preserved by Arquivo.pt.
Contact Arquivo.pt to reborn the historical websites of your organization.
Arquivo.pt is a free public service that allows searching and accessing Web pages preserved since the 1990’s, such as viewing an old news or accessing an old version of a website.
The collaboration between the AMCC and Arquivo.pt is materialized in a training program entitled Arquivo.pt: Digital Skills for the Media, developed in four webinars, and in the attribution of the AMCC Honorable Mention to work done on Portuguese centenary newspapers in the Arquivo.pt Award 2023.
Webinar cycle: Arquivo.pt: digital skills for media
The webinar cycle aims to equip trainees with digital skills that enable them to solve problems caused by the disappearance of digital information and gain competitive advantage in the production of unique and exclusive content.
Webinar 1: A tool for quickly searching the past
Data: Mars 24, 2023 Time: 14h00-15h30 (in Portuguese)
Until May 4th, Arquivo.pt launches the challenge of creating a work based on historical information preserved from the Web.
In this 6th edition of the Arquivo.pt Award, 15 000 euros will be granted to the three best works (1st place: 10 000 euros).
Works about any subject may be submitted, done individually or in group. The only condition is that Arquivo.pt was the main source of information.
The Público newspaper will grant an Honorable Mention for works based on the web-archived content of Público online.
The Aveiro Media Competence Center (AMCC) will also grant an Honorable Mention to one of the submitted works that focuses on the archives of the online version of century-old newspapers.
Exame Informática, the oldest Portuguese magazine on Information and Communication Technology, distinguished Arquivo.pt with the award for the Best Digital Service of the year 2022.
Daniel Gomes, manager of Arquivo.pt, dedicated the award to the various teams that have worked on Arquivo.pt over the years. In the month in which Arquivo.pt marked 15 years of existence, this distinction is an excellent anniversary gift, he concluded.
On November 8, 2007, the Portuguese Web Archive was officially created and later named Arquivo.pt.
To celebrate this date, Wikimedia Portugal and Arquivo.pt have associated themselves in the organization of an online event dedicated to the preservation of the digital heritage.
Agenda
Introdução – André Barbosa, Wikimédia Portugal (Video)
15 anos de Arquivo.pt – Daniel Gomes, Arquivo.pt (Slides, Video)
Wikimedia na Universidade: Exploração e Projetos na NOVA FCSH – Rute Correia, Residência WMPT na NOVA FCSH, (Slides;Video)
GLAM Wiki. Uma introdução geral – Giovanna Fontenelle, Fundação Wikimédia, Brasil (Slides;Video)
Demo dos recursos em acesso livre no Arquivo.pt – Daniel Gomes (Video)
On August 15, 2021 the presidential palace in Kabul was taken over by the Taliban, consummating the fall of the regime that had been in place for 20 years, following the 9/11 attacks on the United States.
No time to lose when it comes to preserving the Web
Arquivo.pt reacted quickly, launching an automatic content search focused on .af domain sites and on international media news about the ongoing events.
On August 17, the websites began to be recorded.
1800 website addresses from Afghanistan (ending in .af) and 500 media news stories from around the world were used.
The addresses, URLs or “seeds” were obtained through automated search using the Bing Search API and immediately put into recording.
Content available to know Afghanistan’s history
As a result of the collection carried out, more than 400 Gigabytes of information became available at Arquivo.pt, which anyone can use for research in the most diverse areas.
The main contribution of Arquivo.pt to the community of Web archivists was the use of the automatic search that allows a quick reaction in the recording of Web contents in imminent risk of being lost.
The winners of the Arquivo.pt Award 2022 were announced by the Público newspaper on 22th July 2022, the official communication partner of this edition, which awarded an honorable mention to the best work based on its historical web content.
This work developed a methodology for the automatic classification of stigmatizing mental illness articles, present in Portuguese online news newspapers, using Artificial Intelligence.
For example, a news article that uses the term schizophrenia associated with a news article about political life is classified as stigmatizing. Using automated processes, this work allows to identify thousands of news items and draw the attention of the media and society to the stigmatization of mental illnesses.
The 3rd place winner received a prize of 2 000 euros and was awarded to the work “Arquivo Público”, developed by Diogo Correia and Ricardo Campos.
“Arquivo Público” is a web application focused on the contents published on the Público newspaper website over time and preserved by Arquivo.pt.
As a result, we have a web interface that allows the visualization of archived news about a specific subject and also the representation of the number of news, most frequent terms and geographical reference.
The Público newspaper, official partner of the 5th edition of the Arquivo.pt Award, granted an Honorable Mention to the work “Arquivo Público”, carried out by Diogo Correia and Ricardo Campos.
Photos of the award cerimony
The award ceremony took place during the commemorative session of the National Day of Scientific Culture, on November 24th 2022, at the Teatro Thalia, in Lisbon.
The awards were presented by the Minister of Science, Technology and Higher Education, Elvira Fortunato, the President of the Board of Directors of FCT, Madalena Alves, and the representative of the media partner, the science editor of Público newspaper, Teresa Firmino.
Image Gallery
Créditos das fotos: Pedro Ferreira – FCT | FCCN | Arquivo.pt
The International Internet Preservation Consortium (IIPC), a consortium that brings together Web preservation initiatives from around the world, held its General Assembly with its members between May 17 and 19, 2022.
The following week, between May 24 and 25, held the IIPC Web Archiving Conference (IIPC WAC), online as in the previous year due to the contingencies of the Covid-19 pandemic.
Arquivo.pt resources and initiatives presented at the IIPC WAC 2022
The IIPC Web Archiving Conference is an initiative open to the community, where people or entities interested in the Web preservation domain may participate.
The Arquivo.pt contributed to the Ligthtning Talks sessions (session 5 and session 13).
The Arquivo.pt presentations focused on the resources and initiatives that this service has lately developed for the community.
Exhibiting Web memories from Arquivo.pt with free tools (abstract, slides, video)
The Portuguese Museums Network was the community invited to participate in the cycle of three webinars entitled “Cultural Heritage on the Web: online presence of museums”.
The aim is to raise awareness among museum managers and professionals about the importance of preserving content published on the Web and to make known the services and tools of Arquivo.pt.
This initiative is promoted by the Direção Geral do Património Cultural, through the Departamento de Museus, Conservação e Credenciação and Divisão de Museus e Credenciação, which welcomed and integrated in its training offer the proposal of Arquivo.pt (FCT, I.P.) .
Information and materials
June 21st, 2022 – The Arquivo.pt and the preservation of digital memory (1st webinar)
In this session Arquivo.pt is presented as a useful service to museums and institutions that the community can count on to preserve digital cultural heritage, specifically Web content.
Speaker: Ricardo Basílio, digital curator (in substitution of Daniel Gomes, manager of Arquivo.pt)
June 27, 2022 – Archiving the Web: DIY (3rd Webinar)
This session offers a tutorial for creating a local web archive, recording contentes in a standard format and using open tools that any person can use.