Afghanistan Websites and the fall of the regime in August 2021

thumbnail_Karima Faryabi

Last updated on September 26th, 2022 at 03:57 pm

afghan-ministry-of-economy-17-08-2021

Afghanistan Ministry of Economy website with Karima Faryabi (recorded August 17, 2021)

On August 15, 2021 the presidential palace in Kabul was taken over by the Taliban, consummating the fall of the regime that had been in place for 20 years, following the 9/11 attacks on the United States.

The community of Web archivists, through the Content Development Working Group – International Internet Preservation Consortium, was challenged to record the Afghan sites, given the risk that they would disappear with the new regime.

No time to lose when it comes to preserving the Web

Arquivo.pt reacted quickly, launching an automatic content search focused on .af domain sites and on international media news about the ongoing events.

On August 17, the websites began to be recorded.

1800 website addresses from Afghanistan (ending in .af) and 500 media news stories from around the world were used.

The addresses, URLs or “seeds” were obtained through automated search using the Bing Search API and immediately put into recording.

Content available to know Afghanistan’s history

As a result of the collection carried out, more than 400 Gigabytes of information became available at Arquivo.pt, which anyone can use for research in the most diverse areas.

The main contribution of Arquivo.pt to the community of Web archivists was the use of the automatic search that allows a quick reaction in the recording of Web contents in imminent risk of being lost.

Know more

Arquivo.pt open data set (Dados.gov)

Content collected by the Content Development Working Group of the International Internet Preservation Consortium available at the Archive-it service

Meet the winners of the Arquivo.pt Award 2022!

thumbnail-award-arquivo.pt 2022

Last updated on August 9th, 2022 at 03:30 pm

The winners of the Arquivo.pt Award 2022 were announced by the Público newspaper on 22th July 2022, the official communication partner of this edition, which awarded an honorable mention to the best work based on its historical web content.

22 applications were received.

The award ceremony will take place in September on date to be announced.

1st place – “Arquivo do Parlamento”

The winner of the 10 000 euro prize was the work “Parliamentary Archive” developed by Tiago Santos.

“Parliament Archive” is a web application that aggregates news and opinion articles extracted from Arquivo.pt based on Parliament.pt’s open data.

For example, a user can search on a political personality and get speeches, news and other publications that Arquivo.pt has preserved.

2nd place – “Classificação automática de artigos estigmatizantes de doenças mentais”

The 2nd prize of 3 000 euros was awarded to the work “Automatic classification of stigmatizing articles of mental illness“, authored by Alina Yanchuk, Alina Trifan, Olga Fajarda and José Luís Oliveira.

This work developed a methodology for the automatic classification of stigmatizing mental illness articles, present in Portuguese online news newspapers, using Artificial Intelligence.

For example, a news article that uses the term schizophrenia associated with a news article about political life is classified as stigmatizing. Using automated processes, this work allows to identify thousands of news items and draw the attention of the media and society to the stigmatization of mental illnesses.

3rd place – “Arquivo Público”

The 3rd place winner received a prize of 2 000 euros and was awarded to the work “Arquivo Público”, developed by Diogo Correia and Ricardo Campos.

“Arquivo Público” is a web application focused on the contents published on the Público newspaper website over time and preserved by Arquivo.pt.

As a result, we have a web interface that allows the visualization of archived news about a specific subject and also the representation of the number of news, most frequent terms and geographical reference.

Honorable Mention granted by Público newspaper

The Público newspaper, official partner of the 5th edition of the Arquivo.pt Award, granted an Honorable Mention to the work “Arquivo Público”, carried out by Diogo Correia and Ricardo Campos.

Dissemination materials

Press

Participation of Arquivo.pt in the meetings of the International Internet Preservation Consortium

thumbnail_GA_WAC2022

Last updated on July 29th, 2022 at 12:33 pm

IIPC Web Archiving Conference

The International Internet Preservation Consortium (IIPC), a consortium that brings together Web preservation initiatives from around the world, held its General Assembly with its members between May 17 and 19, 2022.

The following week, between May 24 and 25, held the IIPC Web Archiving Conference (IIPC WAC), online as in the previous year due to the contingencies of the Covid-19 pandemic.

The 2022 edition of these two events was hosted by the Library of Congress.

Arquivo.pt resources and initiatives presented at the IIPC WAC 2022

The IIPC Web Archiving Conference is an initiative open to the community, where people or entities interested in the Web preservation domain may participate.

The Arquivo.pt contributed to the Ligthtning Talks sessions (session 5 and session 13).

The Arquivo.pt presentations focused on the resources and initiatives that this service has lately developed for the community.

Cultural heritage on the Web: the online presence of museums

Last updated on July 7th, 2022 at 09:26 pm

The Portuguese Museums Network was the community invited to participate in the cycle of three webinars entitled “Cultural Heritage on the Web: online presence of museums”.

The aim is to raise awareness among museum managers and professionals about the importance of preserving content published on the Web and to make known the services and tools of Arquivo.pt.

This initiative is promoted by the Direção Geral do Património Cultural, through the Departamento de Museus, Conservação e Credenciação and Divisão de Museus e Credenciação, which welcomed and integrated in its training offer the proposal of Arquivo.pt (FCT, I.P.) .

Information and materials

June 21st, 2022 – The Arquivo.pt and the preservation of digital memory (1st webinar)

In this session Arquivo.pt is presented as a useful service to museums and institutions that the community can count on to preserve digital cultural heritage, specifically Web content.

  • Speaker: Ricardo Basílio, digital curator (in substitution of Daniel Gomes, manager of Arquivo.pt)
  • Duration: 15h30 -17h00
  • Slides (PDF)
  • Video

June 22, 2022 – Publishing Well to Preserve Well (2nd Webinar)

This session deals with the aspects that an institution must take into account to create and maintain preservable websites.

  • Speaker: Pedro Gomes, responsible for the Arquivo.pt collections
  • Duration: 15h30 -17h00
  • Slides
  • Vídeo

June 27, 2022 – Archiving the Web: DIY (3rd Webinar)

This session offers a tutorial for creating a local web archive, recording contentes in a standard format and using open tools that any person can use.

  • Speaker: Ricardo Basílio, digital curator
  • Duration: 15h30 -17h00
  • Vídeo
  • Slides

June 28, 2022 – Repeat of the first session (extra session)

Open session for those who were not able to participate in the 1st session.

  • Speaker: Ricardo Basílio, digital curator
  • Duration: 15h30 -17h00
  • Video
  • Slides

Online exhibition: discover museums’ online presence over time

 

Municipality of Sines and Arquivo.pt together on the International Archives Day

thumbnail-sines-dia-internacional-dos-arquivos

Last updated on June 27th, 2022 at 08:40 am

The Municipal Archive of the Municipality of Sines and Arquivo.pt celebrated the International Archives Day, June 9, at the Salão Nobre dos Paços do Concelho, with a Workshop on preserving the digital memory of Sines (Portugal).

The meeting was broadcast online with the aim of sharing with the community of archivists what has been an experience of collaborative curation of Web content.

Collaboration between a municipal archive and a web archive

This meeting took place in the continuity of a collaboration between the two teams developed during the pandemic period.

The Arquivo Municipal de Sines made a selective and systematic collection of Web content related to the Municipality of Sines, with the collaboration of local media, such as Rádio Miróbriga and Rádio Sines.

In turn, Arquivo.pt contributed with training on tools, like Webrecorder.net, that records in standardized format and prepared useful services, such as SavePageNow that allows to record pages on the fly directly on Arquivo.pt.

Local history is better with preserved Web pages

From this collaboration resulted the preservation of thousands of Web pages (about 200 Gigabytes of information) about the experience of the pandemic in the geographical area of Sines and Santiago do Cacém.

The copies of the Web Archive Files (WARCs) sent to Arquivo.pt have been integrated to become available.

Presentations

Cryptocurrencies and web curation on the 15th anniversary in Viseu

Last updated on June 24th, 2022 at 08:37 pm

Session of Arquivo.pt at the Jornadas 2022

Arquivo.pt was at the annual meeting Jornadas de Computação Científica 2022, held from May 31st to June 2nd, at the Instituto Politécnico de Viseu.

Cryptocurrencies and web curation were the starting point for sharing the news of the service and talking about the work developed since the last edition of the Jornadas.

Zapping session remembered the 15 years of Arquivo.pt

Arquivo.pt was created in 2007 with the goal of collecting the Portuguese Web. After fifteen years it continues its mission, collecting, but mainly facilitating the access to preserved contents, both for the researcher and the common citizen.

In the Zapping session at the conference, in which each FCCN service presented its services, the Arquivo.pt was highlighted for its long-standing activity in Web preservation.

Training with the Library of the Escola Superior de Tecnologia e Gestão

The Arquivo.pt team was in the Library of the School of Technology and Management (ESTGV) in an extra session of the conference dedicated to digital preservation, mainly to institutional content published on the Web.

The training was promoted by the Library team, especially Dr. Rosa Silva, Coordinator of the service, and had the participation of the community. Besides the presentations, there was an opportunity to share ideas and point out future collaborations.

Paulo Medeiros, responsible for the service of Culture, Communication and Documentation, presented the institutional channels of the Instituto Politécnico de Viseu. These channels are increasingly present on the Web, such as the magazine Polistécnica that went digital in 2012, the scientific journal Millenium and the video channel Politécnico TV.

Arquivo.pt showed how any person or institution can have their Web contents preserved in an adequate format. To save contents directly on Arquivo.pt you can use the new SavePageNow recording service. To make a local Web archive you can use ArchiveWeb.page – Webrecorder.net.

Arquivo.pt APIs presented to Internet technologies students

The Arquivo team was in the classroom, thanks to the excellent welcome given by Prof. Dr. Valter Alves, director of the Design and Multimedia Technology course. Vasco Rato, Web developer of the Arquivo.pt, presented the APIs of the Arquivo.pt (Applications Programming Interfaces) for the automatic processing of preserved information.

By using the APIs of Arquivo.pt the students can make assignments for the technology subjects and compete to the Arquivo.pt Award.

Image gallery

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV

Web pages for the history of the Instituto Politécnico de Viseu

In 2018, the library team developed a project with the participation of young students that resulted in a documentary short film where memories of old web pages, preserved by Arquivo.pt, were included.

Put an end to “page not found” on your website

thumbnail- erro404-en-

Last updated on August 17th, 2022 at 09:09 am

Does your website present “Error 404 – Page not found” messages to your users?

Arquivo.pt offers a solution for this problem through arquivo404.

Just insert a single line of code in the page that generates the 404 error message on your website.

How Arquivo404 works

example-fccn-arquivo404_

When a page is no longer on a website, Arquivo404 checks if a preserved version exists.

When a user tries to access a page that is no longer available on a website, arquivo404 automatically checks if there is a version of that page preserved in Arquivo.pt.

If the page exists in Arquivo.pt, a link is presented so that the user may visit this version. If it does not exist, the normal error page is displayed.

See Arquivo404 at work in this example of an error page that presents a link automatically generated by arquivo404.

How to install file404 on your website?

The simplest implementation of file404 is to insert the following line of Javascript code on the 404 error page:

<script type="text/javascript" src="https://arquivo.pt/arquivo404.js" async defer onload="ARQUIVO_NOT_FOUND_404.call();"></script>

The code in file404 can easily be adapted. You can for example create a customised error message.

To know more

SavePageNow to record webpages immediately on Arquivo.pt

thumb_savepagenow

Last updated on August 23rd, 2022 at 11:51 am

Arquivo.pt launched a new version, called Francisco, on the 19th of January 2022.

The SavePageNow function stands out, allowing anyone to save a Web page to be preserved by Arquivo.pt.  It is only necessary to enter a page’s address and browse through its contents.

Arquivo.pt SavePageNow was inspired on the Internet Archive Save Page Now and implemented using webrecorder pywb.

For example, a publication on the FCCN blog marking the 30th anniversary of the Internet in Portugal was saved with SavePageNow and preserved at Arquivo.pt. This way, anyone using SavePageNow is contributing to the contents published on the Internet not being lost.

 

Help us to improve!

The user interfaces have been recoded to be optimized, so we need your help to test them in different devices of various brands (e.g. mobile phones, tablets, laptops).

If you detect any problems, please contact us!

Remember to always send the address of the page where you detected the problem.

To Know more

 

How to preserve Web references from Wikipedia?

thumbnail-wikimedia

Last updated on May 19th, 2022 at 07:05 pm

Wikimedia Portugal has started a collaboration with Arquivo.pt that aims at raising the community’s attention to the preservation of contents published on Wikipedia.

Eighty percent of the pages published on the Web disappear or are changed, just one year after their publication. At the same time, the information in Wikipedia is based on information mostly published on the Web. The disappearance of reference information undermines the reliability of Wikipedia articles.

Webinar cycle “Cultural Heritage on the Web: how to preserve references in Wikipedia?”

The cycle of Webinars, promoted by Wikimedia Portugal, includes educational content that enriches the training of information and communication professionals but also the digital literacy of any citizen.

Arquivo.pt and the preservation of digital memory (1st Webinar)

Gonçalo Themudo, President of Wikimedia Portugal, introduced the 1st webinar of the cycle entitled Cultural heritage on the Web: how to preserve references in Wikipedia?. He stressed the importance of preserving the references (URLs) used by authors when publishing articles in Wikipedia. Daniel Gomes, Manager of Arquivo.pt, showed how Arquivo.pt preserves Web contents and how the community of Wikipedia authors can contribute to the effective preservation of those contents.

  • Held on February 22, 2022
  • Speaker: Daniel Gomes, Arquivo.pt
  • Slides
  • Video

Automatic access and processing of preserved information from the Web through APIs (2nd Webinar)

Webinar that presents the Archive.pt’s APIs (Application Programming Interface) that enable the automatic processing of historical information preserved from the Web, in order to develop innovative and useful applications for organizations. This Webinar is mainly intended for IT professionals (e.g. Web developers, Web designers, Web marketers).

  • Date: 22 Mar. 2022 15:00 – 16:30
  • Speaker: Vasco Rato, Arquivo.pt
  • Slides
  • Video

Web archiving: do it yourself! (3rd Webinar)

Webinar that presents how to preserve cultural information of a municipal and national nature published on the Web. It demonstrates through practical cases how anyone can archive information published on the web in a proper format that will allow its preservation for the future using free tools. This Webinar is intended for any Internet user but is particularly useful for those responsible for communication and information management in organisations.

  • Date: 19 Abr. 2022 15:00 – 16:30
  • Speaker: Daniel Gomes, Arquivo.pt
  • Slides
  • Video

On line Cafe with Arquivo.pt continues

Last updated on August 17th, 2022 at 09:27 am

banner-cafe-com-o-arquivo-pt

Share this page: arquivo.pt/onlinecafe

Welcome to the third season of the Online Cafe with Arquivo.pt

Talk directly to the Arquivo.pt team and get answers to all your questions! The Arquivo.pt launched a new cycle of team chats with you through online sessions. Brief introductory presentations will be given, leaving time to ask all your questions about how to get more out of Arquivo.pt or how to apply to the Arquivo.pt Awards.

Sessions

February 17, 2022 – Primeiras páginas de jornais online portugueses

Primeiras páginas de jornais online portugueses” (Front pages of Portuguese online newspapers) presents an interactive graphical analysis of the front pages of Portuguese online newspapers. For this study, specific items within the newspaper design were analysed, thus allowing trends to be observed over time.

Susana Parreira, explains how she developed this work as part of her Masters, with the collaboration and guidance of Ana Boavida (Universidade de Coimbra) Ana Sabino (Instituto Politécnico de Castelo Branco) and Penousal Machado (Universidade de Coimbra).

22nd session –  January 20, 2022 – Politiquices

Politiquices.pt, allows to research support or opposition relations between political personalities and parties expressed in news headlines. This application uses information preserved in Arquivo.pt to create an ontology of relations. It uses Natural Language Processing technology. David Batista, 2nd place of Arquivo.pt Awards 2021, will explain how he developed his work and demonstrate the applications for researchers and citizens in general.

Special session – World Digital Preservation Day 2021 – Major minors project – november 5

In November, World Digital Preservation Day is broadly celebrated and, to mark this international initiative, Arquivo.pt held an online session open to the community. Special guests of this session were the winners of the Arquivo.pt Award 2021, Leandro Costa, Paulo Martins and José Carlos Ramalho.

Previous seasons

Presentation at the IIPC Web Archiving Conference 2022