Afghanistan Websites and the fall of the regime in August 2021

thumbnail_Karima Faryabi

Last updated on September 26th, 2022 at 03:57 pm

afghan-ministry-of-economy-17-08-2021

Afghanistan Ministry of Economy website with Karima Faryabi (recorded August 17, 2021)

On August 15, 2021 the presidential palace in Kabul was taken over by the Taliban, consummating the fall of the regime that had been in place for 20 years, following the 9/11 attacks on the United States.

The community of Web archivists, through the Content Development Working Group – International Internet Preservation Consortium, was challenged to record the Afghan sites, given the risk that they would disappear with the new regime.

No time to lose when it comes to preserving the Web

Arquivo.pt reacted quickly, launching an automatic content search focused on .af domain sites and on international media news about the ongoing events.

On August 17, the websites began to be recorded.

1800 website addresses from Afghanistan (ending in .af) and 500 media news stories from around the world were used.

The addresses, URLs or “seeds” were obtained through automated search using the Bing Search API and immediately put into recording.

Content available to know Afghanistan’s history

As a result of the collection carried out, more than 400 Gigabytes of information became available at Arquivo.pt, which anyone can use for research in the most diverse areas.

The main contribution of Arquivo.pt to the community of Web archivists was the use of the automatic search that allows a quick reaction in the recording of Web contents in imminent risk of being lost.

Know more

Arquivo.pt open data set (Dados.gov)

Content collected by the Content Development Working Group of the International Internet Preservation Consortium available at the Archive-it service

Tutorial: how to explore Arquivo.pt using Python

Last updated on July 17th, 2023 at 01:44 pm

The Programming Historian aims to develop digital skills among the Humanities researchers through the publication of practical lessons in several languages.

The call Computational analysis skills for large-scale humanities data originated 7 new lessons.

One of them was the tutorial “Timeline summarization for large-scale past-web events with Python: the case of Arquivo.pt” developed by Daniel Gomes and Ricardo Campos.

It shows how to explore Arquivo.pt user interface and the Application Programming Interface (API) to execute advanced queries, process large amount of data or build new services, such as Tell me stories.

All the developed resources are freely available in open-access.

Open-access resources of the tutorial “Timeline summarization for large-scale past-web events with Python: the case of Arquivo.pt”

 

 

Open dataset about cryptocurrency

Criptomoedas gráfico

Last updated on August 17th, 2022 at 09:19 am

(Photo: QuoteInspector)

Since 2008 the cryptocurrency market has revolutionised the world by innovating and expanding into other areas (e.g., finance and art). However, with this rapid expansion, many projects are created every day, giving rise to a wide and varied range of websites, technologies and scams. Markets follow financing stages and it is during an initial stage of euphoria that more projects are created.

We believe that as the cryptocurrency market  stabilises, projects/websites are disappearing because funding diminishes or runs out.

Arquivo.pt initiated a new web archive collection that preserves web content that documents Cryptocurrency activities.

This work produced a new open dataset with information documenting each cryptocurrency project, including it is original URLs and links to the corresponding web-archived version in Arquivo.pt. The information sources selected to create this dataset were:

We believe that by creating this new dataset related to cryptocurrencies and by preserving all the corresponding web content, it has the potential to originate innovative scientific contributions in several areas such as Economy or Digital Humanities.

Resources

Researchers who want to carry out studies on the Cryptocurrencies dataset and need earlier access to the collected contents can contact Arquivo.pt.

Presentation at the IIPC Web Archiving Conference 2022

Meet the winners of the Arquivo.pt Award 2022!

thumbnail-award-arquivo.pt 2022

Last updated on April 28th, 2023 at 03:41 pm

The winners of the Arquivo.pt Award 2022 were announced by the Público newspaper on 22th July 2022, the official communication partner of this edition, which awarded an honorable mention to the best work based on its historical web content.

22 applications were received.

The award ceremony took place during the Commemorative Session of the World Science Day: the excellence of research in Portugal, on November 24th, at the Teatro Thalia, in Lisbon.

1st place – “Arquivo do Parlamento”

The winner of the 10 000 euro prize was the work “Parliamentary Archive” developed by Tiago Santos.

“Parliament Archive” is a web application that aggregates news and opinion articles extracted from Arquivo.pt based on parlamento.pt open data.

For example, a user can search on a political personality and get speeches, news and other publications that Arquivo.pt has preserved.

2nd place – “Classificação automática de artigos estigmatizantes de doenças mentais”

The 2nd prize of 3 000 euros was awarded to the work “Automatic classification of stigmatizing articles of mental illness“, authored by Alina Yanchuk, Alina Trifan, Olga Fajarda and José Luís Oliveira.

This work developed a methodology for the automatic classification of stigmatizing mental illness articles, present in Portuguese online news newspapers, using Artificial Intelligence.

For example, a news article that uses the term schizophrenia associated with a news article about political life is classified as stigmatizing. Using automated processes, this work allows to identify thousands of news items and draw the attention of the media and society to the stigmatization of mental illnesses.

3rd place – “Arquivo Público”

The 3rd place winner received a prize of 2 000 euros and was awarded to the work “Arquivo Público”, developed by Diogo Correia and Ricardo Campos.

“Arquivo Público” is a web application focused on the contents published on the Público newspaper website over time and preserved by Arquivo.pt.

As a result, we have a web interface that allows the visualization of archived news about a specific subject and also the representation of the number of news, most frequent terms and geographical reference.

Honorable Mention granted by Público newspaper

The Público newspaper, official partner of the 5th edition of the Arquivo.pt Award, granted an Honorable Mention to the work “Arquivo Público”, carried out by Diogo Correia and Ricardo Campos.

Photos of the award cerimony

The award ceremony took place during the commemorative session of the National Day of Scientific Culture, on November 24th 2022, at the Teatro Thalia, in Lisbon.

The awards were presented by the Minister of Science, Technology and Higher Education, Elvira Fortunato, the President of the Board of Directors of FCT, Madalena Alves, and the representative of the media partner, the science editor of Público newspaper, Teresa Firmino.

Image Gallery

Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022
Ceriminónia de entrega dos prémios 2022 Ceriminónia de entrega dos prémios 2022 Ceriminónia de entrega dos prémios 2022 Ceriminónia de entrega dos prémios 2022 Ceriminónia de entrega dos prémios 2022 Ceriminónia de entrega dos prémios 2022 Ceriminónia de entrega dos prémios 2022

Créditos das fotos: Pedro Ferreira – FCT | FCCN | Arquivo.pt

Video of the cerimony

Flash interview videos

Dissemination materials

Press

Short-link to this page: arquivo.pt/winners2022

Participation of Arquivo.pt in the meetings of the International Internet Preservation Consortium

thumbnail_GA_WAC2022

Last updated on August 1st, 2023 at 05:37 pm

IIPC Web Archiving Conference

The International Internet Preservation Consortium (IIPC), a consortium that brings together Web preservation initiatives from around the world, held its General Assembly with its members between May 17 and 19, 2022.

The following week, between May 24 and 25, held the IIPC Web Archiving Conference (IIPC WAC), online as in the previous year due to the contingencies of the Covid-19 pandemic.

The 2022 edition of these two events was hosted by the Library of Congress.

Arquivo.pt resources and initiatives presented at the IIPC WAC 2022

The IIPC Web Archiving Conference is an initiative open to the community, where people or entities interested in the Web preservation domain may participate.

The Arquivo.pt contributed to the Ligthtning Talks sessions (session 5 and session 13).

The Arquivo.pt presentations focused on the resources and initiatives that this service has lately developed for the community.

Cultural heritage on the Web: the online presence of museums

Last updated on July 7th, 2022 at 09:26 pm

The Portuguese Museums Network was the community invited to participate in the cycle of three webinars entitled “Cultural Heritage on the Web: online presence of museums”.

The aim is to raise awareness among museum managers and professionals about the importance of preserving content published on the Web and to make known the services and tools of Arquivo.pt.

This initiative is promoted by the Direção Geral do Património Cultural, through the Departamento de Museus, Conservação e Credenciação and Divisão de Museus e Credenciação, which welcomed and integrated in its training offer the proposal of Arquivo.pt (FCT, I.P.) .

Information and materials

June 21st, 2022 – The Arquivo.pt and the preservation of digital memory (1st webinar)

In this session Arquivo.pt is presented as a useful service to museums and institutions that the community can count on to preserve digital cultural heritage, specifically Web content.

  • Speaker: Ricardo Basílio, digital curator (in substitution of Daniel Gomes, manager of Arquivo.pt)
  • Duration: 15h30 -17h00
  • Slides (PDF)
  • Video

June 22, 2022 – Publishing Well to Preserve Well (2nd Webinar)

This session deals with the aspects that an institution must take into account to create and maintain preservable websites.

  • Speaker: Pedro Gomes, responsible for the Arquivo.pt collections
  • Duration: 15h30 -17h00
  • Slides
  • Vídeo

June 27, 2022 – Archiving the Web: DIY (3rd Webinar)

This session offers a tutorial for creating a local web archive, recording contentes in a standard format and using open tools that any person can use.

  • Speaker: Ricardo Basílio, digital curator
  • Duration: 15h30 -17h00
  • Vídeo
  • Slides

June 28, 2022 – Repeat of the first session (extra session)

Open session for those who were not able to participate in the 1st session.

  • Speaker: Ricardo Basílio, digital curator
  • Duration: 15h30 -17h00
  • Video
  • Slides

Online exhibition: discover museums’ online presence over time

 

Municipality of Sines and Arquivo.pt together on the International Archives Day

thumbnail-sines-dia-internacional-dos-arquivos

Last updated on June 27th, 2022 at 08:40 am

The Municipal Archive of the Municipality of Sines and Arquivo.pt celebrated the International Archives Day, June 9, at the Salão Nobre dos Paços do Concelho, with a Workshop on preserving the digital memory of Sines (Portugal).

The meeting was broadcast online with the aim of sharing with the community of archivists what has been an experience of collaborative curation of Web content.

Collaboration between a municipal archive and a web archive

This meeting took place in the continuity of a collaboration between the two teams developed during the pandemic period.

The Arquivo Municipal de Sines made a selective and systematic collection of Web content related to the Municipality of Sines, with the collaboration of local media, such as Rádio Miróbriga and Rádio Sines.

In turn, Arquivo.pt contributed with training on tools, like Webrecorder.net, that records in standardized format and prepared useful services, such as SavePageNow that allows to record pages on the fly directly on Arquivo.pt.

Local history is better with preserved Web pages

From this collaboration resulted the preservation of thousands of Web pages (about 200 Gigabytes of information) about the experience of the pandemic in the geographical area of Sines and Santiago do Cacém.

The copies of the Web Archive Files (WARCs) sent to Arquivo.pt have been integrated to become available.

Presentations

Cryptocurrencies and web curation on the 15th anniversary in Viseu

Last updated on January 10th, 2023 at 04:27 pm

Session of Arquivo.pt at the Jornadas 2022

Arquivo.pt was at the annual meeting Jornadas de Computação Científica 2022, held from May 31st to June 2nd, at the Instituto Politécnico de Viseu.

Cryptocurrencies and web curation were the starting point for sharing the news of the service and talking about the work developed since the last edition of the Jornadas.

Zapping session remembered the 15 years of Arquivo.pt

Arquivo.pt was created in 2007 with the goal of collecting the Portuguese Web. After fifteen years it continues its mission, collecting, but mainly facilitating the access to preserved contents, both for the researcher and the common citizen.

In the Zapping session at the conference, in which each FCCN service presented its services, the Arquivo.pt was highlighted for its long-standing activity in Web preservation.

Training with the Library of the Escola Superior de Tecnologia e Gestão

The Arquivo.pt team was in the Library of the School of Technology and Management (ESTGV) in an extra session of the conference dedicated to digital preservation, mainly to institutional content published on the Web.

The training was promoted by the Library team, especially Dr. Rosa Silva, Coordinator of the service, and had the participation of the community. Besides the presentations, there was an opportunity to share ideas and point out future collaborations.

Paulo Medeiros, responsible for the service of Culture, Communication and Documentation, presented the institutional channels of the Instituto Politécnico de Viseu. These channels are increasingly present on the Web, such as the magazine Polistécnica that went digital in 2012, the scientific journal Millenium and the video channel Politécnico TV.

Arquivo.pt showed how any person or institution can have their Web contents preserved in an adequate format. To save contents directly on Arquivo.pt you can use the new SavePageNow recording service. To make a local Web archive you can use ArchiveWeb.page – Webrecorder.net.

Arquivo.pt APIs presented to Internet technologies students

The Arquivo team was in the classroom, thanks to the excellent welcome given by Prof. Dr. Valter Alves, director of the Design and Multimedia Technology course. Vasco Rato, Web developer of the Arquivo.pt, presented the APIs of the Arquivo.pt (Applications Programming Interfaces) for the automatic processing of preserved information.

By using the APIs of Arquivo.pt the students can make assignments for the technology subjects and compete to the Arquivo.pt Award.

Image gallery

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Sessão de formação na Biblioteca da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Aula no curso de Tecnologia Design e Multimédia da ESTGV

Daniel Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Pedro Gomes na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Ricardo Basílio na sessão do Arquivo.pt nas Jornadas FCCN 2022 em Viseu Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Sessão de formação na Biblioteca da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV Aula no curso de Tecnologia Design e Multimédia da ESTGV

Web pages for the history of the Instituto Politécnico de Viseu

In 2018, the library team developed a project with the participation of young students that resulted in a documentary short film where memories of old web pages, preserved by Arquivo.pt, were included.

Arquivo404 presents web-archived pages instead of “pages not found”

thumbnail- erro404-en-

Last updated on November 14th, 2023 at 02:46 pm

Does your website presents “Error 404 – Page not found” messages to your users?

Arquivo.pt offers a solution for this problem through Arquivo404.

Just insert a single line of code in the page that generates the 404 error message on your website and web-archived pages will be presented to your users instead of pages not found.

See these examples on websites that installed arquivo404.

How does Arquivo404 work?

example-fccn-pt-arquivo404-en

When a page is no longer on a website, Arquivo404 checks if a preserved version exists.

When a user tries to access a page that is no longer available on a website, Arquivo404 automatically checks if there is a version of that page preserved in Arquivo.pt.

If the page exists in Arquivo.pt, a link is presented so that the user may visit this version. If it does not exist, the normal error page is displayed.

See Arquivo404 at work in this example of an error page that presents a link automatically generated by Arquivo404.

How to install arquivo404 on your website?

The simplest implementation of arquivo404 is to insert the following Javascript on the HTML code that generates the “Page not found” message:

<script type="text/javascript" src="https://arquivo.pt/arquivo404.js" async defer onload="ARQUIVO_NOT_FOUND_404.call();"></script>

The code in Arquivo404 can easily be adapted. You can for example create a customised error message.

Hint for WordPress websites: When editing the 404 error page and inserting the arquivo404 script inside the <body>, you must put the <!– wp:html –> tag at the beginning and the <!– /wp:html –> tag at the end, otherwise the script will be deleted.

If you have any questions or issues, please contact us!

Know more

Short link to this page: arquivo.pt/arquivo404en

SavePageNow to record webpages immediately on Arquivo.pt

thumb_savepagenow

Last updated on August 23rd, 2022 at 11:51 am

Arquivo.pt launched a new version, called Francisco, on the 19th of January 2022.

The SavePageNow function stands out, allowing anyone to save a Web page to be preserved by Arquivo.pt.  It is only necessary to enter a page’s address and browse through its contents.

Arquivo.pt SavePageNow was inspired on the Internet Archive Save Page Now and implemented using webrecorder pywb.

For example, a publication on the FCCN blog marking the 30th anniversary of the Internet in Portugal was saved with SavePageNow and preserved at Arquivo.pt. This way, anyone using SavePageNow is contributing to the contents published on the Internet not being lost.

 

Help us to improve!

The user interfaces have been recoded to be optimized, so we need your help to test them in different devices of various brands (e.g. mobile phones, tablets, laptops).

If you detect any problems, please contact us!

Remember to always send the address of the page where you detected the problem.

To Know more