Arquivo.pt certified as an open data provider

selo-dados-gov

Last updated on September 16th, 2021 at 09:45 am

Arquivo.pt has been collaborating with Agência Modernização Administrativa (AMA) with the aim of improving the preservation of Public Administration websites.

Collaboration is based on three action points:

AMA is the public organisation responsible for promoting digital means in Public Administration and aims to modernise and simplify citizens’ access to State services.

Arquivo.pt is a service operated by the Fundação para a Ciência e Tecnologia I.P. that preserves data published on the Web between 1996 and the present day, making them accessible to any citizen for memory and research purposes.

EU open data directive includes documents on websites

The Directive (EU) 2019/1024 of the European Parliament and of the Council of 20 June 2019 on open data and the re-use of public sector information stipulates the following:

“(30) This Directive lays down the definition of the term ‘document’ and that definition should include any part of a document. The term ‘document’ should cover any representation of acts, facts or information — and any compilation of such acts, facts or information — whatever its medium (paper, or electronic form or as a sound, visual or audiovisual recording.

(34) To facilitate re-use, public sector bodies should, where possible and appropriate, make documents, including those published on websites, available through an open and machine-readable format and together with their metadata, at the best level of precision and granularity, in a format that ensures interoperability

(35) A document should be considered to be in a machine-readable format if it is in a file format that is structured in such a way that software applications can easily identify, recognise and extract specific data from it. Data encoded in files that are structured in a machine-readable format should be considered to be machine-readable data. A machine-readable format can be open or proprietary. They can be formal standards or not.

(60) The Commission should facilitate the cooperation among Member States and support the design, testing, implementation and deployment of interoperable electronic interfaces that enable more efficient and secure public services.

Arquivo.pt is a public service that has the mission of preserving documents published on Internet sites to enable their long-term open access and provides interoperable electronic interfaces (APIs) for their automatic processing.

The Portuguese Law No. 68/2021 of 2021-08-26 approves the general principles on open data and transposes the European Directive.

Arquivo.pt was certified as a Public Administration open data provider

The AMA recognized Arquivo.pt as a public service and open data provider and awarded its certification seal on the Open Data Portal.

Arquivo.pt collects general information published on the Web of interest to the Portuguese community. However, it is also responsible for the preservation of Public Administration websites, such as the Portal do Governo, in collaboration with the Management Center for the Government Electronic Network (CEGER).

Any citizen can access the open data resulting from these historical archives and, for example, search for official information published on the websites of successive governments.

In 2021, Arquivo.pt provided open access to over 10 billion files (721 TB) from 27 million websites. The open data preserved by Arquivo.pt can be explored through the search interface, automatically through API (https://arquivo.pt/api) or by reusing derived datasets.

Derived datasets available on the Open Data Portal

Besides the original web artefacts preserved at Arquivo.pt, this service has generated open datasets derived from its activities, which are now available in open access so that they can be reused:

Resources list

“Art Forever on the Web”: Cycle of Webinars

composicao sobre Colectiva de Artistas 2008 Quadrado Azul

Last updated on July 6th, 2021 at 01:23 pm

composicao sobre Colectiva de Artistas 2008 Quadrado Azul

Colectiva de Artistas. 2008.04.19 a 2008.06.07. Galeria Quadrado Azul. Porto. Composition from a Webpage preserved on Arquivo.pt: www.quadradoazul.pt, 22nd October 2008.

On April 29, May 27 and July 1, from 3 to 4:30 pm, webinars geared to the community of artists, curators, gallerists and event producers will be held, open also to anyone interested in learning more about preserving art websites.

Throughout the sessions, participants will learn in detail about the functionalities of Arquivo.pt in order to take advantage of this public Web preservation service. They will have technical information, in the form of recommendations and best practices, to create preservable websites. Finally, they will learn how to use available tools to save their websites in a standardized format so that their contents are not lost.

This cycle of Webinars is an initiative of the “Forever” Project, a collaboration between the Calouste Gulbenkian Foundation Art Library and Arquivo.pt under the ROSSIO infrastructure.

For more details and sharing, please see the program (PDF) (in Portuguese).

Sign up!

April 29 – The Arquivo.pt and the preservation of digital memory
May 27 – Recommendations for creating preservable websites for the future
July 1 – Archiving the Web: do-it-yourself!

Held sessions presentations

Online archives or archives of the online?

thumbnail_tendencias

At the end of 2020, we recommend some texts that put the future in perspective.

We highlight the theme of preserving online content presented in the ebook “Tendências 2021” (Trends 2021). The contribution of Daniel Gomes, the Arquivo.pt manager, was entitled “Arquivos online ou do online?” (Online archives or archives of the online?).

I was invited to write about the challenges and threats to online archives. The first question that came to me was what is meant by an “online archive”?

My concern lies in the “archives of the online” because there is not even an established awareness about their need, whether at an academic, governmental or individual level.

It is technologically impossible to preserve all information available online. But it is absurd not to be aware that we have to preserve some of the information online for short, medium and long term access.

The complete text (in Portuguese) is available at pages 23 to 26 of the open-access book “Tendências 2021”.

The challenge is to cultivate awareness about the importance of preserving content online by learning how to do it in practice.

Happy New Year!

Arquivo.pt training in Azores islands

Daniel Gomes in Azores islands

Memorial (hight quality preservation) and image search were highlighted as new developments in Arquivo.pt during Jornadas de Computação Científica 2019, held from 6 to 8 at the University of Azores in Ponta Delgada.

On the first day of this annual event, Arquivo.pt developed a training session in 4 parts:

Participants learned about the web preservation service offered to the community by the Arquivo.pt that for the purpose of researching and safeguarding the digital heritage, and how they can help preserve the Web.

In addition to the Jornadas 2019, the Arquivo.pt team also made two presentations in class context. The first was to students of the Informatics – Networks and Multimedia course at the University of Azores, and the second at the Escola Secundária das Laranjeiras (high school) in Ponta Delgada.

To schedule a training session with Arquivo.pt, contact us.

Jornadas 2019

Daniel Gomes in Azores islands
Universidade dos Açores
Jornadas 2019
Jornadas 2019 Melo
Jornadas 2019 Daniel Bicho
Açores
Aula Universidade dos Açores
Açores Escola das Laranjeiras
Daniel Gomes in Azores islands Universidade dos Açores Jornadas 2019 Jornadas 2019 Melo Jornadas 2019 Daniel Bicho Açores Aula Universidade dos Açores Açores Escola das Laranjeiras

 

Webinar Web Archiving in academic libraries

Preservation- workflow

Last updated on April 3rd, 2019 at 10:51 am

Web archiving process

“Curation of preserved websites – how it works” was the subject of the webinar promoted by  Associação Portuguesa de Bibliotecários, Arquivistas e Documentalistas (APBAD, Lisbon), the Portuguese association of librarians, past October 9, and presented by Ricardo Basílio, librarian and digital curator at Arquivo.pt.

Gathering the online memory of the Universities

The hands-on presentation showed how anyone, even a non-TI expert, can adequately capture, store and replay a website or a social page of an institutional website. Basílio also gave specific examples on how to gather and share collections of institutional contents previously published on the Web: a list, an exhibition, a recovery of a past content to be published on Twitter or Facebook, etc.

A librarian can be a curator of websites

Human and qualitative evaluation is the focus of the digital curator, even when we use such a proficient tool like Webrecorder. The most important point is to enable librarians to practice micro-archiving and create local collections.

Video (40 minutes, in Portuguese)
Presentation (PDF, in Portuguese)