Arquivo.pt at Jornadas de Computação Científica 2018

Last updated on October 2nd, 2018 at 02:12 pm

The Jornadas de Computação Científica 2018 took place in April in Braga, Portugal, at the International Iberian Nanotechnology Laboratory (INL).

At the event, the Portuguese Secretary of State for Science, Technology and Higher Education, Maria Fernanda Rollo, spoke about the Digital Future.

The Arquivo.pt team and some guests presented new services and applications.

Plenary session

Presentations about Arquivo.pt

New API to search Arquivo.pt

Last updated on October 1st, 2018 at 04:42 pm

On February 23, 2018, Arquivo.pt launched a new version called Zeus.

This new version includes a new programming interface (Arquivo.pt API) and improvements in the mobile interfaces.

Arquivo.pt API: the official API

The new application programming interface, called Arquivo.pt API, allows you to automatically search and access the preserved information. Its purpose is to aggregate the functions and information that were previously provided through several separate APIs. Searches can be made by URL or by terms, and a JSON object with the response items is returned.
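As a minimal sketch of a term search, the Python code below builds a request and reads the result list. It assumes the text search endpoint is https://arquivo.pt/textsearch, that it accepts a q parameter for the terms (or URL) and a maxItems parameter for the result limit, and that the returned JSON object lists its results under a response_items key with title and linkToArchive fields. None of these names are stated in this post, so check them against the API documentation before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint; verify against the official Arquivo.pt API documentation.
TEXTSEARCH_ENDPOINT = "https://arquivo.pt/textsearch"


def build_search_url(query, max_items=5):
    """Build a search URL; `query` may be search terms or a URL."""
    # Parameter names "q" and "maxItems" are assumptions.
    params = {"q": query, "maxItems": max_items}
    return TEXTSEARCH_ENDPOINT + "?" + urlencode(params)


def extract_items(payload):
    """Return (title, archived link) pairs from a decoded JSON response."""
    # Field names "response_items", "title", "linkToArchive" are assumptions.
    items = payload.get("response_items", [])
    return [(item.get("title"), item.get("linkToArchive")) for item in items]


def search(query, max_items=5):
    """Run the search over the network and return the extracted pairs."""
    with urlopen(build_search_url(query, max_items)) as response:
        payload = json.load(response)
    return extract_items(payload)
```

Calling search("local elections 2017") would then return a short list of titles with links to the preserved versions, provided the assumptions above hold.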

This new API can be useful to compete for the Arquivo.pt Prizes.

To know more

We preserved the Portuguese Local Elections of 2017

Last updated on January 23rd, 2018 at 10:15 am

Arquivo.pt performed 2 web crawls of information related to the Portuguese Local Elections of 2017.

We appealed to the community to suggest relevant web pages so that we could preserve them.

The 2 crawls occurred during and after the campaign period, using the list of 410 Web pages suggested by the community and 13 887 web pages found automatically using search engines.

The result was an archive of 2 265 887 Web resources (360 GB).

Among the preserved web pages are the official sites of the candidates, news, blogs and articles with personal opinions about the elections.

Arquivo.pt respects an embargo period of 1 year, and for that reason this collection will only become available by the end of 2018.

Meanwhile, you can consult the preserved pages about the previous elections of 2013, such as:

We would like to thank all the volunteers that collaborated with this initiative.

Arquivo.pt goes mobile!


On November 28, 2017, Arquivo.pt launched a new version named Afrodite.

The main novelty is the adaptation of user interfaces to mobile devices.

It also enables access to the mobile versions of the preserved sites.

Arquivo.pt began to preserve the mobile web too!

Mobile version of the homepage of Arquivo.pt.

It is now easier to use Arquivo.pt everywhere

Using your mobile phone, try searching for all of the versions that Arquivo.pt preserved from the website of the organization where you worked or studied.

List of preserved versions of a site.

Mobile version of textual search.

Mobile versions of preserved sites can also be accessed.

And more…

Improvements were also made to the desktop user interfaces, including a new responsive footer and a new language selection bar.

The alpha version of a new API was also published to improve the automatic access to the information preserved by Arquivo.pt.

To know more, see the technical details of the new version of Arquivo.pt.

It is very important that you help us test this new version!

There are numerous models of mobile devices and we cannot test them all.

We would appreciate it if you could spend 1 minute testing this new version using your phone or tablet.

If you notice any problems, we kindly ask you to let us know. Please also send us the model of your device and the browser you used.

Your contribution is very valuable to guarantee the quality of the service provided to the community.

New informative site of Arquivo.pt!

New sobre.arquivo.pt website

On August 2, 2017, a new version of the informative website about Arquivo.pt (available at sobre.arquivo.pt/en/) was launched.

This new version consisted mainly of adopting the WordPress platform as a replacement for an obsolete version of Plone.

The following improvements are highlighted:

If you notice any problems or wish to make any suggestions for improvement, please contact us!

Your opinion is valuable.

Investiga XXI research project has started

Last updated on September 25th, 2017 at 04:21 pm

The Investiga XXI research project is publicly sharing the first contributions based on Arquivo.pt.

The main goal is to promote the use of Arquivo.pt as a source and tool for scientific research.

The ongoing projects adopt perspectives from the Digital Humanities in their approach to the different objects of study:

To know more:

If you want to collaborate, contact us.

Arquivo.pt improves advanced search

On June 23, 2017, Arquivo.pt launched a new version called Venus, in which the Advanced Search operators were improved.

A notable improvement is the site search operator, which restricts searches to specific preserved sites. For example:
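A restricted query of this kind might be built as in the Python sketch below. It assumes the operator is written as site: followed by a domain, which this text does not confirm, and the helper name restrict_to_site is purely illustrative.

```python
def restrict_to_site(terms, site):
    """Append a site restriction to a search expression.

    Assumes the operator syntax `site:<domain>`; adjust it if the
    Advanced Search page documents a different form.
    """
    domain = site.strip()
    # Drop a leading scheme so only the host (and any path) remains.
    for prefix in ("https://", "http://"):
        if domain.startswith(prefix):
            domain = domain[len(prefix):]
    # Drop trailing slashes before attaching the operator.
    return f"{terms.strip()} site:{domain.rstrip('/')}"
```

The resulting string can be typed into the search box or passed as the search terms of an automated query.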

The reproduction of preserved pages containing error messages was also reduced.

Know More

Arquivo.pt preserved websites about Research & Development projects funded by the EU


Last updated on July 27th, 2021 at 04:07 pm

Arquivo.pt automatically identified R&D project websites to preserve their content. It preserved 52 million web files (7 TB) related to science for future access.

R&D websites publish valuable information but are being lost

Websites about Research and Development (R&D) projects are increasingly being used to publish important scientific information that complements published literature (e.g. data sets, documentation or software). However, after projects end, the corresponding websites usually disappear, causing a permanent loss of unique and valuable scientific information.
Percentage of project URLs from the EU Open Data Portal that referenced relevant content in November 2015 distributed per work programme since FP4 (1994).

Online information related to R&D projects is not being fully documented. For example, information about the URLs of projects funded by the 7th Framework Program (FP7) available at the European Union’s Open Data Portal is missing for 92% of the projects.

Arquivo.pt automatically identified URLs related to Research and Development projects

The main objective of Arquivo.pt is to preserve online information for scientific and academic purposes. Therefore, it developed a pragmatic and low-cost process that automatically identifies URLs related to R&D projects to be systematically preserved. Automatic identification is achieved through the combination of open data sets with free search services. This work is detailed in an article published at the International Conference on Digital Preservation 2016.
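The general idea of combining an open data set with a search service can be pictured with the Python sketch below. It is not the process described in the article: the record fields (acronym, title), the search callable and the host-name heuristic are all hypothetical stand-ins chosen for illustration.

```python
from urllib.parse import urlparse


def identify_project_urls(projects, search, max_results=5):
    """Map each project acronym to a candidate URL, or None.

    projects: iterable of dicts with "acronym" and "title" keys
              (a stand-in for records from an open data set).
    search:   callable taking a query string and returning a list of
              result URLs (a stand-in for a free search service).
    """
    candidates = {}
    for project in projects:
        acronym = project["acronym"]
        query = f'{acronym} {project["title"]}'
        results = search(query)[:max_results]
        # Hypothetical heuristic: prefer a result whose host name
        # contains the project acronym; otherwise record no candidate.
        match = None
        for url in results:
            host = urlparse(url).netloc.lower()
            if acronym.lower() in host:
                match = url
                break
        candidates[acronym] = match
    return candidates
```

Passing the search service in as a callable keeps the sketch independent of any particular search engine and makes it easy to try out with a stub.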

All the data sets and tools developed during this research have been made publicly available in open access so that they can be reused and collaboratively enhanced.

52 million web files related to science were preserved

The application of the developed process has already enabled the preservation of 52 million files (7 TB) obtained from 53 993 websites of R&D projects financed since FP4 (1994). One example is the WEZARD project, funded by FP7, which aimed at “preparing the future research community in the area of air transport system robustness when it is faced with weather hazards”. The website for this project (www.wezard.eu) is no longer available online. However, it was preserved and can be accessed at Arquivo.pt.
All the websites identified and preserved during this project have been accessible through Arquivo.pt since March 2017.
Preserved website of the WEZARD project (www.wezard.eu), funded by FP7 between 2011 and 2013, available at Arquivo.pt.

Contributions to complement the European Open Data Portal data sets

The developed process was applied to the data sets published through the European Open Data Portal to try to complement the missing information regarding project URLs. The obtained results showed that the completeness of the FP7 data set was improved by 86.6%.

All the resulting data sets were made publicly available so that they can be improved and reused by other organizations also interested in preserving this digital heritage (FP4, FP5, FP6, FP7).

References

Are you a researcher?

Blogs that stand for History: training offered by Arquivo.pt


Last updated on September 26th, 2017 at 10:27 am

“How can my blog stay in the Digital History of Portugal?” is the starting question for this meeting dedicated to digital preservation.

Blogs that stand for history – Arquivo.pt training course

On February 23, 2017, the FCCN unit of FCT, which is responsible for Arquivo.pt, hosted a free training session in Lisbon for bloggers in the areas of technology, lifestyle and fashion. Under the motto of working to secure their blogs a place in the history of the Portuguese Web, these bloggers joined the research infrastructure Arquivo.pt, attending sessions on digital preservation techniques