New informative site of Arquivo.pt!

Novo site sobre.arquivo.pt

On August 2, 2017, a new version of the informative website about Arquivo.pt (available on sobre.arquivo.pt/en/) was launched.

This new version consisted mainly on the adoption of the WordPress platform as replacement for an obsolete version of Plone.

The following improvements are highlighted:

If you notice any problems or wish to make any suggestions for improvement, please contact us!

Your opinion is valuable.

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Investiga XXI research project has started

Investiga XXI research project is publicly sharing the first contributions based on Arquivo.pt.

The main goal is to promote the use of Arquivo.pt as a source and tool for scientific research.

The ongoing projects adopt perspectives from the Digital Humanities in their approach to the different objects of study:

To know more:

If you want to collaborate, contact us.

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Arquivo.pt preserved websites about Research & Development projects funded by the EU

EuropeanCommission

Arquivo.pt automatically identified R&D project websites to preserve their content. It preserved 52 million web files (7 TB) related to science for future access.

R&D websites publish valuable information but are being lost

Websites about Research and Development (R&D) projects are increasingly being used to publish important scientific information that complements published literature (e.g. data sets, documentation or software). However, after projects ending, the corresponding websites usually disappear causing a permanent loss of unique and valuable scientific information.
Percentage of project URLs from the EU Open Data Portal that referenced relevant content in November 2015 distributed per work programme since FP4 (1994). 
Percentage of project URLs from the EU Open Data Portal that referenced relevant content in November 2015 distributed per work programme since FP4 (1994).
Online information related to R&D projects is not being fully documented. For example, information about the URLs of projects funded by the 7th Framework Program (FP7) available at the European Union’s Open Data Portal is missing for 92% of the projects.

Arquivo.pt automatically identified URLs related to Research and Development projects

The main objective of Arquivo.pt is to preserve online information for scientific and academic purposes. Therefore, it developed a pragmatic and low-cost process that automatically identifies URLs related to R&D projects to be systematically preserved. Automatic identification is achieved through the combination of open data sets with free search services. This work is detailed in an article published at the International Conference on Digital Preservation 2016.

All the data sets and tools developed during this research have been made publicly available in open access so that they can be reused and collaboratively enhanced.

52 million web files related to science were preserved

The application of the developed process already enabled the preservation of 52 million files (7 TB) obtained from 53 993 websites of R&D projects financed since the FP4 (1994), such as the WEZARD project funded by FP7 aimed at “preparing the future research community in the area of air transport system robustness when it is faced with weather hazards”. The website for this project (www.wezard.eu) is no longer available online. However, it was preserved and can be accessed at Arquivo.pt.
All the websites identified and preserved during this project are accessible through Arquivo.pt since March 2017.
Preserved website of the WEZARD project (www.wezard.eu), funded by FP7 between 2011 and 2013, available at Arquivo.pt.
Preserved website of the WEZARD project (www.wezard.eu), funded by FP7 between 2011 and 2013, available at Arquivo.pt.

Contributions to complement the European Open Data Portal data sets

The developed process was applied to the data sets published through the European Open Data Portal to try to complement the missing information regarding project URLs. The obtained results showed that the completeness of the FP7 data set was improved by 86.6%.

All the resulting data sets were made publicly available so that they can be improved and reused by other organizations also interested on preserving this digital heritage (FP4FP5FP6FP7).

References

Are you a researcher?

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Blogs that stand for History: training offered by Arquivo.pt

Blogs that stand for history Arquivo.pt training course

“How can my blog stay in the Digital History of Portugal?” Is the starting question for this meeting dedicated to digital preservation.

Blogs that stand for history Arquivo.pt training course
Blogs that stand for history – Arquivo.pt training course

On the 23rd February 2017, the FCCN unit of the FCT, in Lisbon, responsible for Arquivo.pt, hosted a free training session for bloggers in the areas of technology, lifestyle and fashion. Under the motto of working to leave their blogs in the history of the Portuguese Web, this set of bloggers joined the research infrastructure Arquivo.pt attending sessions on digital preservation techniques

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Training about Arquivo.pt in April at UTAD, Vila Real, Portugal

Arquivo.pt formação gratuita UTAD 2017

This training will take place during the event Jornadas FCCN 2017 at UTAD from 19 to 21 April.

Training agenda

This training course will take place at Universidade de Trás-os-Montes e Alto Douro (UTAD), 20 de abril, 14:30-16:00.

  • Arquivo.pt: an innovative service at your disposal
  • How to publish preservable information for the future
  • Automatic access to Arquivo.pt (APIs)

Do not miss the Zapping of other services at your service!

We also highlight the “Zapping session of FCCN projects and services” (April 20, 9:30). During just 1 hour anyone can get to know all the services offered by the FCCN, free or at no cost to the academic community.

Registrations

Registration is free and includes social events. However, the number of registrations is limited and we accept submissions in order of submission. The main objective of Jornadas FCCN is to interact with the local communities.

Please spread the word about it to potential interested parties.

Related Links

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Sites crawled in 2015 are now available through Arquivo.pt!

The information collected from the Web during 2015 is now avaliable in Arquivo.pt!

Remember and investigate historical events of 2015 such as the terrorist attacks at Charlie Hebdo and Bataclan or the Greek crisis.

Charlie Hebdo Cover

 

Now you can acess 835 milion preserved files (35TB) from 2 milion websites collected in 2015.

Examples of preserved pages

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

New release of Arquivo.pt with improved replay quality!

Antes e Depois site preservado aeropaixão

Arquivo.pt released a new version of its service on the 25th of January 2017.

The new version named PyCDX introduces significant improvements in the replay quality of the preserved pages.

These improvements resulted from the adoption of PyWb technology developed by Ilya Kreymer.

The replay of the preserved pages is now more comprehensive, with the loading of additional images, PDF, CSS, and other Web content that previously were not reproduced.

Examples of improvements

AeroPaixao website before and after improved replay
Replay of the preserved page http://aero-paixao.planetaclix.pt/A320.htm before and after the new version of Arquivo.pt.
replay healthy-workplaces.eu before and after new version of arquivo.pt
Replay of the preserved page healthy-workplaces.eu before and after the new version of Arquivo.pt.

 

Replay of the archived page europa.eu before and after new release of Arquivo.pt
Replay of the preserved page http://europa.eu/ before and after the new version of Arquivo.pt.

 

Know More

Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Arquivo.pt – new version

Arquivo.pt new version 2016

Arquivo.pt released a new version on the 7th of November 2016.

The new version named Hercules presents design improvements, specially in the user interface for the replay of preserved Web pages, such as:

  • Minimize and Maximize options for the toolbar by clicking in the upper right corner, so that users can visualize the preserved Web page in full screen;
Arquivo.pt new version 2016
Arquivo.pt new version 2016
Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone

Arquivo.pt in Iceland

Arquivo.pt Iceland IIPC 2016

The Arquivo.pt team went to the biggest international Web archiving conference.

Arquivo.pt Iceland IIPC 2016
Arquivo.pt Iceland IIPC 2016

The conference, organized by the International Internet Preservation Consortium (IIPC), occurred from the 11th to the 15th of April 2016.

Arquivo.pt contributed with 5 presentations in the conference.

The slides are available in the following links:

For more informations about the conference visit http://www.netpreserve.org/general-assembly/2016/overview
Share on FacebookTweet about this on TwitterShare on LinkedInShare on Google+Email this to someone