Before lunch, hear firsthand the announcement of the Arquivo.pt 2018 Prize, which aims to foster innovative research using resources preserved from the Web.
During the lunch, which we are pleased to offer, you can chat with the speakers and the Arquivo.pt team.
Workshops: research use cases and training
Research using Arquivo.pt
Get to know the research work already done in several areas using Arquivo.pt.
FCT-FCCN’s Advanced Services Area, which includes Arquivo.pt and the Video Services, has opened a vacancy for complementary training in infrastructure and services for science management (scholarship grant).
The activities to be carried out will be related to the training and dissemination of advanced services for scientific research and higher education.
The deadline for submitting applications is October 27, 2017.
Campaign websites are historically relevant. However, they are difficult to identify because they appear and disappear quickly. Moreover, they are often exclusively referenced through printed media (e.g. posters).
That’s why your collaboration is essential!
To help, simply add addresses of pages or sites related to the Municipal Elections of 2017 through the following link:
Arquivo.pt automatically identified R&D project websites to preserve their content. It preserved 52 million web files (7 TB) related to science for future access.
R&D websites publish valuable information but are being lost
Arquivo.pt automatically identified URLs related to Research and Development projects
The main objective of Arquivo.pt is to preserve online information for scientific and academic purposes. Therefore, it developed a pragmatic and low-cost process that automatically identifies URLs related to R&D projects to be systematically preserved. Automatic identification is achieved through the combination of open data sets with free search services. This work is detailed in an article published at the International Conference on Digital Preservation 2016.
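The idea of combining an open data set with a search service can be illustrated with a minimal sketch. It is not the actual Arquivo.pt implementation: the record fields (`acronym`, `title`), the excluded hosts, the `search` callable and the stub results are all hypothetical, and the selection rule (first result that is not a known listing site) is only an assumed heuristic.

```python
from typing import Callable, Iterable, List, Optional
from urllib.parse import urlparse

# Assumption: hosts that list projects rather than host a project's own site.
EXCLUDED_HOSTS = {"cordis.europa.eu", "ec.europa.eu", "en.wikipedia.org"}


def build_query(acronym: str, title: str) -> str:
    """Combine a project's acronym and title into one search query."""
    return f"{acronym} {title}".strip()


def pick_candidate(urls: Iterable[str]) -> Optional[str]:
    """Return the first result URL whose host is not in EXCLUDED_HOSTS."""
    for url in urls:
        host = urlparse(url).netloc.lower()
        if host and host not in EXCLUDED_HOSTS:
            return url
    return None


def identify_project_url(
    project: dict, search: Callable[[str], List[str]]
) -> Optional[str]:
    """Query the search service for one data set record and pick a candidate."""
    query = build_query(project["acronym"], project["title"])
    return pick_candidate(search(query))


def fake_search(query: str) -> List[str]:
    """Stub standing in for a real search service (hypothetical results)."""
    return ["https://cordis.europa.eu/project/id/0", "http://www.example.org/"]


record = {"acronym": "EXAMPLE", "title": "An example project"}
candidate = identify_project_url(record, fake_search)
print(candidate)
```

Passing the search service in as a function keeps the sketch independent of any particular provider, so the same loop could be run over every record in a data set with whichever free search service is available.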
52 million web files related to science were preserved
Applying this process has already enabled the preservation of 52 million files (7 TB) obtained from 53 993 websites of R&D projects funded since FP4 (1994). One example is the WEZARD project, funded by FP7, which aimed at “preparing the future research community in the area of air transport system robustness when it is faced with weather hazards”. The website for this project (www.wezard.eu) is no longer available online. However, it was preserved and can be accessed at Arquivo.pt.
All the websites identified and preserved during this project have been accessible through Arquivo.pt since March 2017.
Contributions to complement the European Open Data Portal data sets
The process was applied to the data sets published through the European Open Data Portal to fill in missing project URLs. The results showed that the completeness of the FP7 data set was improved by 86.6%.
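One plausible way to measure the completeness of a URL field is the share of records with a non-empty value, compared before and after merging in the identified websites. The sketch below assumes that definition; the source does not state how the 86.6% figure was computed. The `projectUrl` field name and the toy records are hypothetical, and only the “Identified Websites” column name comes from the published files.

```python
def completeness(records, field):
    """Fraction of records that have a non-empty value for `field`."""
    if not records:
        return 0.0
    filled = sum(1 for r in records if (r.get(field) or "").strip())
    return filled / len(records)


def merge_identified(record):
    """Fill an empty project URL with the automatically identified website."""
    url = record["projectUrl"] or record["Identified Websites"]
    return {**record, "projectUrl": url}


# Toy records for illustration only, not real data.
projects = [
    {"acronym": "A", "projectUrl": "http://a.example.org", "Identified Websites": ""},
    {"acronym": "B", "projectUrl": "", "Identified Websites": "http://b.example.org"},
    {"acronym": "C", "projectUrl": "", "Identified Websites": ""},
    {"acronym": "D", "projectUrl": "", "Identified Websites": "http://d.example.org"},
]

before = completeness(projects, "projectUrl")
after = completeness([merge_identified(r) for r in projects], "projectUrl")
print(f"before: {before:.0%}, after: {after:.0%}")
```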
All the resulting data sets were made publicly available so that they can be improved and reused by other organizations also interested in preserving this digital heritage (FP4, FP5, FP6, FP7).
European Open Data Portal database complemented by Arquivo.pt using the proposed process. The new project URLs are available in the “Identified Websites” column of the files FP4, FP5, FP6, FP7.