Software Engineer Java/Linux needed at Arquivo.pt

Last updated on October 1st, 2018 at 04:40 pm

Arquivo.pt is looking for a Software Engineer Java/Linux to execute the following functions:

  • Maintenance and quality control of the service;
  • Development of large scale distributed computer systems;
  • Software maintenance and corrective evolution;
  • Interaction with the scientific community to establish collaborations.

Requirements

  • Bachelor, master or PhD in computer science;
  • Experience with: Java J2EE, Java Server Pages, Java Beans, Python, Tomcat, Maven/Ant,  Linux  and Apache HTTPd servers;
  • Knowledge about architecture, development and operation of distributed computer systems;
  • Good level of English.

Preferences

  • Source code repositories management (ex. Git);
  • Automatic tests platform management (ex. Jenkins, SeleniumHQ, SonarCube);
  • Participation in open source collaboration projects;
  • Internet production systems management experience;
  • Knowledge of Big Data/NoSQL (ex. Hadoop, HBase, Lucene, Solr);
  • Knowledge of Information Retrieval or Machine Learning;
  • Experience in web archive technologies (ex. Wayback Machine, Heritrix, NutchWAX);
  • Participation in Research & Development projects.

Applications

Arquivo.pt Award will be given at Ciência 2018

Last updated on October 26th, 2018 at 11:36 am

Arquivo.pt 2018 award will be given at  “Ciência 2018 – Encontro com a Ciência e Tecnologia”, wich will take place from the 2nd to the 4th of July, at the Lisbon Congress Centre.

The European Commissioner for Research, Science and Innovation , Carlos Moedas, will present the award at the plenary session “Novas fronteiras da era digital na Europa e no Mundo”, on July 3 at 17h30. The 1.º prize is 10 000 euros, the 2.º is 3 000 euros and the 3.º is 2 000 euros.

This was the Arquivo.pt award first edition. The competition ended on May 4 and 27 proposals from various fields such as: media, technology, computer science, tourism, cultural and historic patrimony, were received. This award recognises originality and innovation in Arquivo.pt’s applicability and the importance of web preservation.

Sites crawled in 2016 are now available through Arquivo.pt!

Last updated on October 1st, 2018 at 04:41 pm

The information collected from the Web during 2016 is now available at Arquivo.pt.

Remember events of 2016 such as the United  States Presidential Elections or UEFA Euro 2016.

Now you can access 964 million preserved files (58TB) from 2 million websites collected in 2016.

More examples of preserved pages

Web archives for research: Call for Proposals!

RESAW@Porto2018 workshop

An introduction to web archives for Humanities and Social Science research.

This one-day workshop will be held as part of TPDL 2018 on 13 September 2018, Porto, Portugal.

RESAW is a network of researchers and web archivists that promotes the development of Research infrastructures for the Study of Archived Web Material.

The organizing committee is composed by  Jane Winters (Professor of Digital Humanities at the University of London) and Daniel Gomes (Arquivo.pt).

Call for Proposals

We are seeking proposals from potential contributors to the workshop in two main areas:

  • Work presenting the state of the field and discussing the opportunities and challenges of using this new kind of primary source for research. This may include demonstrations of existing web archives.
  • Presentations of ground-breaking Humanities and Social Science research drawing on web archives, from small-scale analyses of individual websites to large-scale investigations of entire domains.

Apply

To apply, just submit a 300-500 word abstract via our online form. Final presentations should be around 30 minutes in length.

Deadline for applications: 15 June 2018, 17.00 (GMT+1).

All details at: http://arquivo.pt/resawPorto2018

Spread the word!

Please disseminate among potentially interested authors and attendees.

Thanks!

Jane Winters & Daniel Gomes

Arquivo.pt at Jornadas de Computação Científica 2018

Last updated on October 2nd, 2018 at 02:12 pm

The Jornadas de Computação Científica 2018 were in April, in Braga Portugal, at the International Iberian Nanotechnology Laboratory (INL).

At the event, the Portuguese Secretary of State for Science, Technology and Higher education, Maria Fernanda Rollo, spoke about the Digital Future.

Arquivo.pt team and some guests presented new services and applications.

Plenary session

Presentations about Arquivo.pt

Grants to collaborate with Arquivo.pt

Last updated on October 1st, 2018 at 04:41 pm

The ROSSIO Project-  Ciências Sociais, Artes e Humanidades has an open call for 3 grants in colaboration with Arquivo.pt. The call is open until the 10th of May 2018.

Check for application details at the links below

Digital Curator / Information Manager

Main tasks

  • Creation and management of themed collections in digital format.
  • Quality control and tests.
  • Training and users support
  • Critical review of scientific and technical documents (ex. scientific articles, operational documents).
  • Validation and information management in multiple digital formats (ex. videos, web pages, documents).
  • Identification and documentation of users needs.

Application details

Community Manager

Main tasks

  • User support with creation and management of community interaction.
  • Creation of strategic communication and training.
  • Communication quality control support.
  • Content creation and social media dissemination.
  • Services needs assessment according to users real performance
  • Networking management

Application details

Social Sciences and Humanities Researcher

Main tasks

  • Research to demonstrate the scientific utility of digital platforms
  • Training in community involvement for researchers

Application details

New API to search Arquivo.pt

Last updated on October 1st, 2018 at 04:42 pm

On February 23, 2018 Arquivo.pt launched a new version called Zeus.

This new version includes a new programming interface (Arquivo.pt API) and improvements in the mobile interfaces.

Arquivo.pt API: the oficial API

The new application programming interface, called Arquivo.pt API, allows you to search and access information preserved automatically. The purpose of this new API was to aggregate functions and information that were provided through the various APIs developed previously. The search can be done by URL or by terms, and a JSON object with the response items is returned.

This new API can be useful to compete for the Arquivo.pt Prizes.

To know more

Arquivo.pt Prizes 2018: Call for submissions!

Last updated on October 26th, 2018 at 11:37 am

Applications for the Arquivo.pt Prizes 2018 are open until May 4.

The 1st prize is 10 000 EURO, individual or group applications can be submitted about any theme. The only requirement is to use Arquivo.pt as the main source of information.

To apply, you will need to submit a text and a short video describing the work done. The use of the Portuguese language is mandatory.

Know more at:
http://arquivo.pt/prizes

Spread the word about it among potential candidates!

How to improve an online service (video)?

Improving the robustness of the Arquivo.pt web archive (video thumbnail)

Last updated on April 3rd, 2019 at 11:59 am

Arquivo.pt recommendations to improve the quality of online services


This presentation provides an overview of the architecture and functioning of the system that supports the Arquivo.pt web archive.

It shares the main lessons learned from the experience of developing and maintaining this service for 10 years.

We believe that these recommendations can be useful to improve the quality of any web-based information system.

Share it with your IT department!

We preserved the Portuguese Local Elections of 2017

Last updated on May 8th, 2023 at 05:08 pm

Arquivo.pt performed 2 web crawls of information related with the Portuguese Local Elections of 2017.

We appealed the community to contribute with suggestions of relevant Web pages so that we could preserve them.

The 2 crawls occurred during and after the campaign period, using the list of 410 Web pages suggested by the community and 13 887 web pages found automatically using search engines.

The manual identification process originated a list of 337 addresses which documented candidacies for the 2017 Municipal Elections. Note that 46% of these addresses referenced the social media platform Facebook.com. Much of this content of national interest could not be preserved because this foreign private company does not allow it.

The final result was an archive of 2 265 887 Web resources (360 GB).

Among the preserved web pages are the official sites of the candidates, news, blogs and articles with personal opinions about the elections.

The Arquivo.pt respects an embargo period of 1 year, and for that reason this collection will only be available by the end of 2018.

Meanwhile, you can consult the preserved pages about the previous elections of 2013, such as:

We would like to thank all the volunteers that collaborated with this initiative.