Examples of external research work performed using Arquivo.pt
Scientific articles
- Adam Jatowt et al., Is this news article still relevant? Ranking by contemporary relevance in archival search, IJDL 2023.
- Ricardo Camposet al. , Public News Archive: A Searchable Sub-archive to Portuguese Past News Articles, ECIR 2023.
- Francisco Gonçalves, Text2Storyline: Generating Enriched Storylines from Text, ECIR 2023.
- Gonçaço Melo da Silva et al., ROSSIO Infrastructure: A Digital Humanities Platform to Explore the Portuguese Cultural Heritage. Information 2022.
- Alina Yanchuk et al., Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers, ADBIS 2022.
- Nuno Miquelina et al., Generating a European Portuguese BERT Based Model Using Content from Arquivo.pt Archive, IDEAL 2022.
- Himarsha R. Jayanetti et al., Robots Still Outnumber Humans in Web Archives, But Less Than Before, TPDL 2022.
- Sawood Alam et al., Profiling Web Archival Voids for Memento Routing, JCDL 2021.
- Moisés Rockembach and Anabela Serrano, Climate change and web archives: an Ibero-American study based on the Portuguese and Brazilian contexts, Records Management Journal 2021.
- Paulo Martins and J.C. Ramalho, Knowledge graph of press clippings referring social minorities, CEUR-WS 2021.
- Flávio Martins and André Mourão, Revisionista.PT: Uncovering the News Cycle Using Web Archives, ECIR 2020.
- Francisco Cádima, A RTP em ambiente digital: dos anos 90 à atualidade – um enquadramento teórico. PAULUS: Revista De Comunicação Da FAPCOM 2020.
- Ahmed AlSum et al., Profiling web archive coverage for top-level domain and content language. IJDL 2014.
- András Garzó et al.. Cross-lingual web spam classification, WWW 2013.
Research data sets derived from Arquivo.pt
- Diego Alves, 2019 European Parliamentary Elections – Raw texts, Harvard Dataverse 2023.
- Diego Alves, 2019 European Parliamentary Elections – CoNLL-U texts, Harvard Dataverse 2023.
- David Batista, A n-grams collection extracted from the Portuguese Web, Harvard Dataverse 2022.