Last updated on October 11th, 2024 at 04:02 pm
Examples of external research work performed using Arquivo.pt
Scientific and academic articles
- Daniela Major, The uses of History in the media coverage of the European Union 2004 – 2019, PhD thesis, 2024.
- Nuno Guimarães et. al., Perfil Público: Automatic Generation and Visualization of Author Profiles for Digital News Media, PROPOR, 2024.
- R. Lopes et al., GlórIA – A Generative and Open Large Language Model for Portuguese (Preprint), arXiv 2024.
- R. Lopes et al., GlórIA – ArquivoPT News PT-PT Dataset – Hugging Face Model card, 2024.
- L. Frew, Making Changes in Webpages Discoverable: A Change-Text Search Interface for Web Archives, arXiv 2023.
- J. Tavares, Towards an Automated Media Chart: Framing News Articles with Natural Language Processing Techniques, MSc thesis 2023.
- F. Hantke, You Call This Archaeology? Evaluating Web Archives for Reproducible Web Security Measurements, ACM CCS 2023.
- Adam Jatowt et al., Is this news article still relevant? Ranking by contemporary relevance in archival search, IJDL 2023.
- Agrawal et al., Advancements in NSFW Content Detection: A Comprehensive Review of ResNet-50 Based Approaches, IJSAE 2023
- Ricardo Campos et al., Public News Archive: A Searchable Sub-archive to Portuguese Past News Articles, ECIR 2023.
- Francisco Gonçalves, Text2Storyline: Generating Enriched Storylines from Text, ECIR 2023.
- Bruna Ramalho Galamba, The Fortress of Santa Catarina de Ribamar (Portimão) as a Proposal for Good Practices of Military Heritage Preservation, Cultural Sustainable Tourism 2022.
- D. Tércio, Terpsicore – dance and performing arts archive, Dance Data, Cognition, and Multimodal Communication 2022.
- Gonçaço Melo da Silva et al., ROSSIO Infrastructure: A Digital Humanities Platform to Explore the Portuguese Cultural Heritage. Information 2022.
- Alina Yanchuk et al., Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers, ADBIS 2022.
- Nuno Miquelina et al., Generating a European Portuguese BERT Based Model Using Content from Arquivo.pt Archive, IDEAL 2022.
- Himarsha R. Jayanetti et al., Robots Still Outnumber Humans in Web Archives, But Less Than Before, TPDL 2022.
- Sawood Alam et al., Profiling Web Archival Voids for Memento Routing, JCDL 2021.
- C. Cardoso, How the portuguese media represented the first racialized female MP head of a political party, IAMCR 2021.
- Moisés Rockembach and Anabela Serrano, Climate change and web archives: an Ibero-American study based on the Portuguese and Brazilian contexts, Records Management Journal 2021.
- Paulo Martins and J.C. Ramalho, Knowledge graph of press clippings referring social minorities, CEUR-WS 2021.
- Eilaf Eid Alotaibi, Saudi Females Beginners’ Attitudes Towards Full-online Learning Through EFL Virtual Classrooms During COVID-19 Pandemic, Arab World English Journal 2021
- Flávio Martins and André Mourão, Revisionista.PT: Uncovering the News Cycle Using Web Archives, ECIR 2020.
- Francisco Cádima, A RTP em ambiente digital: dos anos 90 à atualidade – um enquadramento teórico. PAULUS: Revista De Comunicação Da FAPCOM 2020.
- B. Niveditha, A Study of Availability and Recovery of URLs in Library and Information Science Scholarly Journals, AJIST 2020.
- P. Nunes-Silva et al., Applications of RFID technology on the study of bees, Insectes Sociaux 2019.
- F. Nanni, Collecting Primary Sources from Web Archives: A Tale of Scarcity and Abundance, The SAGE Handbook of Web History 2019.
- M. Aturban et al., Collecting 16K archived web pages from 17 public web archives, arxiv 2019.
- J.P. Amorim, Mature students’ access to higher education: A critical analysis of the impact of the 23+ policy in Portugal, EJED 2018.
- P. Webster, Existing web archives, The SAGE handbook of web history 2018.
- T. Samar, Quantifying retrieval bias in Web archive search, IJDL 2018.
- D. Siklósi, Assessing the Quality of Web, PhD thesis 2016.
- S. Antunes, Interfaces para um museu do web design português, PhD thesis 2015.
- Ahmed AlSum et al., Profiling web archive coverage for top-level domain and content language. IJDL 2014.
- András Garzó et al.. Cross-lingual web spam classification, WWW 2013.
- Search for more references through Google Scholar
Research data sets derived from Arquivo.pt
- Ricardo Lopes, João Magalhães, David Semedo, GlórIA – ArquivoPT News PT-PT Dataset – Hugging Face Model card (paper)
- Nuno Miquelina, Paulo Quaresma, Vitor Nogueira, Generating a European Portuguese BERT Based Model Using Content from Arquivo.pt Archive, (Preprint)
- Diego Alves, 2019 European Parliamentary Elections – Raw texts, Harvard Dataverse 2023.
- Diego Alves, 2019 European Parliamentary Elections – CoNLL-U texts, Harvard Dataverse 2023.
- David Batista, A n-grams collection extracted from the Portuguese Web, Harvard Dataverse 2022.