Identifying Encapsulating Anaphoras: comparison between automated and human analysis

Authors

DOI:

https://doi.org/10.21680/1517-7874.2025v27n1ID38802

Abstract

Encapsulating anaphoras are the focus of this study, highlighting their relevance for the analysis and teaching of writing. Considering the advances in the use of Artificial Intelligence (AI) in computational linguistics, the research is justified by the need to integrate innovative methodologies in textual analysis. The motivation arises from the gap in the application of large language models (LLMs) to identify and categorize descriptive and opinionated encapsulations in essays. The study proposes a hybrid approach that combines human analysis, rich in contextual nuances, with the capabilities and scalability of LLMs using zero-shot and few-shot prompts. The experiments carried out with Enem essays show that the use of few-shot prompts significantly improves the identification of encapsulating anaphoras by LLMs, when compared to the use of zero-shot prompts, but still below that observed in human analyses. In short, this work seeks to contribute to the advancement of research in textual linguistics and AI, offering a new perspective for the analysis of written texts and demonstrating the potential of combining human and computational methods to identify complex linguistic patterns.

Keywords: Encapsulated Anaphoras; Textual Analysis; Enem Essay; Large Language Model.

Downloads

Download data is not yet available.

Author Biographies

Osmar de Oliveira Braz Junior, Universidade do Estado de Santa Catarina

Bachelor's degree in Computer Science from the University of Southern Santa Catarina (1997) and Master's degree in Production Engineering from the Federal University of Santa Catarina (2000). Assistant Professor at the State University of Santa Catarina (UDESC) and hourly Professor at the University of Southern Santa Catarina (UNISUL). He has experience in the area of ​​Computer Science, with emphasis on Software Engineering, working mainly in the following areas: information systems, distance education, software engineering and database.

Roberlei Alves Bertucci, Universidade Tecnólogica Federal do Paraná

Graduated in Portuguese-English Literature from PUCPR (2004); Master in Literature (Linguistic Studies) from UFPR (2007) and PhD in Linguistics from USP (2011), having completed part of his doctorate at Université Paris 8 (2009-2010). He developed post-doctoral research at Bar-Ilan University in Israel (2012). He is currently a professor at the Federal Technological University of Paraná (UTFPR). He is interested in different grammatical (formal) processes of meaning production in natural languages, such as: syntax, semantics and pragmatics of natural languages, especially Brazilian Portuguese; linguistic description and analysis in the verbal and nominal domains, especially through technological tools; and application of linguistic foundations and discoveries to digital technological tools.

Renato Fileto, Universidade Federal de Santa Catarina

Renato Fileto holds a Bachelor degree in Computer Science from the Federal University of Uberlândia (1992), a Master degree (1994) and a Doctorate degree (2003) in Computer Science from Campinas State University, Brazil, with an internship at Georgia Institute of Technology, USA (2002), and a Post-Doctorate from the University of São Paulo (2012). His research carrier has been intertwined with activities in the industry. Since 2006, he is a permanent professor at the Department of Informatics and Statistics (INE) of Santa Catarina Federal University (UFSC), in Florianópolis-SC, Brazil. His research area is data science, with the focus in intelligent systems for data analytics.

Published

18-06-2025

How to Cite

DE OLIVEIRA BRAZ JUNIOR, Osmar; ALVES BERTUCCI, Roberlei; FILETO, Renato. Identifying Encapsulating Anaphoras: comparison between automated and human analysis. Revista do GELNE, [S. l.], v. 27, n. 1, p. e38802, 2025. DOI: 10.21680/1517-7874.2025v27n1ID38802. Disponível em: https://www.periodicos.ufrn.br/gelne/article/view/38802. Acesso em: 11 mar. 2026.

Issue

Section

Dossiê temático: Linguística Textual brasileira: uma homenagem à Profa. Dra. Mônica Magalhães Cavalcante