What question did this study set out to answer?

The aim is to develop a process for identifying and classifying overt and covert racism in text, particularly in large datasets.

February 11, 2026 (Open Access)

Machines Do See Color: Using LLMs to Classify Overt and Covert Racism in Text

Key Points

The study aims to develop a process for identifying and classifying overt and covert racism in text, particularly in large datasets.
It proposes a generalizable process for coding racism in text.
It constructs labeled datasets for overt and covert racism.
It trains XLM-RoBERTa for supervised classification.
It uses a corpus of tweets related to the Ecuadorian indígena community.
The XLM-R and XLM-R-Racismo models outperform other state-of-the-art approaches in classifying racism.
The approach is demonstrated on a large dataset of tweets posted between 2018 and 2021.

Abstract

Extant work has identified two discursive forms of racism: overt and covert. While both forms have received attention in scholarly work, research on covert racism has been limited. Its subtle and context-specific nature has made it difficult to systematically identify covert racism in text, especially in large corpora. In this article, we first propose a theoretically driven and generalizable process to identify and classify covert and overt racism in text. This process allows researchers to construct coding schemes and build labeled datasets. We use the resulting dataset to train XLM-RoBERTa, a cross-lingual large language model (LLM) for supervised classification with a cutting-edge contextual understanding of text. We show that XLM-R and XLM-R-Racismo, our pretrained model, outperform other state-of-the-art approaches in classifying racism in large corpora. We illustrate our approach using a corpus of tweets relating to the Ecuadorian indígena community between 2018 and 2021.
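The abstract reports that the fine-tuned models outperform other approaches but does not say here which metric was used. As a minimal sketch of how such a comparison could be scored, the Python below computes per-class and macro-averaged F1 for a three-way classification. The label names (`none`, `overt`, `covert`), the helper functions, and the toy annotations are all assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter

# Hypothetical label set; the paper's actual coding scheme may differ.
LABELS = ("none", "overt", "covert")


def per_class_f1(gold, pred, labels=LABELS):
    """Return {label: F1} for parallel lists of gold and predicted labels."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p, but it was wrong
            fn[g] += 1  # gold label g was missed
    scores = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0.0 when the class never appears.
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(gold, pred, labels=LABELS):
    """Unweighted mean of per-class F1, so rare classes count equally."""
    scores = per_class_f1(gold, pred, labels)
    return sum(scores.values()) / len(labels)


# Toy, made-up annotations for six texts (not data from the study).
gold = ["none", "overt", "covert", "covert", "none", "overt"]
pred = ["none", "overt", "none", "covert", "none", "covert"]

if __name__ == "__main__":
    print(per_class_f1(gold, pred))
    print(round(macro_f1(gold, pred), 4))
```

Macro averaging is used in this sketch because covert racism is likely to be a minority class, and an unweighted mean keeps a model from scoring well simply by predicting the majority label.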

Cite this study

Gordillo et al. studied this question.

www.synapsesocial.com/papers/698c1bcd267fb587c655dbbc — DOI: https://doi.org/10.1177/00491241251412360

Authors

Diana Davila Gordillo

Joan C. Timoneda

Sebastián Vallejo Vera

Journals

Sociological Methods & Research

Institutions

Purdue University West Lafayette

Western University

Leiden University
