Skip to Main Content (Press Enter)

Logo UNITO
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione

UNI-FIND
Logo UNITO

|

UNI-FIND

unito.it
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione
  1. Pubblicazioni

Stereohoax: a multilingual corpus of racial hoaxes and social media reactions annotated for stereotypes

Articolo
Data di Pubblicazione:
2024
Abstract:
Stereotypes have been studied extensively in the felds of social psychology and, especially with the recent advances in technology, in computational linguistics. Stereotypes have also gained even more attention nowadays because of a notable rise in their dissemination due to demographic changes and world events. This paper focuses on ethnic stereotypes related to immigration and presents the StereoHoax corpus, a multilingual dataset of 17,814 tweets in French, Italian, and Spanish. The corpus includes conversational threads reporting on and responding to racial hoaxes about immigrants, which we defne as false claims of unlawful actions attributed to specifc ethnic groups. This work describes the data collection process and the fne-grained annotation scheme we used, which is based mainly on the Stereotype Content Model adapted to the study applied to immigrants of Bosco et al. (2023). Quantitative and qualitative analyses show the distribution and correlation of annotated categories across languages, revealing, for instance, intercultural diferences in the expression of stereotypes through forms of discredit. To validate our data, we performed four machine learning experiments using pre-trained BERT-like models in order to lay a foundation for automatic stereotype detection research. Leveraging the StereoHoax corpus, we gained crucial insights into the importance of context, especially in relation to the detection of implicit stereotypes. Overall, we believe that the StereoHoax corpus will prove to be a valuable resource for the automatic detection of stereotypes regarding immigrants and the study of the linguistic and psychological patterns associated with their dissemination.
Tipologia CRIS:
03A-Articolo su Rivista
Keywords:
Racial Hoax, Immigration, Social psychology, Natural language processing, Stereotype detection, Corpus analysis
Elenco autori:
Wolfgang S. Schmeisser-Nieto, Alessandra Teresa Cignarella, Tom Bourgeade, Simona Frenda, Alejandro Ariza-Casabona, Mario Laurent, Paolo Giovanni Cicirelli, Andrea Marra, Giuseppe Corbelli, Farah Benamara, Cristina Bosco, Véronique Moriceau, Marinella Paciello, Viviana Patti, Mariona Taulé & Francesca D’Errico
Autori di Ateneo:
BOSCO Cristina
PATTI Viviana
Link alla scheda completa:
https://iris.unito.it/handle/2318/2040052
Link al Full Text:
https://iris.unito.it/retrieve/handle/2318/2040052/1451918/s10579-024-09791-3.pdf
Pubblicato in:
LANGUAGE RESOURCES AND EVALUATION
Journal
Progetto:
BOSCO C. - CSP - Bando Challenges for Europe - Progetto "STERHEOTYPES: STudying European Racial Hoaxes and sterEOTYPES"- Prot. 2019.AAI4049.U4520/SM/pv
  • Dati Generali
  • Aree Di Ricerca

Dati Generali

URL

https://link.springer.com/article/10.1007/s10579-024-09791-3?utm_source=rct_congratemailt&utm_medium=email&utm_campaign=oa_20241219&utm_content=10.1007/s10579-024-09791-3#citeas

Aree Di Ricerca

Settori (12)


PE6_7 - Artificial intelligence, intelligent systems, natural language processing - (2024)

CIBO, AGRICOLTURA e ALLEVAMENTI - Farmacologia Veterinaria

CULTURA, ARTE e CREATIVITA' - Culture moderne

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Digitalizzazione della Cultura e della Creatività

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Digitalizzazione della Società e della Pubblica Amministrazione

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Salute e Informatica

LINGUE e LETTERATURA - Anglistica e angloamericanistica

LINGUE e LETTERATURA - Francesistica

PIANETA TERRA, AMBIENTE, CLIMA, ENERGIA e SOSTENIBILITA' - Diritto dell'Ambiente

PIANETA TERRA, AMBIENTE, CLIMA, ENERGIA e SOSTENIBILITA' - Informatica e Ambiente

SCIENZE MATEMATICHE, CHIMICHE, FISICHE - Fisica delle Particelle e dei Nuclei

SCIENZE MATEMATICHE, CHIMICHE, FISICHE - Laboratori innovativi, strumentazione e modellizzazione fisica
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.4.2.0