UNI-FIND

Treebanking user-generated content: a UD based overview of guidelines, corpora and unified recommendations

Article
Publication date:
2022
Abstract:
This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework of syntactic analysis. Given on the one hand the increasing number of treebanks featuring user-generated content, and its somewhat inconsistent treatment in these resources on the other, the aim of this article is twofold: (1) to provide a condensed, though comprehensive, overview of such treebanks—based on available literature—along with their main features and a comparative analysis of their annotation criteria, and (2) to propose a set of tentative UD-based annotation guidelines, to promote consistent treatment of the particular phenomena found in these types of texts. The overarching goal of this article is to provide a common framework for researchers interested in developing similar resources in UD, thus promoting cross-linguistic consistency, which is a principle that has always been central to the spirit of UD.
CRIS type:
03A-Journal Article
Author list:
Manuela Sanguinetti, Cristina Bosco, Lauren Cassidy, Özlem Çetinoğlu, Alessandra Teresa Cignarella, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djamé Seddah, Amir Zeldes
University authors:
BOSCO Cristina
Link to full record:
https://iris.unito.it/handle/2318/1842461
Link to full text:
https://iris.unito.it/retrieve/handle/2318/1842461/942452/LREV2022.pdf
Published in:
LANGUAGE RESOURCES AND EVALUATION
Journal
General data

URL

https://link.springer.com/content/pdf/10.1007/s10579-022-09581-9.pdf