Skip to Main Content (Press Enter)

Logo UNITO
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione

UNI-FIND
Logo UNITO

|

UNI-FIND

unito.it
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione
  1. Pubblicazioni

Detection of Privacy-Harming Social Media Posts in Italian

Contributo in Atti di convegno
Data di Pubblicazione:
2023
Abstract:
As many psychological and sociological study reveal, many people disclose too much privacy-harming information in social media in the form of text and multimedia posts, thus exposing themselves and other persons to several security risks. Consequently, many researchers have addressed this problem by investigating on the detection and analysis of the so-called self-disclosure behavior in social media and blogging platforms. Among the others, content sensitivity analysis has emerged as a promising research direction, but, so far, it has only focused on English text posts, although it is well-known that people tend to disclose mostly in their own native languages. Therefore, in this paper, we address this limitation by proposing a new text corpus of Italian posts that we have annotated following to the anonymity assumption. We then apply several language models based on transformers to classify them according to their sensitivity. Moreover, since Italian is a lower-resource language compared to English, we also apply some multilingual zero-shot transfer learning architectures trained on a rich and manually annotated English corpus and tested on the Italian one. We show experimentally that the approaches trained directly on the Italian corpus, still outperform multilingual ones trained on the English data and tested on Italian, although some of them exhibit promising prediction performances.
Tipologia CRIS:
04A-Conference paper in volume
Keywords:
Privacy, Neural language models, Social media
Elenco autori:
Peiretti, Federico; Pensa, Ruggero G.
Autori di Ateneo:
PENSA Ruggero Gaetano
Link alla scheda completa:
https://iris.unito.it/handle/2318/1925050
Link al Full Text:
https://iris.unito.it/retrieve/handle/2318/1925050/1176808/main.pdf
Titolo del libro:
SocialSec 2023: Security and Privacy in Social Networks and Big Data
Pubblicato in:
LECTURE NOTES IN COMPUTER SCIENCE
Journal
LECTURE NOTES IN COMPUTER SCIENCE
Series
  • Dati Generali
  • Aree Di Ricerca

Dati Generali

URL

https://link.springer.com/chapter/10.1007/978-981-99-5177-2_12

Aree Di Ricerca

Settori (13)


PE6_11 - Machine learning, statistical data processing and applications using signal processing (e.g. speech, image, video) - (2022)

PE6_5 - Security, privacy, cryptology, quantum cryptography - (2022)

CIBO, AGRICOLTURA e ALLEVAMENTI - Farmacologia Veterinaria

CULTURA, ARTE e CREATIVITA' - Culture moderne

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Digitalizzazione della Cultura e della Creatività

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Digitalizzazione della Società e della Pubblica Amministrazione

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Industria X.0

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Salute e Informatica

LINGUE e LETTERATURA - Linguistica

PIANETA TERRA, AMBIENTE, CLIMA, ENERGIA e SOSTENIBILITA' - Diritto dell'Ambiente

PIANETA TERRA, AMBIENTE, CLIMA, ENERGIA e SOSTENIBILITA' - Informatica e Ambiente

SCIENZE DELLA VITA e FARMACOLOGIA - Tecnologie Farmaceutiche e Cosmetiche

SCIENZE MATEMATICHE, CHIMICHE, FISICHE - Teorie e modelli Matematici
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.6.1.0