GATTINA - GenerAtion of TiTles for Italian News Articles: A CALAMITA Challenge
Contributo in Atti di convegno
Data di Pubblicazione:
2024
Abstract:
We introduce a new benchmark designed to evaluate the ability of Large Language Models (LLMs) to generate Italian-language headlines for science news articles. The benchmark is based on a large dataset of science news articles obtained from Ansa Scienza and Galileo, two important Italian media outlets. Effective headline generation requires more than summarizing article content; headlines must also be informative, engaging, and suitable for the topic and target audience, making automatic evaluation particularly challenging. To address this, we propose two novel transformer-based metrics to assess headline quality. We aim for this benchmark to support the evaluation of Italian LLMs and to foster the development of tools to assist in editorial workflows.
Tipologia CRIS:
04A-Conference paper in volume
Keywords:
Benchmarking; CALAMITA Challenge; Headline generation; Italian; LLMs; Summarisation
Elenco autori:
Francis M.; Rinaldi M.; Gili J.; De Cosmo L.; Iannaccone S.; Nissim M.; Patti V.
Link alla scheda completa:
Titolo del libro:
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), Pisa, Italy, December 4-6, 2024
Pubblicato in: