VeryfIT - Benchmark of Fact-Checked Claims for Italian: A CALAMITA Challenge
Contributo in Atti di convegno
Data di Pubblicazione:
2024
Abstract:
Achieving factual accuracy is a known pending issue for language models. Their design centered around the interactive component of user interaction and the extensive use of “spontaneous” training data, has made them highly adept at conversational tasks but not fully reliable in terms of factual correctness. VeryfIT addresses this issue by evaluating the in-memory factual knowledge of language models on data written by professional fact-checkers, posing it as a true or false question. Topics of the statements vary but most are in specific domains related to the Italian government, policies, and social issues. The task presents several challenges: extracting statements from segments of speeches, determining appropriate contextual relevance both temporally and factually, and ultimately verifying the accuracy of the statements.
Tipologia CRIS:
04A-Conference paper in volume
Keywords:
benchmark; CALAMITA; CheckIT!; fact checking; factual knowledge; fake news; Italian
Elenco autori:
Gili J.; Patti V.; Passaro L.; Caselli T.
Link alla scheda completa:
Titolo del libro:
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), Pisa, Italy, December 4-6, 2024
Pubblicato in: