detección de noticias falsas

MultiClaim-2025-es

This dataset consists of fact-checks, social media posts and pairings between them. The dataset consists of 205,751 fact-checks in 39 languages and 28,092 social media posts in 27 languages. All the posts were previously reviewed by professional fact-checkers who also assigned appropriate fact-checks to them. There are 31,305 fact-check-to-post pairs, each post is paired with at least one fact-check. 26,774 of these pairs are monolingual and 4,212 are crosslingual. The dataset introduces crosslingual previously fact-checked claim retrieval (PFCR) as a new task.

CT–CWT–23-ES

El conjunto de datos se centra en tres temas: COVID-19, cambio climático y tecnología. El conjunto de datos en español es una combinación de CT-CWT-21, CT-CWT-22 y contenido recién recopilado. Se compone de tweets recogidos de cuentas de Twitter y transcripciones de políticos españoles, que son anotados manualmente por periodistas profesionales expertos en fact-checking. Cada tweet ha sido etiquetado usando tanto la imagen como el texto.