information extraction

FLARES

The dataset is a set of news articles annotated with the text segments that answer the 5 Ws (who, what, when, where, why) and with their reliability or credibility (reliable, partially reliable, unreliable).

FLARES 2024: Fine-Grained Language-based Reliability Detection in Spanish News - 5W1H identification

NLP topic

information extraction

Dataset

FLARES

Language

Spanish

Year

2024

DIANN-2018-ES

The corpus is a collection of 500 abstracts from Elsevier journal papers in the biomedical domain, collected between 2017 and 2018. It is divided into two disjoint parts: a training set (80%) and a test set (20%). It is annotated with disabilities, negations, and their scope.