information extraction

FLARES

The dataset consists of a set of news articles labeled with the text segments that answer the 5 Ws (who?, what?, when?, where?, why?) and with their reliability or credibility (reliable, partially reliable, unreliable).

DIANN-2018-ES

The corpus is a collection of 500 abstracts from Elsevier journal papers related to the biomedical domain collected between 2017 and 2018. It is divided into two disjoined parts: training set (80%) and test set (20%). It is annotated with disabilities and negations and their scope.