text classification
DIPROMATS
- Read more about DIPROMATS
- Log in or register to post comments
The DIPROMATS 2024 dataset is designed for the analysis of propaganda on social media, addressing both the techniques used and the underlying narratives. It contains tweets manually annotated in various languages and structured into three levels of analysis: binary detection of propaganda, classification into three groups of propaganda techniques, and detailed categorization into seven specific techniques. Additionally, it includes a multi-class, multi-label classification task to identify propaganda narratives associated with international actors.
RepLab-2014-Reputation
- Read more about RepLab-2014-Reputation
- Log in or register to post comments
RepLab-2014-Profiling
- Read more about RepLab-2014-Profiling
- Log in or register to post comments
MLDoc-ES
- Read more about MLDoc-ES
- Log in or register to post comments
Tweets emojis-ES
- Read more about Tweets emojis-ES
- Log in or register to post comments
ADoBo
- Read more about ADoBo
- Log in or register to post comments
MEDDOPROF
- Read more about MEDDOPROF
- Log in or register to post comments
DIPROMATS-ES 2023
- Read more about DIPROMATS-ES 2023
- Log in or register to post comments
DIPROMATS ES 2023 is composed of composed of tweets in Spanish written by diplomats. It includes 9591 tweets, 2997 of them published by 25 Chinese authorities, 1391 by 22 Russian authorities, 2465 tweets were published by 48 European authorities, and 40 authorities from the US provide 2738 tweets. The annotations provide information about whether a tweet contained propaganda techniques, 4 groups of propaganda techniques, 15 types of propaganda techniques.
ClinAIS 2023
- Read more about ClinAIS 2023
- Log in or register to post comments
The ClinAIS corpus is a randomly-selected subset of the background CodiEsp corpus, consisting of 1038 distinct clinical notes annotated with seven types of medical sections of the notes.