"Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior" (invalid dataset)

Antigoni-Maria Founta, Aristotle University of Thessaloniki
Constantinos Djouvas, Cyprus University of Technology
Despoina Chatzakou, Aristotle University of Thessaloniki
Ilias Leontiadis, Telefonica Research
Jeremy Blackburn, University of Alabama at Birmingham
Gianluca Stringhini, University College London
Athena Vakali, Aristotle University of Thessaloniki
Michael Sirivianos, Cyprus University of Technology
Nicolas Kourtellis, Telefonica Research

Publication Date

5-2-2019

Abstract

This dataset is invalid. The updated version of this Dataset is here: https://zenodo.org/record/3678559#.Xl9-Ji97FhE

Dataset for the "Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior" paper, published in ICWSM 2018. The full text of the paper can be found here.

The dataset provided here includes an updated version of the original dataset, with ~100k tweets annotated using the CrowdFlower platform:

hatespeech_labels.csv: contains ~100K rows, where every row consists of a unique Tweet ID and its according to majority annotation

UPDATE: It has come to our understanding that a number of the tweets are not available anymore for download on Twitter. Therefore, under request, we can provide one more file with the full ~100K tweet text, their associated majority label, and the number of votes for the majority label. The tweets are shuffled so that there is no connection between tweet IDs and texts (in order to be in line with the T&C of Twitter). To obtain the file contact the authors through email.

Please cite the paper in any published work that uses any of these resources.

@inproceedings{founta2018large,
    title={Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior},
    author={Founta, Antigoni-Maria and Djouvas, Constantinos and Chatzakou, Despoina and Leontiadis, Ilias and Blackburn, Jeremy and Stringhini, Gianluca and Vakali, Athena and Sirivianos, Michael and Kourtellis, Nicolas},
    booktitle={11th International Conference on Web and Social Media, ICWSM 2018},
    year={2018},
    organization={AAAI Press}
}

For any further questions contact a.m.founta at gmail dot com AND markos.charalambous at eecei.cut.ac.cy

Repository

Zenodo

Access Instructions

Access to this data is restricted.

Funder

Funder: European Commission
Funder DOI: 10.13039/501100000780
EnhaNcing seCurity And privacy in the Social wEb: a user centered approach for the protection of minors
691025

Link to Dataset

COinS

"Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior" (invalid dataset)

Publication Date

Abstract

Repository

Access Instructions

Funder

Search

Browse

Author Corner

Research Data Catalog

"Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior" (invalid dataset)

Authors

Publication Date

Abstract

Repository

Access Instructions

Funder

Share

Search

Browse

Author Corner