Dataset for "Who Let The Trolls Out? Towards Understanding State-Sponsored Trolls"

Publication Date

2-6-2019

Abstract

This is the dataset used for the study "Who Let The Trolls Out? Towards Understanding State-Sponsored Trolls". Savvas Zannettou, Tristan Caulfield, William Setzer, Michael Sirivianos, Gianluca Stringhini, Jeremy Blackburn. Arxiv, 2019. DOI: 10.5281/zenodo.2558560

The dataset consists of the data released by Twitter on October 2018 for Russian and Iranian state-sponsored troll accounts, which is available at https://about.twitter.com/en_us/values/elections-integrity.html#data as well as intermediate data that we generated after processing the raw data.
For instance, we include trained Word2Vec and LDA models, the output of our influence estimation experiments via Hawkes Processes, and a lot of other data necessary to reproduce the results in the paper.
To use the provided data simply download the compressed file from and make sure that the uncompressed data folder is in the same directory as the IPython Notebook.

The code used for this study can be found here: https://github.com/zsavvas/trolls_analysis

Please cite our paper if any publication, of any form and kind results of you using this data: @article{zannettou2018let, title={Who let the trolls out? towards understanding state-sponsored trolls}, author={Zannettou, Savvas and Caulfield, Tristan and Setzer, William and Sirivianos, Michael and Stringhini, Gianluca and Blackburn, Jeremy}, journal={arXiv preprint arXiv:1811.03130}, year={2018} }

Repository

Zenodo

Distribution License

Creative Commons Attribution 4.0 International License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Funder

Funder: European Commission
Funder DOI: 10.13039/501100000780
Cyber security cOmpeteNCe fOr Research anD InnovAtion
830927

Funder: European Commission
Funder DOI: 10.13039/501100000780
EnhaNcing seCurity And privacy in the Social wEb: a user centered approach for the protection of minors
691025

Share

COinS