Institutional Repository [SANDBOX]
Technical University of Crete

Explainable machine learning pipeline for Twitter bot detection during the 2020 US Presidential Elections

Shevtsov Alexander, Tzagkarakis Christos, Antonakaki Despoina, Ioannidis Sotirios


URI: http://purl.tuc.gr/dl/dias/ACFF4C2F-785F-497F-A9DC-43FBC128AA4A
Year: 2022
Type of Item: Peer-Reviewed Journal Publication
Bibliographic Citation: A. Shevtsov, C. Tzagkarakis, D. Antonakaki, and S. Ioannidis, “Explainable machine learning pipeline for Twitter bot detection during the 2020 US Presidential Elections,” Software Impacts, vol. 13, Aug. 2022, doi: 10.1016/j.simpa.2022.100333.

Summary

This study introduces a novel, reproducible, and reusable Twitter bot identification system. The system uses a machine learning (ML) pipeline fed with hundreds of features extracted from a Twitter corpus. The pipeline trains and validates several state-of-the-art ML models and selects the eXtreme Gradient Boosting (XGBoost) model because it achieves the highest detection performance. The Twitter dataset was collected during the 2020 US Presidential Elections, and additional experimental evaluation on distinct Twitter datasets demonstrates the superiority of the approach in terms of bot detection accuracy.
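
The train, validate, and select step described in the summary can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the data are synthetic, the two features (`tweets_per_day` and a noise column) are hypothetical, the candidates are toy stand-ins rather than XGBoost or the other models the authors compared, and k-fold cross-validation is assumed as the validation scheme because the summary does not specify one.

```python
import random


def make_toy_accounts(n=200, seed=0):
    """Build synthetic (features, label) rows; label 1 means bot.

    Hypothetical data: the first feature mimics tweets per day and is
    higher on average for bots, the second is uninformative noise.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        is_bot = rng.random() < 0.5
        tweets_per_day = rng.gauss(60.0 if is_bot else 10.0, 15.0)
        noise = rng.gauss(0.0, 1.0)
        rows.append(([tweets_per_day, noise], int(is_bot)))
    return rows


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(int(t == p) for t, p in zip(y_true, y_pred)) / len(y_true)


class MajorityClass:
    """Toy baseline: always predicts the most common training label."""

    def fit(self, X, y):
        self.label = int(2 * sum(y) >= len(y))
        return self

    def predict(self, X):
        return [self.label] * len(X)


class ThresholdStump:
    """Toy model: a single (feature, threshold, label-above) rule."""

    def fit(self, X, y):
        self.rule, best_acc = (0, 0.0, 1), -1.0
        for j in range(len(X[0])):
            for t in sorted({x[j] for x in X}):
                for above in (0, 1):
                    preds = [above if x[j] >= t else 1 - above for x in X]
                    acc = accuracy(y, preds)
                    if acc > best_acc:
                        self.rule, best_acc = (j, t, above), acc
        return self

    def predict(self, X):
        j, t, above = self.rule
        return [above if x[j] >= t else 1 - above for x in X]


def cross_val_accuracy(make_model, rows, k=5):
    """Mean held-out accuracy over k folds (rows are split by stride)."""
    folds = [rows[i::k] for i in range(k)]
    scores = []
    for i, test in enumerate(folds):
        train = [r for j, f in enumerate(folds) if j != i for r in f]
        model = make_model().fit([x for x, _ in train], [y for _, y in train])
        preds = model.predict([x for x, _ in test])
        scores.append(accuracy([y for _, y in test], preds))
    return sum(scores) / k


# Validate every candidate the same way, then keep the best scorer.
candidates = {"majority_baseline": MajorityClass, "threshold_stump": ThresholdStump}
rows = make_toy_accounts()
scores = {name: cross_val_accuracy(cls, rows) for name, cls in candidates.items()}
best_name = max(scores, key=scores.get)
print(scores, "selected:", best_name)
```

In a realistic setting the toy classes would be replaced by actual learners exposing the same `fit`/`predict` interface, and accuracy could be swapped for whichever detection metric is being compared.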
