Paper published in a book (Scientific congresses and symposiums)
RABBIT: A tool for identifying bot accounts based on their recent GitHub event history
Chidambaram, Natarajan; Mens, Tom; Decan, Alexandre
2024In 21st International Conference on Mining Software Repositories
Peer reviewed
 

Files


Full Text
article.pdf
Author postprint (455.63 kB)
Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Keywords :
GitHub events; classification model; bot identification
Abstract :
[en] Collaborative software development through GitHub repositories frequently relies on bot accounts to automate repetitive and error-prone tasks. This highlights the need to have accurate and efficient bot identification tools. Several such tools have been proposed in the past, but they tend to rely on a substantial amount of historical data, or they limit themselves to a reduced subset of activity types, making them difficult to use at large scale. To overcome these limitations, we developed RABBIT, an open source command-line tool that queries the GitHub Events API to retrieve the recent events of a given GitHub account and predicts whether the account is a human or a bot. RABBIT is based on an XGBoost classification model that relies on six features related to account activities and achieves high performance, with an AUC, F1 score, precision and recall of 0.92. Compared to the state-of-the-art in bot identification, RABBIT exhibits a similar performance in terms of precision, recall and F1 score, while being more than an order of magnitude faster and requiring considerably less data. This makes RABBIT usable on a large scale, capable of processing several thousand accounts per hour efficiently.
Disciplines :
Computer science
DOI :
10.1145/3643991.3644877
Author, co-author :
Chidambaram, Natarajan  ;  Université de Mons - UMONS > Faculté des Science > Service de Génie Logiciel
Mens, Tom  ;  Université de Mons - UMONS > Faculté des Sciences > Service de Génie Logiciel
Decan, Alexandre  ;  Université de Mons - UMONS > Faculté des Sciences > Service de Génie Logiciel
Language :
English
Title :
RABBIT: A tool for identifying bot accounts based on their recent GitHub event history
Publication date :
16 April 2024
Event name :
21st International Conference on Mining Software Repositories
Event organizer :
ACM
Event place :
Lisbon, Portugal
Event date :
15 & 16 April 2024
Audience :
International
Main work title :
21st International Conference on Mining Software Repositories
Publisher :
ACM, Lisbon, Portugal
ISBN/EAN :
979-8-4007-0587-8
Pages :
5
Peer reviewed :
Peer reviewed
Research unit :
S852 - Génie Logiciel
Research institute :
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique
Funders :
Service Public de Wallonie Recherche
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
Funding number :
2010235; J.0147.24; T.0149.22; F.4515.23
Funding text :
This work is supported by Service Public de Wallonie Recherche under grant number 2010235 - ARIAC by DigitalWallonia4.AI, and by the Fonds de la Recherche Scientifique – FNRS under grant numbers J.0147.24, T.0149.22, and F.4515.23.
Available on ORBi UMONS :
since 06 February 2024

Statistics


Number of views
64 (16 by UMONS)
Number of downloads
3 (2 by UMONS)

Bibliography


Similar publications



Contact ORBi UMONS