A team of researchers, including faculty at Binghamton University, has developed machine learning algorithms which can successfully identify bullies and aggressors on Twitter with 90 percent accuracy.
Effective tools for detecting harmful actions on social media are scarce, as this type of behavior is often ambiguous in nature and/or exhibited via seemingly superficial comments and criticisms. Aiming to address this gap, a research team featuring Binghamton University computer scientist Jeremy Blackburn analyzed the behavioral patterns exhibited by abusive Twitter users and their differences from other Twitter users.
"We built crawlers—programs that collect data from Twitter via variety of mechanisms," Blackburn says. "We gathered tweets of Twitter users, their profiles, as well as [social] network-related things, like who they follow and who follows them."
The researchers then performed natural language processing and sentiment analysis on the tweets themselves, as well as a variety of social network analyses on the connections between users. They describe their work in "Detecting Cyberbullying and Cyberaggression in Social Media," published in ACM Transactions on the Web.
The researchers developed algorithms to automatically classify two specific types of offensive online behavior, i.e., cyberbullying and cyberaggression. The algorithms were able to identify abusive users on Twitter with 90 percent accuracy. These are users who engage in harassing behavior, e.g. those who send death threats or make racist remarks to users.
"In a nutshell, the algorithms 'learn' how to tell the difference between bullies and typical users by weighing certain features as they are shown more examples," Blackburn says.
While this research can help mitigate cyberbullying, it is only a first step, he says.
"One of the biggest issues with cyber safety problems is the damage being done is to humans, and is very difficult to 'undo,'" Blackburn says. "For example, our research indicates that machine learning can be used to automatically detect users that are cyberbullies, and thus could help Twitter and other social media platforms remove problematic users. However, such a system is ultimately reactive: it does not inherently prevent bullying actions, it just identifies them taking place at scale. And the unfortunate truth is that even if bullying accounts are deleted, even if all their previous attacks are deleted, the victims still saw and were potentially affected by them."
Additional authors of the Transactions on the Web article are Despoina Chatzakou of the Centre for Research and Technology Hellas; Ilias Leontiadis of Samsung AI; Emiliano De Cristofaro of University College London; Gianluca Stringhini of Boston University; Athena Vakali of Aristotle University of Thessaloniki; and Nicolas Kourtellis of Telefonica Research.
Blackburn and the team are currently exploring pro-active mitigation techniques to deal with harassment campaigns.