Towards More Robust Methods of Cyberbullying Detection Público
Ziems, Caleb (Spring 2020)
Abstract
Cyberbullying is a pervasive problem in online communities. To identify cyberbullying cases in large-scale social networks, content moderators depend on machine learning classifiers for automatic cyberbullying detection. However, existing models remain unfit for real-world applications, largely due to a shortage of publicly available training data and a lack of standard criteria for assigning ground truth labels. In this study, we address the need for reliable data using an original annotation framework. Inspired by social sciences research into bullying behavior, we characterize the nuanced problem of cyberbullying using five explicit factors to represent its social and linguistic aspects. We model this behavior using social network and language-based features, which improves classifier performance. Lastly, we develop a method for inferring the target of aggression in the message thread, and we evaluate this approach on hand-labeled data. These results demonstrate the importance of representing and modeling cyberbullying as a social phenomenon.
Table of Contents
Introduction.............................1
Background..............................4
Data.......................................10
Feature Engineering................16
Model Evaluation....................27
Inferring the Target User.........32
Conclusion.............................36
About this Honors Thesis
School | |
---|---|
Department | |
Degree | |
Submission | |
Language |
|
Research Field | |
Palavra-chave | |
Committee Chair / Thesis Advisor | |
Committee Members |
Primary PDF
Thumbnail | Title | Date Uploaded | Actions |
---|---|---|---|
Towards More Robust Methods of Cyberbullying Detection () | 2020-04-27 21:40:26 -0400 |
|
Supplemental Files
Thumbnail | Title | Date Uploaded | Actions |
---|