Abusive Content Detection in Arabic Tweets Using Multi-Task Learning and Transformer-Based Models
نویسندگان
چکیده
Different social media platforms have become increasingly popular in the Arab world recent years. The increasing use of media, however, has also led to emergence a new challenge form abusive content, including hate speech, offensive language, and language. Existing research work focuses on automatic content detection as binary classification problem. In addition, existing task surrounding Arabic fails tackle dialect-specific phenomenon. Consequently, this two important issues task. study, we used multi-aspect annotation schema problem countries, based multi-class dialectal (DA)-specific More precisely, includes five attributes: directness, hostility, target, group, annotator. We specifically developed framework automatically detecting Twitter using natural language processing (NLP) techniques. different models machine learning (ML), deep (DL), pretrained (LMs) dataset. investigate impact other approaches, such multi-task (MTL), four MTL built top DA model (called MARBERT) trained Our LMs enhanced performance compared DL mentioned literature.
منابع مشابه
Abusive Language Detection on Arabic Social Media
In this paper, we present our work on detecting abusive language on Arabic social media. We extract a list of obscene words and hashtags using common patterns used in offensive and rude communications. We also classify Twitter users according to whether they use any of these words or not in their tweets. We expand the list of obscene words using this classification, and we report results on a n...
متن کاملAbusive Language Detection in Online User Content
Detection of abusive language in user generated online content has become an issue of increasing importance in recent years. Most current commercial methods make use of blacklists and regular expressions, however these measures fall short when contending with more subtle, less ham-fisted examples of hate speech. In this work, we develop a machine learning based method to detect hate speech on o...
متن کاملTransferring Multi-device Localization Models using Latent Multi-task Learning
In this paper, we propose a latent multi-task learning algorithm to solve the multi-device indoor localization problem. Traditional indoor localization systems often assume that the collected signal data distributions are fixed, and thus the localization model learned on one device can be used on other devices without adaptation. However, by empirically studying the signal variation over differ...
متن کاملMulti-objective Based Optimization Using Tap Setting Transformer, DG and Capacitor Placement in Distribution Networks
In this article, a multi-objective function for placement of Distributed Generation (DG) and capacitors with thetap setting of Under Load Tap Changer (ULTC) Transformer is introduced. Most of the recent articles have paidless attention to DG, capacitor placement and ULTC effects in the distribution network simultaneously. Insimulations, a comparison between different modes was carried out with,...
متن کاملUsing Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media
Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13105825