Spam Comments Detection on Instagram Using Machine Learning and Deep Learning Methods
نویسندگان
چکیده
The more popular a public figure on Instagram (IG), the number of followers also increase. When posts something, there are many comments from other users. In fact, all comments, not them relevant to post, such as advertising, links, or clickbait comments. type that irrelevant post is usually called spam Spam will interfere with information flow and may lead misleading information. This research compares machine learning (ML) deep (DL) classification methods based our collected Indonesian IG comment dataset. was conducted in following steps: dataset preparation, pre-processing, simple normalization, features generation using TF-IDF word embedding, application ML DL methods, performance evaluation, comparison. authors compare accuracy, F-1, precision, recall results. shows do significantly differ. Linear SVM, Extreme Tree (ET), Regression, Stochastics Gradient Descent algorithms can reach accuracy 0.93. At same time, method has highest 0.94 SimpleTransformer BERT architecture. difference between different.
منابع مشابه
SMS Spam Detection using Machine Learning Approach
Over recent years, as the popularity of mobile phone devices has increased, Short Message Service (SMS) has grown into a multi-billion dollars industry. At the same time, reduction in the cost of messaging services has resulted in growth in unsolicited commercial advertisements (spams) being sent to mobile phones. In parts of Asia, up to 30% of text messages were spam in 2012. Lack of real data...
متن کاملEmail Spam Detection A Machine Learning Approach
Machine learning is a branch of artificial intelligence concerned with the creation and study of systems that can learn from data. A machine learning system could be trained to distinguish between spam and non-spam (ham) emails. We aim to analyze current methods in machine learning to identify the best techniques to use in content-based spam filtering.
متن کاملWeb Spam Detection Using Machine Learning in Specific Domain Features
In the last few years, as Internet usage becomes the main artery of the life's daily activities, the problem of spam becomes very serious for internet community. Spam pages form a real threat for all types of users. This threat proved to evolve continuously without any clue to abate. Different forms of spam witnessed a dramatic increase in both size and negative impact. A large amount of E-mail...
متن کاملDetection and Filtering Spam using Feature Selection and Learning Machine Methods
In recent years, email has turned out to be among the pervasive and cost-effective tools of communication. In the meantime, spam emails have reduced their popularity and become offensive to all individuals and users applying this capability. Email filtering is the first solution to cope with this challenge. This is developed as a special type of text classification. A variety of methods includi...
متن کاملon the comparison of keyword and semantic-context methods of learning new vocabulary meaning
the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lontar Komputer
سال: 2022
ISSN: ['2088-1541', '2541-5832']
DOI: https://doi.org/10.24843/lkjiti.2022.v13.i01.p05