Developing Analytical Tools for Arabic Sentiment Analysis of COVID-19 Data

نویسندگان

چکیده

Due to the widespread distribution of coronavirus and existence a massive quantity data on social networking sites, particularly Twitter, there was an urgent need develop model that evaluates users’ emotions determines how they feel about pandemic. However, absence resources assist Sentiment Analysis (SA) in Arabic hampered completion this endeavor. This work presents ArSentiCOVID lexicon, first largest SA lexicon for COVID-19 handles negation emojis. We design lexicon-based sentiment analyzer tool depends mainly perform three-way classification. Furthermore, we employ automatically assemble 42K annotated tweets COVID-19. conduct two experiments. First, test effect applying emoji rules created lexicon. The results indicate after emoji, negation, both rules, F-score improved by 2.13%, 4.13%, 6.13%, respectively. Second, applied ensemble method combines four feature groups (n-grams, polarity, emojis) as input features eight Machine Learning (ML) classifiers. reveal Random Forest (RF) Support Vector (SVM) classifiers best, combined are best representing produced maximum accuracy (92.21%), precision (92.23%), recall (92.23%) with 3.2% improvement over base model.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a Method for Assessing and Managing the Risk of Covid-19; Rapid Covid-19 Hazard analysis

Background and aims: Work environments are constantly changing under the influence of various factors and newer risks are introduced. Rapid changes in science and technology, increasing the complexity of the industry, increased system integration and other factors have been shown to increase total risk in the past few decades. As well, risk management becomes increasingly critical in decreasing...

متن کامل

Document Embeddings for Arabic Sentiment Analysis

Research and industry are more and more focusing in finding automatically the polarity of an opinion regarding a specific subject or entity. Paragraph vector has been recently proposed to learn embeddings which are leveraged for English sentiment analysis. This paper focuses on Arabic sentiment analysis and investigates the use of paragraph vector within a machine learning techniques to determi...

متن کامل

Geographical Analysis of COVID-19 Epidemiology in Iran with Exploratory Spatial Data Analysis Approach (ESDA)

Background and Aim: The use of geophysical analysis of the epidemiology to identify geographical factors affecting the prevalence of the disease can be effective on community health policies to control the prevalence of the virus. Therefore, the present study is a geographical analysis of the COVID-19 epidemiology in Iran. Therefore, the purpose of this study is the geographical analysis of co...

متن کامل

Sentiment Analysis of Social Networking Data Using Categorized Dictionary

Sentiment analysis is the process of analyzing a person’s perception or belief about a particular subject matter. However, finding correct opinion or interest from multi-facet sentiment data is a tedious task. In this paper, a method to improve the sentiment accuracy by utilizing the concept of categorized dictionary for sentiment classification and analysis is proposed.  A categorized dictiona...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Algorithms

سال: 2023

ISSN: ['1999-4893']

DOI: https://doi.org/10.3390/a16070318