CaPBug-A Framework for Automatic Bug Categorization and Prioritization Using NLP and Machine Learning Algorithms

نویسندگان

چکیده

Bug reports facilitate software development teams in improving the quality of software. These include significant information related to problems encountered within a software, possible enhancement suggestions, and other potential issues. are typically complex too detailed; hence lot resources required analyze process them manually. Moreover, it leads delays resolution high priority bugs. Accurate timely processing bug based on their category plays role maintenance. Therefore, an automated categorization prioritization is needed address aforementioned Automated have been explored recently by many researchers; however, limited progress has made this regard. In research, we present novel framework, titled CaPBug, for reports. The framework implemented using Natural Language Processing (NLP) supervised machine learning algorithms. A baseline corpus built with six categories five levels analyzing more than 2000 Mozilla Eclipse repository. Four classification algorithms i.e., Naive Bayes, Random Forest, Decision Tree, Logistic Regression used categorize prioritize We demonstrate that CaPBug achieved accuracy 88.78% Forest classifier textual feature predicting category. Similarly, 90.43% was Synthetic Minority Over-Sampling Technique (SMOTE) applied class imbalance issue classes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

Automatic bug triage using text categorization

Bug triage, deciding what to do with an incoming bug report, is taking up increasing amount of developer resources in large open-source projects. In this paper, we propose to apply machine learning techniques to assist in bug triage by using text categorization to predict the developer that should work on the bug based on the bug’s description. We demonstrate our approach on a collection of 15,...

متن کامل

A Comparative Study on Statistical Machine Learning Algorithms and Thresholding Strategies for Automatic Text Categorization

Two main research areas in statistical text categorization are similarity-based learning algorithms and associated thresholding strategies. The combination of these techniques significantly influences the overall performance of text categorization. After investigating two similarity-based classifiers (k-NN and Rocchio) and three common thresholding techniques (RCut, PCut, and SCut), we describe...

متن کامل

a framework for identifying and prioritizing factors affecting customers’ online shopping behavior in iran

the purpose of this study is identifying effective factors which make customers shop online in iran and investigating the importance of discovered factors in online customers’ decision. in the identifying phase, to discover the factors affecting online shopping behavior of customers in iran, the derived reference model summarizing antecedents of online shopping proposed by change et al. was us...

15 صفحه اول

Automatic Differentiation of Algorithms for Machine Learning

Automatic differentiation—the mechanical transformation of numeric computer programs to calculate derivatives efficiently and accurately—dates to the origin of the computer age. Reverse mode automatic differentiation both antedates and generalizes the method of backwards propagation of errors used in machine learning. Despite this, practitioners in a variety of fields, including machine learnin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3069248