Automatic bug triage using text categorization
نویسندگان
چکیده
Bug triage, deciding what to do with an incoming bug report, is taking up increasing amount of developer resources in large open-source projects. In this paper, we propose to apply machine learning techniques to assist in bug triage by using text categorization to predict the developer that should work on the bug based on the bug’s description. We demonstrate our approach on a collection of 15,859 bug reports from a large open-source project. Our evaluation shows that our prototype, using supervised Bayesian learning, can correctly predict 30% of the report assignments to
منابع مشابه
Automatic Bug Triage using Semi-Supervised Text Classification
In this paper, we propose a semi-supervised text classification approach for bug triage to avoid the deficiency of labeled bug reports in existing supervised approaches. This new approach combines naive Bayes classifier and expectationmaximization to take advantage of both labeled and unlabeled bug reports. This approach trains a classifier with a fraction of labeled bug reports. Then the appro...
متن کاملAnalysis of Bug Triage using Data Preprocessing (Reduction) Techniques
In the bug triage we have an unavoidable step of fixing the bugs which helps in correctly assigning a developer to a new bug. Text classification and binary classification techniques are applied to decrease the time cost in manual work and to enhance the working of automatic bug triage. We address the problem of data reduction and hence we combine the instance selection and the feature selectio...
متن کاملAnalysis of Bug Triage using Data Preprocessing (Reduction) Techniques
In the bug triage we have an unavoidable step of fixing the bugs which helps in correctly assigning a developer to a new bug. Text classification and binary classification techniques are applied to decrease the time cost in manual work and to enhance the working of automatic bug triage. We address the problem of data reduction and hence we combine the instance selection and the feature selectio...
متن کاملBug Triage Based on Ant System with Evaporation Factor Tuning
Bug Triaging is an important aspect of Bug Management in Open Source Systems. Bug Triaging pertains to assignment of a newly opened bug to an appropriate developer for resolution. This task is complicated in an open source environment because of the nature of the Open Source Development. Open Source Software Development comprises of the developers who are volunteers. The volunteers are distribu...
متن کاملIntegrating image data into biomedical text categorization
Categorization of biomedical articles is a central task for supporting various curation efforts. It can also form the basis for effective biomedical text mining. Automatic text classification in the biomedical domain is thus an active research area. Contests organized by the KDD Cup (2002) and the TREC Genomics track (since 2003) defined several annotation tasks that involved document classific...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004