Their Reporting Latency Bug Reports With Unknown Reporting Latency Feature Extraction Feature Extraction Model Construction Model Application Model Training Phase Deployment Phase Predicted Reporting Latency Label
نویسندگان
چکیده
Not all bugs in software would be experienced and reported by end users right away: Some bugs manifest themselves quickly and may be reported by users a few days after they get into the code base; others manifest many months or even years later, and may only be experienced and reported by a small number of users. We refer to the period of time between the time when a bug is introduced into code and the time when it is reported by a user as bug reporting latency. Knowledge of bug reporting latencies has an implication on prioritization of bug fixing activities—bugs with low reporting latencies may be fixed earlier than those with high latencies to shift debugging resources towards bugs highly concerning users. To investigate bug reporting latencies, we analyze bugs from three Java software systems: AspectJ, Rhino, and Lucene. We extract bug reporting data from their version control repositories and bug tracking systems, identify bug locations based on bug fixes, and back-trace bug introducing time based on change histories of the buggy code. Also, we remove nonessential changes, and most importantly, recover root causes of bugs from their treatments/fixes. We then calculate the bug reporting latencies, and find that bugs have diverse reporting latencies. Based on the calculated reporting latencies and features we extract from bugs, we build classification models that can predict whether a bug would be reported early (within 30 days) or later, which may be helpful for prioritizing bug fixing activities. Our evaluation on the three software systems shows that our bug reporting latency prediction models could achieve an AUC (Area Under the Receiving Operating Characteristics Curve) of 70.869%.
منابع مشابه
OPTIMIZATION-BASED MONITORING-SUPPORTED CALIBRATION OF A THERMAL PERFORMANCE SIMULATION MODEL
Building performance simulation is being increasingly deployed beyond the building design phase to support efficient building operation. Specifically, the predictive feature of the simulation-assisted building systems control strategy provides distinct advantages in view of building systems with high latency and inertia. Such advantages can be exploited only if model predictions can be relied u...
متن کاملCompilation a Model for Reporting Effectiveness for Stakeholders (Case Study in the Superum Audit Court)
Developing a Reporting Effectiveness Model for Stakeholders (Case Study in the Superum Audit Court) Abstract: The Court of Auditors' reports are an independent and rational assessment of the environment in the public sector and strive to enhance the value of government agencies through legislation and guidance to improve performance. The purpose of the present study is to conceptualize and ext...
متن کاملFast and Accurate Performance Analysis of LTE Radio Access Networks
An increasing amount of analytics is performed on data that is procured in a real-time fashion to make real-time decisions. Such tasks include simple reporting on streams to sophisticated model building. However, the practicality of such analyses are impeded in several domains because they are faced with a fundamental trade-off between data collection latency and analysis accuracy. In this pape...
متن کاملReno Ringermute : An audio data mining toolkit
This thesis presents Ringermute, an application designed to support audio feature recognition and machine learning, from the training and testing to the deployment phase. By choosing from a combination of feature extraction routines provided by plug-ins, a researcher can quickly produce files for input to standard data mining tools. The best combination of feature-extraction and classifier plug...
متن کاملRingermute: An Audio Data Mining Toolkit
This paper presents Ringermute, an application designed to support audio feature recognition and machine learning, from the training and testing to the deployment phase. By choosing from a combination of feature extraction routines provided by plug-ins, a researcher can quickly produce files for input to standard data mining tools. The best combination of feature-extraction and classifier plugi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012