CLUEBOX: A Performance Log Analyzer for Automated Troubleshooting
نویسندگان
چکیده
Performance problems in complex systems are often caused by underprovisioning, workload interference, incorrect expectations or bugs. Troubleshooting such systems is a difficult task faced by service engineers. We have built CLUEBOX, a non-intrusive toolkit that aids rapid problem diagnosis. It employs machine learning techniques on the available performance logs to characterize workloads, predict performance and discover anomalous behavior. By identifying the most relevant anomalies to focus on, CLUEBOX automates the most onerous aspects of performance troubleshooting. We have experimentally validated our methodology in a networked storage environment with real workloads. Using CLUEBOX to learn from a set of historical performance observations, we were able to distill over 2000 performance counters into 68 counters that succinctly describe a running workload. Further, we demonstrate effective troubleshooting of two scenarios that adversely impacted application response time: (1) an unknown competing workload, and (2) a file system consistency checker. By reducing the set of anomalous counters to examine to a dozen significant ones, CLUEBOX was able to guide a systems engineer towards identifying the correct root-cause rapidly.
منابع مشابه
Automated Troubleshooting of Mobile Networks Using Bayesian Networks
In the current telecommunication scenarios operators have to cope with fast technological changes while increasing operational efficiency, i.e. diminishing operational expenditures and, at the same time, maximising performance of the networks. In this paper we present an automated troubleshooting tool for cellular networks, based on Bayesian networks, which will contribute to improve operationa...
متن کاملSecure Bio-Cryptographic Authentication System for Cardless Automated Teller Machines
Security is a vital issue in the usage of Automated Teller Machine (ATM) for cash, cashless and many off the counter banking transactions. Weaknesses in the use of ATM machine could not only lead to loss of customer’s data confidentiality and integrity but also breach in the verification of user’s authentication. Several challenges are associated with the use of ATM smart card such as: card clo...
متن کاملClinical evaluation of Eastman Kodak's Ektachem 400 Analyzer.
We evaluated the performance of the Kodak Ektachem 400 Analyzer in a 16-week clinical trial. We assessed four potentiometric tests and nine colorimetric tests for precision and correlation with results obtained some other commonly used instruments (Technicon SMA II and C800 System, Du Pont aca II, and Baker CentrifiChem). The comparison was favorable for all tests except albumin, sodium, and ca...
متن کاملAutomated RRM Optimization of LTE networks using Statistical Learning
The mobile telecommunication industry has experienced a very rapid growth in the recent past. This has resulted in significant technological and architectural evolution in the wireless networks. The expansion and the heterogenity of these networks have made their operational cost more and more important. Typical faults in these networks may be related to equipment breakdown and inappropriate pl...
متن کاملIS-IS Network Design Solutions
• Extensive coverage of both underlying concepts and practical applications of the IS-IS protocol • Detailed explanation of how the IS-IS database works and relevant insights into the operation of the shortest path first (SPF) algorithm • Comprehensive tutorial on configuring and troubleshooting IS-IS on Cisco routers • Advanced information on IP network design and performance optimization stra...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008