Detecting large-scale system problems by mining console logs
University of California, Berkeley · Intel (United States)
Abstract
Surprisingly, console logs rarely help operators detect problems in large-scale datacenter services, for they often consist of the voluminous intermixing of messages from many software components written by independent developers. We propose a general methodology to mine this rich source of information to automatically detect system runtime problems. We first parse console logs by combining source code analysis with information retrieval to create composite features. We then analyze these features using machine learning to detect operational problems. We show that our method enables analyses that are impossible with previous methods because of its superior ability to create sophisticated features. We also show…
Citation impact
- FWCI
- 43.17
- Percentile
- 100%
- References
- 45
Authors
5Topics & keywords
- Computer science
- Parsing
- False positive paradox
- Software
- Source code
- Service (business)
- Server
- Scale (ratio)
- Peace, Justice and strong institutions