Logs captures a real-time snapshot of the system and are widely used to detect anomalies in complex environments. Since the systems are highly complex and generate large volume of logs, manual log inspection becomes infeasible. To reduce manual effort, we used optimized automated log parsing algorithm and utilized anomaly detection to find erroneous logs. Since the data are highly imbalanced with ratio (30:1), therefore any traditional machine learning approach would not be a good choice for classification.
Here we have used two different methodologies, one is supervised predictive analytics framework, and an unsupervised auto encoder, to find the anomaly. We have seen that our auto encoder model has captured lot of anomalies and the accuracy is also moderate (82%). Also, we have seen that