
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection (2401.04749v1)

Published 9 Jan 2024 in cs.LG, cs.AI, and cs.SE

Abstract: Log anomaly detection is a key component in the field of artificial intelligence for IT operations (AIOps). Given log data from varied domains, retraining the whole network for every unknown domain is inefficient in real industrial scenarios; however, previous deep models merely focused on extracting the semantics of log sequences within the same domain, leading to poor generalization on multi-domain logs. To alleviate this issue, we propose a unified Transformer-based framework for log anomaly detection (LogFormer) to improve generalization across different domains, built around a two-stage process consisting of a pre-training stage and an adapter-based tuning stage. Specifically, our model is first pre-trained on the source domain to obtain shared semantic knowledge of log data. Then, we transfer such knowledge to the target domain via shared parameters. In addition, a Log-Attention module is proposed to supplement the information ignored by log parsing. The proposed method is evaluated on three public datasets and one real-world dataset. Experimental results on multiple benchmarks demonstrate the effectiveness of LogFormer with fewer trainable parameters and lower training costs.


Summary

  • The paper introduces LogFormer, a two-stage method that pre-trains on source-domain log data and adapts to new domains via an adapter module.
  • The experimental results demonstrate superior performance with high precision, recall, and F1 scores on benchmark and real-world datasets.
  • The paper's approach effectively handles partially structured logs and preserves semantic context, reducing retraining overhead for anomaly detection.

Overview

Anomaly detection in log data is critical for maintaining the health and security of IT operations. With the ever-increasing volume and complexity of log data across domains, an efficient and generalizable way to identify anomalies from varied sources becomes essential. The model introduced in this paper, LogFormer, is a substantial step forward in this area, using a Transformer-based architecture to detect log anomalies.

The Challenge of Log Anomaly Detection

Traditional log anomaly detection methods face limitations, especially when encountering logs from new or multiple domains. Approaches that preprocess logs into event templates usually discard valuable semantic information, and retraining existing models to accommodate new log data is resource-intensive. Detection is further complicated by logs that are only partially structured and contain elements resembling natural language.
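
To make the information-loss point concrete, here is a minimal sketch of template-based parsing, assuming a Drain-style template with `<*>` wildcards; the log line, template, and regex are illustrative examples, not taken from the paper:

```python
import re

# Illustrative HDFS-style log line and a Drain-style template for it.
raw = "Received block blk_3587508140051953248 of size 67108864 from /10.251.42.84"
template = "Received block <*> of size <*> from <*>"

# Turn the template into a regex so we can see which tokens the parser
# collapses into wildcards.
pattern = re.escape(template).replace(re.escape("<*>"), r"(\S+)")
params = re.fullmatch(pattern, raw).groups()
print(params)  # ('blk_3587508140051953248', '67108864', '/10.251.42.84')

# A detector fed only the template (event ID) never sees these values,
# which is the gap the paper's Log-Attention module aims to fill.
```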

LogFormer Architecture

LogFormer tackles these challenges through a two-stage process: pre-training and adapter-based tuning. The model is first pre-trained on logs from a source domain to capture semantic patterns common to log data. After pre-training, an adapter component transfers this knowledge to target domains with differing log characteristics, enabling LogFormer to generalize across log sources while keeping the shared parameters fixed. A further component, the Log-Attention module, is designed to recover the parameter information that is typically discarded during log parsing.
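
As a rough illustration of what the adapter-based tuning stage can look like, the sketch below freezes a pre-trained Transformer encoder layer and trains only a small bottleneck adapter. This follows the common adapter design of Houlsby et al. rather than the paper's exact layer; the module names, dimensions, and residual placement are assumptions:

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter: project down, apply a nonlinearity, project back
    up, and add a residual connection. Only these weights are trained when
    transferring to a new log domain."""
    def __init__(self, hidden_dim: int = 256, bottleneck_dim: int = 32):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(self.act(self.down(x)))

class AdaptedEncoderLayer(nn.Module):
    """Wrap a frozen pre-trained encoder layer with a trainable adapter."""
    def __init__(self, pretrained_layer: nn.Module, hidden_dim: int = 256):
        super().__init__()
        self.layer = pretrained_layer
        self.adapter = Adapter(hidden_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.adapter(self.layer(x))

# Stage 1 (pre-training) would train this encoder on source-domain logs.
encoder = nn.TransformerEncoderLayer(d_model=256, nhead=4, batch_first=True)

# Stage 2 (adapter-based tuning): freeze the shared parameters and train
# only the adapter on the target domain.
for p in encoder.parameters():
    p.requires_grad = False
model = AdaptedEncoderLayer(encoder)

out = model(torch.randn(2, 16, 256))  # (batch, seq_len, hidden_dim)
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable: {trainable} / {total}")  # adapters are a small fraction
```

Because only the adapter (and typically a classification head) would be updated per domain, the per-domain trainable parameter count stays small, which matches the paper's claim of fewer trainable parameters and lower training costs.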

Experimental Results

LogFormer was evaluated on three public benchmark datasets and a real-world dataset from a cloud service company. The results show superior precision, recall, and F1 scores compared with existing state-of-the-art models, achieved with fewer trainable parameters and lower training costs, underlining the method's practical viability for industrial applications.

Conclusion

In summary, LogFormer offers an effective solution for log anomaly detection across different domains without extensive retraining. Its two-stage pipeline and Log-Attention mechanism equip it to handle the intricacies of log data semantics, making it a practical tool for AIOps in IT environments.