Efficient Visual Fault Detection for Freight Train Braking System via Heterogeneous Self Distillation in the Wild
Abstract: Efficient visual fault detection of freight trains is critical to ensuring the safe operation of railways in restricted hardware environments. Although deep learning-based approaches have excelled in object detection, freight train fault detection is still not efficient enough for real-world engineering applications. This paper proposes a heterogeneous self-distillation framework that ensures detection accuracy and speed while satisfying low resource requirements. The privileged information in the output feature knowledge can be transferred from the teacher to the student model through distillation to boost performance. We first adopt a lightweight backbone to extract features and generate a new heterogeneous knowledge neck. This neck models positional information and long-range dependencies among channels through parallel encoding to improve feature extraction. Then, we utilize the general distribution to obtain more credible and accurate bounding box estimates. Finally, we employ a novel loss function that helps the network concentrate on values near the label to improve learning efficiency. Experiments on four fault datasets reveal that our framework can achieve over 37 frames per second and maintain the highest accuracy in comparison with traditional distillation approaches. Moreover, compared to state-of-the-art methods, our framework demonstrates more competitive performance with lower memory usage and the smallest model size.
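The abstract does not spell out how the general distribution or the loss is defined. As a rough illustration only, the sketch below assumes a formulation in the style of Distribution Focal Loss from the Generalized Focal Loss line of work: each box side is predicted as a discrete distribution over integer bins, the estimate is the expected value of that distribution, and the loss pushes probability mass onto the two bins adjacent to the label. The function names, the bin count, and the loss form are assumptions, not the authors' implementation.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_offset(logits):
    """Box-side estimate as the expected value over bins 0..n (assumed form)."""
    probs = softmax(logits)
    return sum(i * p for i, p in enumerate(probs))


def distribution_focal_loss(logits, target):
    """Cross-entropy on the two bins that bracket the continuous label.

    Assumed form: -((y_r - y) * log p_l + (y - y_l) * log p_r), where
    y_l and y_r are the integer bins immediately left and right of y.
    """
    n = len(logits) - 1
    if not 0.0 <= target <= n:
        raise ValueError("target must lie within the bin range [0, n]")
    left = min(int(math.floor(target)), n - 1)  # keep right bin in range
    right = left + 1
    probs = softmax(logits)
    w_left = right - target   # weight is larger for the nearer bin
    w_right = target - left
    return -(w_left * math.log(probs[left]) + w_right * math.log(probs[right]))


if __name__ == "__main__":
    uniform = [0.0] * 5                       # no preference among bins
    peaked = [-5.0, -5.0, 5.0, 4.0, -5.0]     # mass near bins 2 and 3
    print(expected_offset(uniform))
    print(distribution_focal_loss(uniform, 2.3))
    print(distribution_focal_loss(peaked, 2.3))
```

Under this assumed form, a prediction whose mass sits on the bins next to the label incurs a lower loss than a flat prediction, which is one way to read "concentrate on values near the label".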