Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
173 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Digital Twin-based Anomaly Detection with Curriculum Learning in Cyber-physical Systems (2309.15995v1)

Published 27 Sep 2023 in cs.LG, cs.CR, and cs.SE

Abstract: Anomaly detection is critical to ensure the security of cyber-physical systems (CPS). However, due to the increasing complexity of attacks and CPS themselves, anomaly detection in CPS is becoming more and more challenging. In our previous work, we proposed a digital twin-based anomaly detection method, called ATTAIN, which takes advantage of both historical and real-time data of CPS. However, such data vary significantly in terms of difficulty. Therefore, similar to human learning processes, deep learning models (e.g., ATTAIN) can benefit from an easy-to-difficult curriculum. To this end, in this paper, we present a novel approach, named digitaL twin-based Anomaly deTecTion wIth Curriculum lEarning (LATTICE), which extends ATTAIN by introducing curriculum learning to optimize its learning paradigm. LATTICE attributes each sample with a difficulty score, before being fed into a training scheduler. The training scheduler samples batches of training data based on these difficulty scores such that learning from easy to difficult data can be performed. To evaluate LATTICE, we use five publicly available datasets collected from five real-world CPS testbeds. We compare LATTICE with ATTAIN and two other state-of-the-art anomaly detectors. Evaluation results show that LATTICE outperforms the three baselines and ATTAIN by 0.906%-2.367% in terms of the F1 score. LATTICE also, on average, reduces the training time of ATTAIN by 4.2% on the five datasets and is on par with the baselines in terms of detection delay time.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (70)
  1. WADI: a water distribution testbed for research in the design of secure cyber physical systems. In Proceedings of the 3rd International Workshop on Cyber-Physical Systems for Smart Water Networks. 25–28.
  2. From Passive to Active: Learning Timed Automata Efficiently BT - NASA Formal Methods, Ritchie Lee, Susmit Jha, and Anastasia Mavridou (Eds.). Springer International Publishing, Cham, 1–19.
  3. Andrea Arcuri and Lionel Briand. 2011. A practical guide for using statistical tests to assess randomized algorithms in software engineering. Proceedings - International Conference on Software Engineering (2011), 1–10. DOI:http://dx.doi.org/10.1145/1985793.1985795 
  4. Ensuring safety, security, and sustainability of mission-critical cyber–physical systems. Proc. IEEE 100, 1 (2011), 283–299.
  5. CVAE-GAN: fine-grained image generation through asymmetric training. In Proceedings of the IEEE international conference on computer vision. 2745–2754.
  6. Curriculum Learning. In Proceedings of the 26th Annual International Conference on Machine Learning (ICML ’09). Association for Computing Machinery, New York, NY, USA, 41–48. DOI:http://dx.doi.org/10.1145/1553374.1553380 
  7. Multi-head CNN–RNN for multi-time series anomaly detection: An industrial case study. Neurocomputing 363 (2019), 246–260.
  8. Xinlei Chen and Abhinav Gupta. 2015. Webly supervised learning of convolutional networks. Proceedings of the IEEE International Conference on Computer Vision 2015 International Conference on Computer Vision, ICCV 2015 (2015), 1431–1439. DOI:http://dx.doi.org/10.1109/ICCV.2015.168 
  9. Pseudo-labeling curriculum for unsupervised domain adaptation. arXiv preprint arXiv:1908.00262 (2019).
  10. Visualizing and understanding curriculum learning for long short-term memory networks. arXiv preprint arXiv:1611.06204 (2016).
  11. Detecting concept drift: An information entropy based method using an adaptive sliding window. Intelligent Data Analysis 18, 3 (2014), 337–364. DOI:http://dx.doi.org/10.3233/IDA-140645 
  12. Matthias Eckhart and Andreas Ekelhart. 2018. Towards security-aware virtual environments for digital twins. In Proceedings of the 4th ACM workshop on cyber-physical system security. 61–72.
  13. Matthias Eckhart and Andreas Ekelhart. 2019. Security and Quality in Cyber-Physical Systems Engineering. Number March 2020. DOI:http://dx.doi.org/10.1007/978-3-030-25312-7 
  14. Student-teacher curriculum learning via reinforcement learning: predicting hospital inpatient admission location. In International Conference on Machine Learning. PMLR, 2848–2857.
  15. Abdulmotaleb El Saddik. 2018. Digital Twins: The Convergence of Multimedia Technologies. IEEE MultiMedia 25, 2 (2018), 87–92. DOI:http://dx.doi.org/10.1109/MMUL.2018.023121167 
  16. Matthias Fey and Jan Eric Lenssen. 2019. Fast graph representation learning with PyTorch Geometric. arXiv preprint arXiv:1903.02428 (2019).
  17. Anomaly detection in cyber physical systems using recurrent neural networks. Proceedings of IEEE International Symposium on High Assurance Systems Engineering March 2019 (2017), 140–145. DOI:http://dx.doi.org/10.1109/HASE.2017.36 
  18. Generative adversarial nets. Advances in Neural Information Processing Systems 3, January (2014), 2672–2680.
  19. Automated curriculum learning for neural networks. In international conference on machine learning. PMLR, 1311–1320.
  20. Guy Hacohen and Daphna Weinshall. 2019. On the power of curriculum learning in training deep networks. 36th International Conference on Machine Learning, ICML 2019 2019-June (2019), 4483–4496.
  21. Uncertainty-aware Robustness Assessment of Industrial Elevator Systems. Technical Report.
  22. Are Elevator Software Robust against Uncertainties? Results and Experiences from an Industrial Case Study. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022). Association for Computing Machinery, New York, NY, USA, 1331–1342. DOI:http://dx.doi.org/10.1145/3540250.3558955 
  23. A survey on generative adversarial networks: Variants, applications, and training. ACM Computing Surveys (CSUR) 54, 8 (2021), 1–49.
  24. Mentornet: Learning data-driven curriculum for very deep neural networks on corrupted labels. In International Conference on Machine Learning. PMLR, 2304–2313.
  25. Neil Eklund Justinian Rosca justinian, Nicholas Williard and Zhen Song. 2021. PHM Data Challenge. (Nov 2021). Retrieved November 12, 2021 from https://phmsociety.org/conference/annual-conference-of-the-phm-society/annual-conference-of-the-prognostics-and-health-management-society-2015/phm-data-challenge-3/
  26. Tom Kocmi and Ondrej Bojar. 2017. Curriculum learning and minibatch bucketing in neural machine translation. arXiv preprint arXiv:1707.09533 (2017).
  27. Allison Koenecke and Amita Gajewar. 2019. Curriculum learning in deep neural networks for financial forecasting. In Workshop on Mining Data for Financial Applications. Springer, 16–31.
  28. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Ijcai, Vol. 14. Montreal, Canada, 1137–1145.
  29. Moshe Kravchik and Asaf Shabtai. 2018. Detecting cyber attacks in industrial control systems using convolutional neural networks. Proceedings of the ACM Conference on Computer and Communications Security (2018), 72–83. DOI:http://dx.doi.org/10.1145/3264888.3264896 
  30. Kai A Krueger and Peter Dayan. 2009. Flexible shaping: How learning in small steps helps. Cognition 110, 3 (2009), 380–394. DOI:http://dx.doi.org/https://doi.org/10.1016/j.cognition.2008.11.014 
  31. Self-paced learning for latent variable models. Advances in neural information processing systems 23 (2010), 1189–1197.
  32. Sunil Kumar and Ilyoung Chong. 2018. Correlation analysis to identify the effective data in machine learning: Prediction of depressive disorder and emotion states. International journal of environmental research and public health 15, 12 (2018), 2907.
  33. A deep learning approach for anomaly detection based on SAE and LSTM in mechanical equipment. The International Journal of Advanced Manufacturing Technology 103, 1-4 (2019), 499–510.
  34. A novel cloud-based framework for the elderly healthcare services using digital twin. IEEE Access 7 (2019), 49088–49101.
  35. Deep learning-based anomaly detection in cyber-physical systems: Progress and opportunities. ACM Computing Surveys (CSUR) 54, 5 (2021), 1–36.
  36. Network attack detection using partial deterministic finite automaton pattern matching. (March 8 2011). US Patent 7,904,961.
  37. Testing self-healing cyber-physical systems under uncertainty: a fragility-oriented approach. Software Quality Journal 27, 2 (2019), 615–649.
  38. Alexander Maier. 2014. Online Passive Learning of Timed Automata for Cyber-Physical Production Systems. (2014).
  39. Aditya P Mathur and Nils Ole Tippenhauer. 2016. SWaT: a water treatment testbed for research and training on ICS security. In 2016 International Workshop on Cyber-physical Systems for Smart Water Networks (CySWater). IEEE, 31–36.
  40. Industrial Control System Simulation and Data Logging for Intrusion Detection System Research.
  41. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024–8035. http://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf
  42. Gustavo Penha and Claudia Hauff. 2020. Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking. Advances in Information Retrieval 12035 (2020), 699.
  43. Competence-based curriculum learning for neural machine translation. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference 1 (2019), 1162–1172. DOI:http://dx.doi.org/10.18653/v1/n19-1119 
  44. Shivesh Ranjan and John HL Hansen. 2017. Curriculum learning based approaches for noise robust speaker recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing 26, 1 (2017), 197–210.
  45. Douglas L.T. Rohde and David C. Plaut. 1999. Language acquisition in the absence of explicit negative evidence: How important is starting small? Cognition 72, 1 (1999), 67–109. DOI:http://dx.doi.org/10.1016/S0010-0277(99)00031-1 
  46. T D Sanger. 1994. Neural network learning control of robot manipulators using gradually increasing task difficulty. IEEE Transactions on Robotics and Automation 10, 3 (jun 1994), 323–333. DOI:http://dx.doi.org/10.1109/70.294207 
  47. Roberto M. Scheffel and Antônio A. Fröhlich. 2019. Increasing sensor reliability through confidence attribution. Journal of the Brazilian Computer Society 25, 1 (2019). DOI:http://dx.doi.org/10.1186/s13173-019-0094-6 
  48. Joel L Schiff. 2011. Cellular automata: a discrete view of the world. Vol. 45. John Wiley & Sons.
  49. Training and Tracking in Robotics.. In Ijcai. Citeseer, 670–672.
  50. Curriculum Learning: A Survey. 14, 8 (2021), 1–29. http://arxiv.org/abs/2101.10382
  51. Valentin I Spitkovsky and Daniel Jurafsky. 2010. From Baby Steps to Leapfrog : How “ Less is More ” in Unsupervised Dependency Parsing. June (2010), 751–759.
  52. The Battle Of The Attack Detection Algorithms: Disclosing Cyber Attacks On Water Distribution Networks. Journal of Water Resources Planning and Management 144, 8 (aug 2018), 4018048. DOI:http://dx.doi.org/10.1061/(ASCE)WR.1943-5452.0000969 
  53. Simple and effective curriculum pointer-generator networks for reading comprehension over long narratives. arXiv preprint arXiv:1905.10847 (2019).
  54. Learning the curriculum with bayesian optimization for task-specific word representation learning. arXiv preprint arXiv:1605.03852 (2016).
  55. Jonathan G Tullis and Aaron S Benjamin. 2011. On the effectiveness of self-paced learning. Journal of memory and language 64, 2 (feb 2011), 109–118. DOI:http://dx.doi.org/10.1016/j.jml.2010.11.002 
  56. Heng Wang and Zubin Abraham. 2015. Concept drift detection for streaming data. Proceedings of the International Joint Conference on Neural Networks 2015-Septe (2015). DOI:http://dx.doi.org/10.1109/IJCNN.2015.7280398 
  57. A survey on curriculum learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
  58. STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 11 (2017), 2314–2320. DOI:http://dx.doi.org/10.1109/TPAMI.2016.2636150 
  59. Curriculum learning by transfer learning: Theory and experiments with deep networks. 35th International Conference on Machine Learning, ICML 2018 12 (2018), 8331–8339.
  60. A weighted deep representation learning model for imbalanced fault diagnosis in cyber-physical systems. Sensors 18, 4 (2018), 1096.
  61. Curriculum Learning for Natural Language Understanding. (2020), 6095–6104. DOI:http://dx.doi.org/10.18653/v1/2020.acl-main.542 
  62. Digital Twin-based Anomaly Detection in Cyber-physical Systems. In 2021 14th IEEE Conference on Software Testing, Verification and Validation (ICST). IEEE, 205–216.
  63. Uncertainty-Aware Transfer Learning to Evolve Digital Twins for Industrial Elevators. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022). Association for Computing Machinery, New York, NY, USA, 1257–1268. DOI:http://dx.doi.org/10.1145/3540250.3558957 
  64. Anomaly detection as a service: challenges, advances, and opportunities. Synthesis Lectures on Information Security, Privacy, and Trust 9, 3 (2017), 1–173.
  65. Towards requirements engineering for digital twins of cyber-physical systems. In International Symposium on Leveraging Applications of Formal Methods. Springer, 9–21.
  66. Understanding digital twins for cyber-physical systems: a conceptual model. In International Symposium on Leveraging Applications of Formal Methods. Springer, 54–71.
  67. Uncertainty-wise test case generation and minimization for cyber-physical systems. Journal of Systems and Software 153 (2019), 1–21.
  68. Uncertainty-wise cyber-physical system test modeling. Software & Systems Modeling 18, 2 (2019), 1379–1418.
  69. Curriculum learning for domain adaptation in neural machine translation. arXiv preprint arXiv:1905.05816 (2019).
  70. Adversarial feature matching for text generation. In International Conference on Machine Learning. PMLR, 4006–4015.
Citations (15)

Summary

We haven't generated a summary for this paper yet.