Evaluation of Model-Based Testing and Its Automation
This paper presents a detailed analysis and evaluation of model-based testing (MBT), focusing in particular on automated test generation and its effectiveness in identifying software errors. The empirical study centered on an automotive network controller and examined error detection, model coverage, and implementation coverage, comparing manually derived test suites against automatically generated MBT suites.
Model-based testing, as articulated in the paper, employs behavior models to generate model traces that serve as test cases for software systems. These models serve two primary roles: they clarify and substantiate the specifications, and they provide a systematic basis for test case generation. The aim of the paper was to measure whether MBT pays off in terms of software quality and to assess how manual and automated approaches compare in their ability to uncover errors.
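To make the idea concrete, the following minimal sketch derives test cases as traces of a small, purely illustrative behavior model. The Mealy-style transition table, state names, and inputs are assumptions chosen for illustration and do not come from the paper; the paper's own models and generation machinery are considerably richer.

```python
from collections import deque

# Toy behavior model of a hypothetical network controller, given as a
# Mealy-style transition table: (state, input) -> (next_state, expected_output).
# States, inputs, and outputs are illustrative assumptions, not from the paper.
MODEL = {
    ("Sleep", "wake"):  ("Ready", "ack"),
    ("Ready", "send"):  ("Busy",  "frame_out"),
    ("Ready", "sleep"): ("Sleep", "ack"),
    ("Busy",  "done"):  ("Ready", "ack"),
    ("Busy",  "error"): ("Fault", "error_flag"),
    ("Fault", "reset"): ("Sleep", "ack"),
}
INITIAL_STATE = "Sleep"

def generate_traces(max_length):
    """Enumerate all model traces of up to max_length transitions.

    Each trace is a list of (input, expected_output) pairs that can be
    replayed against the implementation as one test case."""
    traces = []
    queue = deque([(INITIAL_STATE, [])])
    while queue:
        state, trace = queue.popleft()
        if trace:
            traces.append(trace)
        if len(trace) == max_length:
            continue
        for (src, inp), (dst, out) in MODEL.items():
            if src == state:
                queue.append((dst, trace + [(inp, out)]))
    return traces

if __name__ == "__main__":
    for trace in generate_traces(max_length=3):
        print(" -> ".join(f"{inp}/{out}" for inp, out in trace))
```

Each printed trace doubles as a test case: feed the input sequence to the system under test and compare its outputs against the expected outputs predicted by the model.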
Key Findings and Numerical Results
- Error Detection: The paper found that model-based test suites, whether derived manually or automatically, excel at detecting requirements errors, uncovering significantly more of them than test suites derived directly from the requirements documents without an intermediate model. Requirements errors are defined here as errors whose correction necessitates changes to the requirements documents.
- Comparative Analysis: Manual model-based test suites were as effective at detecting errors as automated ones when an equal number of tests was considered. However, the authors note that increasing the number of automatically generated tests sixfold resulted in an 11% increase in detected errors, and none of the test suites detected all errors.
- Coverage Correlation: Coverage metrics showed a moderate positive correlation between condition/decision (C/D) coverage, measured on both the model and the implementation, and the number of detected errors. Even so, the paper points out that higher C/D coverage does not necessarily imply a higher rate of failure detection, highlighting the limits of relying solely on structural test criteria (see the sketch after this list).
- Automation vs. Manual Testing: The findings suggest that although automated testing is beneficial, especially when large numbers of test cases are needed, it does not replace the conceptual and domain-specific knowledge that human testers bring. Conversely, automated test suites detected certain faults missed by humans because they are able to generate "non-standard" or random test inputs.
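To illustrate why structural coverage is a weak proxy for fault detection, the sketch below tallies condition/decision coverage for a single hypothetical guard. The guard, the test inputs, and the coverage bookkeeping are illustrative assumptions rather than the paper's actual tooling; the point is that a suite can reach full C/D coverage while exercising only a small fraction of the possible input combinations.

```python
from itertools import product

def cd_coverage(test_inputs):
    """Tally condition/decision (C/D) coverage of one hypothetical guard.

    The guard `a and (b or c)` stands in for a branch in the controller
    code. C/D coverage requires each condition (a, b, c) and the decision
    as a whole to evaluate to both True and False across the suite."""
    outcomes = {name: set() for name in ("a", "b", "c", "decision")}
    for a, b, c in test_inputs:
        outcomes["a"].add(a)
        outcomes["b"].add(b)
        outcomes["c"].add(c)
        outcomes["decision"].add(a and (b or c))
    covered = sum(len(seen) == 2 for seen in outcomes.values())
    return covered / len(outcomes)

# Two test cases already achieve 100% C/D coverage of this guard...
suite = [(True, True, False), (False, False, True)]
print(f"C/D coverage: {cd_coverage(suite):.0%}")

# ...yet they exercise only 2 of the 8 possible condition combinations,
# so full structural coverage says little about which behaviors were tried.
all_combinations = list(product([True, False], repeat=3))
print(f"combinations exercised: {len(suite)} of {len(all_combinations)}")
```

Full C/D coverage here leaves six of the eight condition combinations untried, which mirrors the paper's caution against treating structural criteria as a guarantee of failure detection.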
The implications of these findings are multifaceted. Practically, the paper underlines the importance of integrating model-based testing into the software development process, particularly for complex systems like automotive controllers where clarifying requirements is crucial. Theoretically, the paper suggests that while automation in MBT is useful, more sophisticated methods are needed to integrate human insight with automated processes and so improve fault detection. Additionally, the results point to the need for better metrics and tools for measuring the effectiveness of MBT.
Speculations on Future Developments in AI
The findings of this paper could inform future AI development, where model-based testing is likely to become increasingly applicable. As AI systems become more integrated and prevalent, ensuring their reliability through robust testing methods is paramount. The insights from MBT can be adapted to AI systems to verify and validate models, particularly in safety-critical applications. Furthermore, the combination of automated and manual testing approaches can be explored in AI development workflows, potentially leveraging AI-driven test generation that mimics human intuition and understanding.
Conclusion
In conclusion, the paper provides a nuanced perspective on model-based testing, asserting its strengths in error detection and highlighting areas where further research could enhance its utility. The paper demonstrates the potential of MBT to improve software quality, specifically through its capability to uncover requirements errors effectively. While automation aids scaling and can sometimes uncover unique faults, human intuition remains indispensable, which argues for a balanced approach to testing methodologies. As AI systems grow in complexity, integrating MBT into AI development practices will likely become crucial for ensuring their robustness and reliability.