A Second Look at the Impact of Passive Voice Requirements on Domain Modeling: Bayesian Reanalysis of an Experiment (2402.10800v1)
Abstract: The quality of requirements specifications may impact subsequent, dependent software engineering (SE) activities. However, empirical evidence of this impact remains scarce and too often superficial as studies abstract from the phenomena under investigation too much. Two of these abstractions are caused by the lack of frameworks for causal inference and frequentist methods which reduce complex data to binary results. In this study, we aim to demonstrate (1) the use of a causal framework and (2) contrast frequentist methods with more sophisticated Bayesian statistics for causal inference. To this end, we reanalyze the only known controlled experiment investigating the impact of passive voice on the subsequent activity of domain modeling. We follow a framework for statistical causal inference and employ Bayesian data analysis methods to re-investigate the hypotheses of the original study. Our results reveal that the effects observed by the original authors turned out to be much less significant than previously assumed. This study supports the recent call to action in SE research to adopt Bayesian data analysis, including causal frameworks and Bayesian statistics, for more sophisticated causal inference.
- Muneera Bano. 2015. Addressing the challenges of requirements ambiguity: A review of empirical literature. In 2015 IEEE Fifth International Workshop on Empirical Requirements Engineering (EmpiRE). IEEE, 21–24.
- JC Barnes and Shannon J Linning. 2021. Statistical Power, P-Values, and the Positive Predictive Value. The Encyclopedia of Research Methods in Criminology and Criminal Justice 1 (2021), 337–343.
- Yoav Benjamini and Yosef Hochberg. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological) 57, 1 (1995), 289–300.
- Barry W Boehm and Philip N. Papaccio. 1988. Understanding and controlling software costs. IEEE transactions on software engineering 14, 10 (1988), 1462–1477.
- Handbook of markov chain monte carlo. CRC press.
- Paul-Christian Bürkner. 2017. brms: An R package for Bayesian multilevel models using Stan. Journal of statistical software 80 (2017), 1–28.
- Quasi-experimentation: Design & analysis issues for field settings. Vol. 351. Houghton Mifflin Boston.
- Felix Elwert. 2013. Graphical causal models. In Handbook of causal analysis for social research. Springer, 245–273.
- Neil A Ernst. 2018. Bayesian hierarchical modelling for tailoring metric thresholds. In Proceedings of the 15th international conference on mining software repositories. 587–591.
- Henning Femmer. 2018. Requirements Quality Defect Detection with the Qualicen Requirements Scout.. In REFSQ Workshops.
- Rapid quality assurance with requirements smells. Journal of Systems and Software 123 (2017), 190–213.
- On the impact of passive voice requirements on domain modelling. In Proceedings of the 8th ACM/IEEE international symposium on empirical software engineering and measurement. 1–4.
- How do Practitioners Perceive the Relevance of Requirements Engineering Research? IEEE Transactions on Software Engineering (2020).
- Requirements Quality Research: a harmonized Theory, Evaluation, and Roadmap. Requirements engineering (2023).
- A live extensible ontology of quality factors for textual requirements. In 2022 IEEE 30th International Requirements Engineering Conference (RE). IEEE, 274–280.
- Let’s Stop Building at the Feet of Giants: Recovering unavailable Requirements Quality Artifacts. arXiv preprint arXiv:2304.04670 (2023).
- Bayesian data analysis in empirical software engineering research. IEEE Transactions on Software Engineering 47, 9 (2019), 1786–1810.
- Applying Bayesian analysis guidelines to empirical software engineering data: The case of programming languages and code quality. ACM Transactions on Software Engineering and Methodology (TOSEM) 31, 3 (2022), 1–38.
- Many researchers were not compliant with their published data sharing statement: mixed-methods study. Journal of Clinical Epidemiology (2022).
- A framework to measure and improve the quality of textual requirements. Requirements engineering 18 (2013), 25–41.
- E. T. Jaynes. 2003. Probability theory: The logic of science. Cambridge University Press, Cambridge.
- Leonid Kof. 2007. Treatment of passive voice and conjunctions in use case documents. In Natural Language Processing and Information Systems: 12th International Conference on Applications of Natural Language to Information Systems, NLDB 2007, Paris, France, June 27-29, 2007. Proceedings 12. Springer, 181–192.
- Jennifer Krisch and Frank Houdek. 2015. The myth of bad passive voice and weak words an empirical investigation in the automotive industry. In 2015 IEEE 23rd International Requirements Engineering Conference (RE). IEEE, 344–351.
- J Jack Lee. 2011. Demystify statistical significance—time to move on from the p value to Bayesian analysis. , 2–3 pages.
- Richard McElreath. 2020. Statistical rethinking: A Bayesian course with examples in R and Stan. CRC press.
- Naming the pain in requirements engineering: Contemporary problems, causes, and effects in practice. Empirical software engineering 22 (2017), 2298–2338.
- Daniel Méndez Fernández and Birgit Penzenstadler. 2015. Artefact-based requirements engineering: the AMDiRE approach. Requirements Engineering 20 (2015), 405–434.
- Tim Menzies and Martin Shepperd. 2019. “Bad smells” in software analytics papers. Information and software technology 112 (2019), 35–47.
- Empirical research on requirements quality: a systematic mapping study. Requirements Engineering 27, 2 (2022), 183–209.
- Causal inference in statistics: A primer. John Wiley & Sons.
- Assessing the quality of use case descriptions. Software Quality Journal 15, 1 (2007), 69–97.
- Klaus Pohl. 2016. Requirements engineering fundamentals: a study guide for the certified professional for requirements engineering exam-foundation level-IREB compliant. Rocky Nook, Inc.
- Julien Siebert. 2023. Applications of statistical causal inference in software engineering. Information and Software Technology (2023), 107198.
- Bayesian data analysis in empirical software engineering: The case of missing data. Contemporary Empirical Methods in Software Engineering (2020), 289–324.
- Crossover designs in software engineering experiments: Benefits and perils. IEEE Transactions on Software Engineering 42, 2 (2015), 120–135.
- Status quo in requirements engineering: A theory and a global family of surveys. ACM Transactions on Software Engineering and Methodology (TOSEM) 28, 2 (2019), 1–48.
- Jeff S Wesner and Justin PF Pomeranz. 2021. Choosing priors in Bayesian ecological models by simulating from the prior predictive distribution. Ecosphere 12, 9 (2021), e03739.
- A retrospective study of one decade of artifact evaluations. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 145–156.
- Experimentation in software engineering. Springer Science & Business Media.
- Julian Frattini (26 papers)
- Davide Fucci (44 papers)
- Richard Torkar (27 papers)
- Daniel Mendez (63 papers)