Code Red: The Business Impact of Code Quality -- A Quantitative Study of 39 Proprietary Production Codebases (2203.04374v1)

Published 8 Mar 2022 in cs.SE

Abstract: Code quality remains an abstract concept that fails to get traction at the business level. Consequently, software companies keep trading code quality for time-to-market and new features. The resulting technical debt is estimated to waste up to 42% of developers' time. At the same time, there is a global shortage of software developers, meaning that developer productivity is key to software businesses. Our overall mission is to make code quality a business concern, not just a technical aspect. Our first goal is to understand how code quality impacts 1) the number of reported defects, 2) the time to resolve issues, and 3) the predictability of resolving issues on time. We analyze 39 proprietary production codebases from a variety of domains using the CodeScene tool based on a combination of source code analysis, version-control mining, and issue information from Jira. By analyzing activity in 30,737 files, we find that low quality code contains 15 times more defects than high quality code. Furthermore, resolving issues in low quality code takes on average 124% more time in development. Finally, we report that issue resolutions in low quality code involve higher uncertainty manifested as 9 times longer maximum cycle times. This study provides evidence that code quality cannot be dismissed as a technical concern. With 15 times fewer defects, twice the development speed, and substantially more predictable issue resolution times, the business advantage of high quality code should be unmistakably clear.

Citations (13)

View on Semantic Scholar

Summary

The paper quantifies the business impact of code quality by analyzing 39 proprietary codebases using the Code Health metric.
The study shows that alert-level code harbors 15 times more defects and takes 124% longer to resolve issues compared to healthy code.
The research reveals that unpredictable cycle times, up to 9 times longer in low-quality code, hinder effective scheduling and strategic planning.

Analyzing the Business Impact of Code Quality: A Quantitative Exploration

The paper "Code Red: The Business Impact of Code Quality," authored by Adam Tornhill and Markus Borg, provides a robust quantitative assessment of code quality's impact on software development and business productivity. This paper explores the challenging intersection of technical metrics and their business ramifications by analyzing 39 proprietary codebases. Using CodeScene's Code Health metric as a proxy for code quality, the research offers a comprehensive view of how code quality influences defect density, development time, and predictability in task completion.

The primary aim of the paper is to elevate code quality from a mere technical discussion to a crucial business consideration, addressing how it affects the number of reported defects, the time required to resolve issues, and the predictability of completing tasks on time. The researchers dissected 30,737 files, categorizing them into three code health levels: healthy, warning, and alert. Their findings underscore the significant repercussions of low code quality, quantifying the resulting technical debt as an impediment to productivity amid a global shortage of software developers.

Key Findings

The research reveals stark contrasts between high-quality and low-quality code in terms of defects and productivity:

Defect Density: Alert-level code was found to harbor 15 times more defects than healthy code. This indicates a direct correlation between code quality and defect frequency, highlighting an area ripe for improvement through code refactoring and quality enhancement.
Development Time: Resolving issues in low-quality (alert) code takes 124% longer than handling similar issues in high-quality code. This longevity directly links inefficient code to slower time-to-market, potentifying the argument for regular codebase updates and refactoring as a business imperative.
Predictability: The uncertainty in resolving issues is significantly pronounced in lower-quality code, with maximum cycle times being 9 times longer than those in healthy code. This variance not only affects scheduling but also the strategic planning necessary for agile business operations.

Implications and Future Directions

The implications of these findings are multifaceted, affecting software development strategy, business operations, and organizational finance. The empirical evidence provided by Tornhill and Borg bridges the conceptual gap between technical debt and business impact, supporting the integration of quality metrics into key performance indicators that guide business decisions.

In strategizing future research, considerations should be given to disentangling confounding variables, such as developer expertise or module complexity, which can further illuminate the causality between code quality and productivity. Additionally, distinctions between planned and unplanned work efforts provide avenues to explore how predictability and cycle times might differ based on task types.

From a practical perspective, development teams and managers are empowered by these insights to prioritize coding standards and invest in processes that elevate software quality. Transitioning quality discussions from the engineering floor to the boardroom might deliver sustainable operational efficiencies and foster a culture where high-quality code is synonymous with high business value.

Ultimately, the robust analysis within this paper serves as a call to action for software organizations to recalibrate their perception and handling of code quality, ensuring productive use of developer time and efficient business operations.

PDF Markdown

Related Papers

GitHub

GitHub - empear-analytics/code-health-study-tech-debt-2022: A complete replication package for our research paper for TechDebt 2022 (5 stars)

Tweets

https://twitter.com/AdamTornhill/status/1745882496695148620

https://twitter.com/crudeoilreport/status/1928581876903555544

https://twitter.com/mbaltaks/status/1785608499466318110

https://twitter.com/maxwellnoshit/status/1761420554509103257

https://twitter.com/b_a_a_d_o/status/1938599764481933592

https://twitter.com/NoriSte/status/1859617270214873502

YouTube

Show All Videos