- The paper reveals that parameter tuning with differential evolution substantially improves software defect prediction, boosting precision from 0% to 60% on some datasets.
- The paper challenges the reliance on default settings by showing that a few dozen tuning attempts yield substantial performance improvements.
- The paper underscores the potential of evolutionary algorithms to optimize model parameters, prompting a re-evaluation of standard software analytics practices.
An Evaluation of Parameter Tuning in Software Analytics
The paper "Tuning for Software Analytics: is it Really Necessary?" by Wei Fu, Tim Menzies, and Xipeng Shen explores an often-overlooked aspect of software defect prediction: the tuning of data mining algorithms. The authors challenge the assumption that default parameters in data miners, which have been extensively tested by their developers, suffice for effective performance optimization. Their research indicates that parameter tuning is not only necessary but also significantly impactful, contradicting prior notions in the field.
Key Insights and Methodology
The authors employed differential evolution (DE), a well-regarded optimization algorithm, to tune software defect predictors built with data miners such as CART, Random Forest, and WHERE. Their study focused on defect prediction in Java systems, using data from open-source repositories to validate the findings. The effectiveness of the tunings was assessed across successive releases of each software project, with training and testing data temporally separated so that predictors were always evaluated on releases later than those they were trained on.
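To make the setup concrete, here is a minimal sketch of DE-based tuning in Python. It is not the paper's experimental rig: it assumes scikit-learn's DecisionTreeClassifier as a stand-in for CART, SciPy's differential_evolution as the optimizer, and a synthetic load_release in place of real project data; the parameter ranges and DE settings are likewise illustrative.

```python
import numpy as np
from scipy.optimize import differential_evolution
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import precision_score

def load_release(seed):
    """Stand-in for one release's static-code metrics and defect labels."""
    r = np.random.default_rng(seed)
    X = r.normal(size=(200, 10))
    y = (X[:, 0] + r.normal(scale=0.5, size=200) > 0.8).astype(int)
    return X, y

# Temporal split: train on the earliest release, score candidate tunings on
# the next, and reserve the latest for the final test (tuning data stays
# separate from test data, as in the paper's evaluation).
X_train, y_train = load_release(1)
X_tune,  y_tune  = load_release(2)
X_test,  y_test  = load_release(3)

def objective(params):
    """Fitness of one candidate tuning; negated because SciPy's DE minimizes."""
    depth, split, leaf = params
    model = DecisionTreeClassifier(max_depth=int(depth),
                                   min_samples_split=int(split),
                                   min_samples_leaf=int(leaf),
                                   random_state=0).fit(X_train, y_train)
    return -precision_score(y_tune, model.predict(X_tune), zero_division=0)

bounds = [(1, 12),   # max_depth
          (2, 20),   # min_samples_split
          (1, 12)]   # min_samples_leaf

# Small popsize/maxiter keep the budget at tens of evaluations;
# polish=False suppresses SciPy's extra gradient-based refinement step.
result = differential_evolution(objective, bounds, maxiter=3, popsize=4,
                                seed=1, polish=False)
tuned = DecisionTreeClassifier(max_depth=int(result.x[0]),
                               min_samples_split=int(result.x[1]),
                               min_samples_leaf=int(result.x[2]),
                               random_state=0).fit(X_train, y_train)
print("held-out precision:",
      precision_score(y_test, tuned.predict(X_test), zero_division=0))
```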
What sets this paper apart is the discovery that effective tuning of software defect predictors does not require an exhaustive search. Contrary to initial expectations, the researchers found that tens of evaluations, guided by DE's efficient search mechanism, sufficed to produce significant improvements in predictive performance. For example, after tuning, some predictors' precision scores jumped from 0% to 60%. This result challenges the status quo of software quality assurance by advocating tuning as a standard practice.
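What makes such a small budget plausible is the mechanism of DE itself: each trial candidate is built by extrapolating between existing population members, so good settings propagate quickly instead of being rediscovered by blind sampling. A bare-bones sketch of the core loop follows; the population size, generation count, and F/CR constants here are illustrative assumptions, not prescriptions from the paper.

```python
import random

def de_minimize(objective, bounds, pop_size=10, generations=5, f=0.75, cr=0.3):
    """Bare-bones DE: costs roughly pop_size * (generations + 1) evaluations."""
    dim = len(bounds)
    clip = lambda v, k: max(bounds[k][0], min(bounds[k][1], v))
    pop = [[random.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Build a trial by extrapolating between three other members.
            a, b, c = random.sample([p for j, p in enumerate(pop) if j != i], 3)
            j_rand = random.randrange(dim)  # force at least one mutated coordinate
            trial = [clip(a[k] + f * (b[k] - c[k]), k)
                     if (k == j_rand or random.random() < cr) else pop[i][k]
                     for k in range(dim)]
            s = objective(trial)
            if s < scores[i]:               # greedy one-to-one replacement
                pop[i], scores[i] = trial, s
    i = min(range(pop_size), key=scores.__getitem__)
    return pop[i], scores[i]
```

With pop_size=10 and five generations, this is only about 60 objective calls (each one a train-and-score of the learner), which matches the "tens of attempts" scale the paper reports as sufficient.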
Strong Results and Challenges to Conventional Wisdom
The paper presents detailed evaluations that underscore the effectiveness of tuning. Notably, once tuned, CART often matched or exceeded the performance of Random Forest, challenging prior studies such as Lessmann et al. (2008) that positioned Random Forest as superior. The paper also shows that tuning changes which project factors a learner ranks as most influential, meaning that conclusions about what "matters" for defect prediction can shift with the tuning applied.
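As a usage note building on the earlier sketch, that comparison could be reproduced in miniature as follows; the untuned baselines simply use the libraries' default parameters, and real defect data would of course be needed to observe the paper's effect.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import precision_score

# Untuned baselines with library defaults, reusing the release splits above.
for name, model in [("default CART", DecisionTreeClassifier(random_state=0)),
                    ("default RF", RandomForestClassifier(random_state=0))]:
    model.fit(X_train, y_train)
    print(name, precision_score(y_test, model.predict(X_test), zero_division=0))
# Compare these scores against the tuned CART precision printed earlier.
```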
The authors further argue that this inexpensive optimization step should be integral to future defect prediction work, since even modest parameter adjustments can substantially change the conclusions drawn from data mining studies.
Implications and Future Directions
The implications of this research are broad and significant. First, it suggests a paradigm shift in how software engineering researchers and practitioners approach data mining tasks: applying miners "off-the-shelf" with their default parameters is no longer tenable. The dramatic improvements observed indicate that many existing insights and recommendations in empirical software engineering may need to be revisited, and possibly revised.
Second, the paper's success with differential evolution points toward the potential of evolutionary algorithms in other software analytics contexts, though it remains an open question whether these tuning benefits generalize beyond defect prediction.
The work charts a course for future research in which tuning and learning are more tightly integrated, perhaps evolving into adaptive systems capable of self-optimization as goals and datasets change. In particular, it opens avenues for simpler frameworks that merge the optimizer with the learner implementation.
In conclusion, the paper makes a compelling case for treating data mining in software analytics as a complex, multifaceted optimization problem. Researchers and practitioners should embrace parameter tuning not only to enhance predictive performance but also to refine and transform the insights derived from software engineering data.