Quantify bug-label misclassification in CodeRed and UOwns and compare with open source
Determine the fraction of mislabeled 'bug'-type issues in the CodeRed dataset (39 proprietary projects) and the UOwns dataset (40 proprietary projects), and ascertain whether bug-label correctness in proprietary issue trackers is higher than in open-source projects.
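One possible way to approach the quantification, sketched under stated assumptions: draw a random sample of 'bug'-type issues from each dataset, classify each one manually, and report the mislabeled fraction with a confidence interval. The Python sketch below uses a Wilson score interval. The function name, the sample labels, and all counts are hypothetical and chosen only for illustration; they are not results from CodeRed, UOwns, or any open-source dataset, and the source does not prescribe this method.

```python
import math


def wilson_interval(mislabeled, sampled, z=1.96):
    """Wilson score interval for a proportion (z=1.96 gives roughly 95% coverage)."""
    if sampled <= 0:
        raise ValueError("sampled must be positive")
    p = mislabeled / sampled
    denom = 1 + z**2 / sampled
    centre = (p + z**2 / (2 * sampled)) / denom
    half = z * math.sqrt(p * (1 - p) / sampled + z**2 / (4 * sampled**2)) / denom
    # Clamp to [0, 1] to absorb floating-point rounding at the extremes.
    return max(0.0, centre - half), min(1.0, centre + half)


# Hypothetical manual-audit counts: (issues judged mislabeled, issues sampled).
audits = {
    "proprietary_sample": (30, 200),
    "open_source_sample": (50, 200),
}

for name, (bad, n) in audits.items():
    low, high = wilson_interval(bad, n)
    print(f"{name}: {bad / n:.1%} mislabeled, 95% CI [{low:.1%}, {high:.1%}]")
```

If the two intervals overlap substantially, such a sample would not support a claim that label correctness differs between proprietary and open-source trackers; a larger audit or a formal two-proportion test would then be needed.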
References
We cannot assess the fraction of mislabelled issues in the CodeRed and UOwns datasets. We suspect that the label correctness is generally higher in proprietary projects, but we cannot know.
— Increasing, not Diminishing: Investigating the Returns of Highly Maintainable Code
(2401.13407 - Borg et al., 2024) in Threats to Validity, Construct validity (Section 7)