Threshy: Supporting Safe Usage of Intelligent Web Services (2008.08252v1)

Published 19 Aug 2020 in cs.SE and cs.CY

Abstract: Increased popularity of `intelligent' web services provides end-users with machine-learnt functionality at little effort to developers. However, these services require a decision threshold to be set which is dependent on problem-specific data. Developers lack a systematic approach for evaluating intelligent services and existing evaluation tools are predominantly targeted at data scientists for pre-development evaluation. This paper presents a workflow and supporting tool, Threshy, to help software developers select a decision threshold suited to their problem domain. Unlike existing tools, Threshy is designed to operate in multiple workflows including pre-development, pre-release, and support. Threshy is designed for tuning the confidence scores returned by intelligent web services and does not deal with hyper-parameter optimisation used in ML models. Additionally, it considers the financial impacts of false positives. Threshold configuration files exported by Threshy can be integrated into client applications and monitoring infrastructure. Demo: https://bit.ly/2YKeYhE.

Citations (8)

View on Semantic Scholar

Summary

The paper introduces Threshy as a novel tool that automates decision threshold tuning to support safe and efficient use of intelligent web services.
It employs optimization algorithms and domain-specific metrics to accurately configure thresholds for diverse classification tasks.
The tool enhances developer productivity by providing an intuitive, plug-and-play interface for integrating threshold calibration into software systems.

Insights on "Threshy: Supporting Safe Usage of Intelligent Web Services"

The paper entitled "Threshy: Supporting Safe Usage of Intelligent Web Services" introduces an innovative tool designed to enhance the utilization of intelligent web services by software developers. According to the authors, the increased reliance on intelligent web services necessitates systematic methods for configuring decision thresholds, which are critical for determining the performance and utility of such services in specific problem domains. The work is attributed to researchers Alex Cummaudo, Scott Barnett, Rajesh Vasa, and John Grundy, who have contributed to exploring tools that bridge the gap between machine learning outputs and software engineering processes.

Problem Context

The primary challenge addressed by the authors is the difficulty developers face when integrating intelligent web services into applications. Unlike traditional web services, these systems return confidence values associated with predictions, requiring developers to configure decision thresholds appropriately. Current tools largely target data scientists and are utilized predominantly for pre-development evaluation, which leaves a gap in the workflows necessary for developers during the development, deployment, and maintenance stages.

Threshy: A Tailored Solution

Threshy is presented as a novel solution within this context, aimed specifically at software developers rather than data scientists. The tool stands out by facilitating decision threshold determination through automation, incorporating domain-specific costs and constraints. The developers of Threshy emphasize its versatility across different phases, including pre-development, pre-release, and ongoing support. This versatility is significant in environments where threshold tuning is a continuous necessity due to the dynamic nature of data and models.

Methodological and Technical Highlights

Threshy differentiates itself through features such as:

Automated Threshold Selection: Using optimization algorithms, Threshy automates the threshold selection process, accommodating the nuances of binary, multi-class, and multi-label classification problems.
Domain-Specific Configuration: The tool allows users to integrate domain-specific metrics such as financial costs and the impact of false positives in threshold determination. This flexibility ensures that the chosen thresholds align with business objectives and constraints.
User Interaction: Through an intuitive interface, Threshy engages developers in an educational workflow, leading them through data exploration, cost setting, and potential optimizations before final threshold integration into systems.

The authors illustrate the architecture and workflow of Threshy, marking the tool’s distinction from pre-existing frameworks that focus on model internals. By providing a "plug-and-play" solution, Threshy allows developers to calibrate thresholds without exploring model adjustments, which are often opaque and inaccessible in third-party services.

Implications and Future Directions

The introduction of Threshy holds several implications for the future of intelligent web service integration:

Enhanced Developer Productivity: By abstracting away the intricacies involved in threshold tuning, developers can focus on higher-level system optimization and product design.
Improved System Reliability: With a standardized, data-driven approach to threshold calibration, systems can achieve consistent performance across varying conditions and datasets.

Looking ahead, considerations include extending the application of Threshy beyond classification problems to other complex machine learning tasks. Evaluating Threshy’s user acceptance and effectiveness in real-world scenarios remains a pivotal step in validating its utility and advancing its development.

Conclusion

In summary, Threshy represents a significant development in supporting software engineers engaging with intelligent web services, bridging the gap left by traditional tools tailored for data scientists. By addressing the need for effective threshold calibration, Threshy contributes to the growing discourse on integrative software tools that enhance the intersection of machine learning and software engineering practices. The framework and insights provided by this paper open avenues for further refining machine learning application methodologies in software development.

PDF Markdown

Related Papers

GitHub

GitHub - a2i2/threshy (1 star)

YouTube

Show All Videos