DESlib: A Dynamic ensemble selection library in Python

Published 14 Feb 2018 in cs.LG | (1802.04967v3)

Abstract: DESlib is an open-source python library providing the implementation of several dynamic selection techniques. The library is divided into three modules: (i) \emph{dcs}, containing the implementation of dynamic classifier selection methods (DCS); (ii) \emph{des}, containing the implementation of dynamic ensemble selection methods (DES); (iii) \emph{static}, with the implementation of static ensemble techniques. The library is fully documented (documentation available online on Read the Docs), has a high test coverage (codecov.io) and is part of the scikit-learn-contrib supported projects. Documentation, code and examples can be found on its GitHub page: https://github.com/scikit-learn-contrib/DESlib.

Abstract PDF Upgrade to Chat

Citations (80)

View on Semantic Scholar

Summary

The paper introduces DESlib, a library that dynamically selects classifiers based on local competence estimation to enhance prediction accuracy.
It presents a modular framework detailing regions of competence, information sources, and selection strategies for both dynamic classifier and ensemble selection.
The library integrates with scikit-learn using rigorous testing and documentation standards, offering a robust tool for machine learning research and practical applications.

Overview of DESlib: A Dynamic Ensemble Selection Library in Python

Introduction

The paper presents DESlib, a Python library designed for implementing dynamic ensemble selection (DES) techniques, advancing the domain of multiple classifier systems. This library effectively bridges the gap between academic research on dynamic selection methods and practical machine learning applications. By offering implementations of both dynamic classifier selection (DCS) and dynamic ensemble selection (DES), along with static ensemble methodologies, DESlib serves as a comprehensive resource for researchers and practitioners.

Dynamic Selection in Multiple Classifier Systems

Dynamic selection approaches are pivotal for adapting to local data regions by evaluating the competence of classifiers in a pool for each specific instance. These methodologies surpass traditional static methods like majority voting by dynamically selecting classifiers that are deemed most competent for a given prediction task. This paradigm acknowledges that no single classifier can be universally optimal across diverse feature spaces.

The implementations in DESlib are classified under a taxonomy focusing on three main components: defining the region of competence, information sources for competence estimation, and the selection approach for choosing classifiers in both DCS and DES contexts. This structured design promotes modularity, facilitating the integration of novel methods by requiring implementations only for the competence estimation and selection processes.

Project Management and Implementation

DESlib is engineered to support ease of integration and encourage contributions from the research community. Developed on a collaborative platform using GitHub, the project incorporates rigorous code quality and documentation standards. The library's development adheres to PEP 8 standards, and its functionality is tested extensively using Travis CI and Codacy. These practices ensure high reliability and consistent code quality, backed by detailed documentation available on Read the Docs.

Modules and Techniques

DESlib comprises three core modules:

Dynamic Classifier Selection (DCS): Implements techniques selecting the most competent single classifier, such as OLA, LCA, and MLA, among others.
Dynamic Ensemble Selection (DES): Implements strategies for constructing an ensemble of classifiers that surpass a competence threshold. Notable methods include META-DES, KNORA-E, and DES-RRC.
Static Ensembles: Offers baseline ensemble techniques, like Single Best, Static Selection, and Stacked Generalization, providing benchmarks for comparison with dynamic approaches.

The library also integrates state-of-the-art improvements, including dynamic frienemy pruning within the FIRE-DES framework and hybrid selection techniques.

Practical Implications and Usability

Fully compatible with the scikit-learn API, DESlib requires minimal setup for use with existing machine learning workflows. By supporting both homogeneous and heterogeneous classifier pools, it accommodates a wide range of experimental designs. Users can deploy the library via pip and configure dynamic selection models using familiar API calls like fit and predict.

Conclusion and Future Directions

DESlib presents a robust platform for implementing and experimenting with dynamic ensemble techniques. This library not only advances practical applications in machine learning but also provides a resourceful framework for conducting further research into dynamic selection methodologies. Future developments in DESlib will explore extensions into specialized applications like One-Class Classification and regression, further broadening its utility and relevance in diverse machine learning contexts.

Markdown