HIVE-COTE 2.0: a new meta ensemble for time series classification

Published 15 Apr 2021 in cs.LG | (2104.07551v1)

Abstract: The Hierarchical Vote Collective of Transformation-based Ensembles (HIVE-COTE) is a heterogeneous meta ensemble for time series classification. HIVE-COTE forms its ensemble from classifiers of multiple domains, including phase-independent shapelets, bag-of-words based dictionaries and phase-dependent intervals. Since it was first proposed in 2016, the algorithm has remained state of the art for accuracy on the UCR time series classification archive. Over time it has been incrementally updated, culminating in its current state, HIVE-COTE 1.0. During this time a number of algorithms have been proposed which match the accuracy of HIVE-COTE. We propose comprehensive changes to the HIVE-COTE algorithm which significantly improve its accuracy and usability, presenting this upgrade as HIVE-COTE 2.0. We introduce two novel classifiers, the Temporal Dictionary Ensemble (TDE) and Diverse Representation Canonical Interval Forest (DrCIF), which replace existing ensemble members. Additionally, we introduce the Arsenal, an ensemble of ROCKET classifiers as a new HIVE-COTE 2.0 constituent. We demonstrate that HIVE-COTE 2.0 is significantly more accurate than the current state of the art on 112 univariate UCR archive datasets and 26 multivariate UEA archive datasets.

Abstract PDF Upgrade to Chat

Citations (202)

View on Semantic Scholar

Summary

HIVE-COTE 2.0: An Examination of a Novel Meta Ensemble for Time Series Classification

The paper "HIVE-COTE 2.0: a new meta ensemble for time series classification" presents an enhanced version of the Hierarchical Vote Collective of Transformation-based Ensembles (HIVE-COTE), a state-of-the-art algorithm for time series classification (TSC). HIVE-COTE 2.0 (HC2) is designed to improve both accuracy and usability over its predecessor, HIVE-COTE 1.0 (HC1), by introducing novel components and revising ensemble methods.

Overview of HIVE-COTE 2.0

Time series classification tasks involve predicting discrete target variables from time series data and have applications across various fields, such as seizure detection and earthquake monitoring. HIVE-COTE is a heterogeneous ensemble method that combines classifiers leveraging different data transformation domains, allowing it to exploit multiple types of temporal features. Since its earlier versions, it has remained competitive on the UCR time series classification archive.

HC2 builds on the foundation of HC1 by introducing significant improvements to its component classifiers and overall architecture. The paper introduces two novel classifiers, the Temporal Dictionary Ensemble (TDE) and the Diverse Representation Canonical Interval Forest (DrCIF), along with a new ensemble of ROCKET classifiers named the Arsenal. These enhancements ensure that HC2 outperforms existing methods across a wide range of datasets.

Key Enhancements in HIVE-COTE 2.0

Temporal Dictionary Ensemble (TDE): TDE is an ensemble of 1-NN classifiers that transform each time series into a bag-of-words representation using Symbolic Fourier Approximation (SFA). It replaces the dictionary components of HC1 with improvements in handling multivariate data and integrates mechanisms like Gaussian processes for parameter selection.
Diverse Representation Canonical Interval Forest (DrCIF): DrCIF extends the Canonical Interval Forest (CIF) by incorporating features from various representations, such as the first order differences and spectral features (e.g., periodograms). This broadened feature space leads to improved classification capabilities over previous interval-based classifiers in HC1.
Arsenal: A ROCKET Ensemble: HC2 implements ROCKET classifiers as an ensemble (the Arsenal), achieving better probability estimates than the single ROCKET classifier. It improves the integration of ROCKET into the ensemble, essential for HIVE-COTE’s probabilistic weighting scheme.
Shapelet Transform Classifier (STC): STC has been refined for better usability and is now capable of handling multivariate time series, making it an essential component despite no change in core algorithm design compared to HC1.

Performance Evaluation

The paper presents extensive empirical analysis on 112 univariate UCR datasets and 26 multivariate UEA datasets, demonstrating that HC2 significantly outperforms contemporary algorithms like InceptionTime, ROCKET, and TS-CHIEF in terms of accuracy, negative log-likelihood, and area under the receiver operating characteristic curve. Specifically, HC2 achieves over 1% more accuracy per problem on average compared to these state-of-the-art algorithms, showing particularly substantial gains on complex multivariate datasets.

The study includes an ablative analysis showing that each component of HC2 significantly contributes to its overall performance. Despite HC2’s extensive computational requirements, which could be a concern for extremely large datasets, the contracting mechanism allows it to produce reasonable models within user-specified time budgets.

Implications and Future Directions

HIVE-COTE 2.0 sets a new benchmark for time series classification, demonstrating the effectiveness of a heterogeneous ensemble leveraging multiple feature transformations. From a practical standpoint, it offers a comprehensive tool for various applications where accurate time series analysis is crucial. The paper suggests potential future enhancements, such as exploring improved contracting methods and multithreading capabilities to better scale with data size.

As the field of TSC continues evolving, HC2’s framework provides a versatile foundation for incorporating future advancements in data transformation and ensemble learning methods. Its open-source implementations further facilitate adoption and adaptation within the research community.