
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels (1910.13051v1)

Published 29 Oct 2019 in cs.LG and stat.ML

Abstract: Most methods for time series classification that attain state-of-the-art accuracy have high computational complexity, requiring significant training time even for smaller datasets, and are intractable for larger datasets. Additionally, many existing methods focus on a single type of feature such as shape or frequency. Building on the recent success of convolutional neural networks for time series classification, we show that simple linear classifiers using random convolutional kernels achieve state-of-the-art accuracy with a fraction of the computational expense of existing methods.

Citations (672)

Summary

  • The paper introduces random convolutional kernels to transform time series data, achieving state-of-the-art accuracy without costly training.
  • It extracts two features from each kernel's output, the maximum value and the proportion of positive values (ppv), which feed a simple linear classifier.
  • The method scales linearly with dataset size, reducing training time from days to minutes even on large, complex time series data.

ROCKET: Efficient and Accurate Time Series Classification with Random Kernels

The paper "ROCKET: Exceptionally Fast and Accurate Time Series Classification using Random Convolutional Kernels" introduces a novel method for time series classification that attains state-of-the-art accuracy with lower computational costs compared to existing approaches. The method leverages random convolutional kernels and linear classifiers, demonstrating a significant reduction in training time without sacrificing accuracy.

Main Contributions

  1. Random Convolutional Kernels: The paper uses random kernels to transform time series data, capturing discriminative features without learning weights through backpropagation. Each kernel's parameters (length, weights, bias, dilation, and padding) are sampled at random, allowing the pool of kernels to capture patterns at different scales and frequencies; a minimal sketch of the sampling and pooling appears after this list.
  2. Feature Extraction: Two features are extracted from each kernel's output: the maximum value and the proportion of positive values (ppv). The ppv feature in particular improves classification accuracy by quantifying how prevalent a given pattern is within the time series.
  3. Computational Efficiency: Despite using a large number of random kernels, the method keeps computational overhead low. Complexity is linear in the number of kernels, the number of training examples, and the time series length, so the method scales to large datasets and long series.
  4. Experimental Validation: The paper demonstrates the efficacy of ROCKET by evaluating it on the UCR archive's 85 'bake off' datasets, showing competitive performance with current state-of-the-art methods like HIVE-COTE and InceptionTime. ROCKET achieves superior computational efficiency, reducing training times from days to minutes in many cases.
  5. Scalability: The method shows excellent scalability, learning from 1 million time series orders of magnitude faster than alternatives like Proximity Forest. The training time remains manageable even as the dataset size and time series length increase.
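
To make the transform concrete, the following is a minimal NumPy sketch of the two steps described in items 1 and 2: sampling kernels with random length, weights, bias, dilation, and padding, then pooling each convolution output into two features, ppv (the fraction of output values greater than zero) and the maximum. The parameter choices mirror those reported in the paper (lengths drawn from {7, 9, 11}, mean-centred normal weights, uniform bias, dilation sampled on a log scale); the function names and structure are illustrative, not the authors' reference implementation.

```python
import numpy as np

def generate_kernels(input_length, num_kernels, seed=0):
    """Sample random kernels: random length, weights, bias, dilation,
    and padding. Assumes input_length is larger than the longest kernel."""
    rng = np.random.default_rng(seed)
    kernels = []
    for _ in range(num_kernels):
        length = int(rng.choice([7, 9, 11]))        # random kernel length
        weights = rng.normal(0.0, 1.0, length)
        weights -= weights.mean()                   # mean-centred weights
        bias = rng.uniform(-1.0, 1.0)
        # Dilation sampled on a log scale so kernels cover many scales.
        max_exp = np.log2((input_length - 1) / (length - 1))
        dilation = int(2 ** rng.uniform(0.0, max_exp))
        # Padding used for roughly half the kernels, none for the rest.
        padding = ((length - 1) * dilation) // 2 if rng.integers(2) else 0
        kernels.append((weights, bias, dilation, padding))
    return kernels

def apply_kernel(x, weights, bias, dilation, padding):
    """Convolve one series with one kernel; return (ppv, max) features."""
    if padding > 0:
        x = np.pad(x, padding)
    span = (len(weights) - 1) * dilation
    out = np.array([bias + np.dot(weights, x[i : i + span + 1 : dilation])
                    for i in range(len(x) - span)])
    # ppv = proportion of positive values; max = global max pooling
    return (out > 0).mean(), out.max()

def transform(X, kernels):
    """Map each series to two features (ppv, max) per kernel."""
    return np.array([[f for k in kernels for f in apply_kernel(x, *k)]
                     for x in X])
```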

Numerical Results and Implications

ROCKET presents strong numerical results:

  • Achieves state-of-the-art accuracy on UCR datasets while significantly reducing training times.
  • The total compute time for all 85 'bake off' datasets is 1 hour 50 minutes, compared to over 11 days using traditional methods.
  • Handles large datasets efficiently, with complexity linear in both time series length and the number of training examples.

The results imply that ROCKET is an attractive solution for real-world applications where computational resources and time are limited. It is particularly suitable for large-scale deployments, making it a valuable tool in practical scenarios like real-time monitoring and anomaly detection.
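
As a rough illustration of this deployment pattern, the transformed features can feed an off-the-shelf linear classifier: the paper pairs the transform with a ridge regression classifier using cross-validated regularisation for smaller datasets, and logistic regression trained by stochastic gradient descent for very large ones. The sketch below reuses the hypothetical generate_kernels and transform functions from the earlier listing; the data is synthetic and only demonstrates the shapes involved. (The authors' released implementation also compiles the transform with Numba for speed, which this plain-NumPy sketch omits.)

```python
import numpy as np
from sklearn.linear_model import RidgeClassifierCV

# Synthetic data, purely to show the shapes involved:
# 500 training and 100 test series of length 150, binary labels.
rng = np.random.default_rng(1)
X_train = rng.normal(size=(500, 150))
y_train = rng.integers(0, 2, size=500)
X_test = rng.normal(size=(100, 150))

kernels = generate_kernels(input_length=150, num_kernels=1000)
F_train = transform(X_train, kernels)   # shape: (500, 2 * num_kernels)
F_test = transform(X_test, kernels)

# Ridge classifier with cross-validated regularisation strength,
# as used in the paper for smaller datasets.
clf = RidgeClassifierCV(alphas=np.logspace(-3, 3, 10))
clf.fit(F_train, y_train)
predictions = clf.predict(F_test)
```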

Future Directions

The paper suggests multiple avenues for future research:

  • Extending the approach to multivariate time series.
  • Exploring feature selection mechanisms to further enhance efficiency.
  • Applying the method to different domains beyond time series, such as image data, where similar convolutional principles can be applied.
  • Investigating hybrids of learned and random kernels to improve or adapt accuracy for specific tasks.

Conclusion

ROCKET's innovative use of random convolutional kernels provides a compelling balance of accuracy, simplicity, and scalability. Its contribution to scalable time series analysis offers both theoretical and practical advancements, setting a foundation for further exploration in machine learning and data mining. The paper successfully establishes ROCKET as a viable alternative to complex and computationally intensive methods, offering insights into efficient model design and feature extraction in time series classification.