
Zero-shot Classification using Hyperdimensional Computing (2401.16876v1)

Published 30 Jan 2024 in cs.CV and cs.LG

Abstract: Classification based on Zero-shot Learning (ZSL) is the ability of a model to classify inputs into novel classes on which the model has not previously seen any training examples. Providing an auxiliary descriptor in the form of a set of attributes describing the new classes involved in the ZSL-based classification is one of the favored approaches to solving this challenging task. In this work, inspired by Hyperdimensional Computing (HDC), we propose the use of stationary binary codebooks of symbol-like distributed representations inside an attribute encoder to compactly represent a computationally simple end-to-end trainable model, which we name Hyperdimensional Computing Zero-shot Classifier (HDC-ZSC). It consists of a trainable image encoder, an attribute encoder based on HDC, and a similarity kernel. We show that HDC-ZSC can be used to first perform zero-shot attribute extraction tasks and can later be repurposed for Zero-shot Classification tasks with minimal architectural changes and minimal model retraining. HDC-ZSC achieves Pareto-optimal results with a 63.8% top-1 classification accuracy on the CUB-200 dataset while having only 26.6 million trainable parameters. Compared to two other state-of-the-art non-generative approaches, HDC-ZSC achieves 4.3% and 9.9% better accuracy, while those approaches require more than 1.85x and 1.72x the parameters of HDC-ZSC, respectively.


Summary

  • The paper presents HDC-ZSC, a novel non-generative method leveraging hyperdimensional computing for efficient zero-shot learning.
  • It uses a three-phase training process with an image encoder and a stationary HDC-based attribute encoder, achieving 63.8% top-1 accuracy on the CUB-200 dataset.
  • The approach outperforms similar methods by up to 9.9% while significantly reducing parameter counts, making it ideal for resource-limited systems.

Hyperdimensional Computing and Zero-shot Learning

Introduction to Zero-shot Learning and HDC

Zero-shot learning (ZSL) is a machine learning setting in which a model must classify inputs belonging to classes it has never seen during training. It relies on auxiliary descriptors that semantically link seen and unseen classes. This paper presents a non-generative approach named Hyperdimensional Computing Zero-shot Classifier (HDC-ZSC), which uses Hyperdimensional Computing (HDC) to build efficient attribute encoders for ZSL tasks. HDC, drawing inspiration from the brain's distributed processing, represents information with high-dimensional binary vectors (hypervectors), enabling simple yet powerful computations.
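
A key property behind HDC, which the attribute encoder exploits, is that random high-dimensional binary vectors are quasi-orthogonal: their normalized Hamming distance concentrates around 0.5. The sketch below (illustrative only; the dimensionality is an assumption, not the paper's setting) demonstrates this numerically.

```python
import numpy as np

# Random binary hypervectors in high dimensions are quasi-orthogonal:
# their normalized Hamming distance concentrates around 0.5.
rng = np.random.default_rng(0)
d = 10_000                       # hypervector dimensionality (assumed)
a = rng.integers(0, 2, size=d)   # random binary hypervector
b = rng.integers(0, 2, size=d)   # an independent one

hamming = float(np.mean(a != b)) # normalized Hamming distance
print(round(hamming, 2))         # concentrates near 0.5 for large d
```

The concentration tightens as the dimensionality grows, which is why HDC can treat independently drawn hypervectors as effectively distinct symbols.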

Model Architecture and Training

The HDC-ZSC model comprises three key components: a trainable image encoder that processes input images, a stationary HDC-based attribute encoder that uses binary codebooks to encode class attributes, and a similarity kernel that compares embeddings from the image encoder against those produced by the attribute encoder. Notably, the attribute encoder relies on randomly initialized binary hypervectors that are quasi-orthogonal, which underpins HDC's potential for resource-efficient computing.
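
The following is a hedged sketch of how such an attribute encoder could work (the names, the bipolar codebook, and the weighted-bundling step are illustrative assumptions, not the paper's exact formulation): each attribute is assigned a fixed random codebook hypervector, and a class descriptor is encoded by superposing those vectors weighted by the class's attribute strengths.

```python
import numpy as np

# Illustrative HDC-style attribute encoder: a stationary, untrained
# bipolar codebook with one hypervector per attribute.
rng = np.random.default_rng(42)
d, n_attr = 8192, 312                             # CUB-200 provides 312 attributes
codebook = rng.choice([-1, 1], size=(n_attr, d))  # fixed random codebook

def encode_class(attr_strengths):
    """Bundle attribute hypervectors, weighted by attribute strengths."""
    return attr_strengths @ codebook              # superposition in R^d

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Because codebook vectors are quasi-orthogonal, similarity between
# encoded class prototypes closely tracks similarity between the raw
# attribute descriptors.
w1, w2 = rng.random(n_attr), rng.random(n_attr)
cos_proto = cosine(encode_class(w1), encode_class(w2))
cos_attr = cosine(w1, w2)
```

The point of the design is that the codebook needs no training: the similarity structure of the attribute space carries over to hypervector space essentially for free.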

Training follows a three-phase process: the model is first trained on standard image classification, then on attribute extraction, and finally fine-tuned for the ZSC task. These steps equip the image encoder with the rich semantics needed for zero-shot classification. In essence, the HDC-ZSC model first learns to recognize attributes from images and then uses the learned representations to generalize to unseen classes.
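
After training, zero-shot inference reduces to comparing an image embedding against prototypes of the unseen classes, each derived from that class's attribute descriptor. The sketch below shows this final step with an assumed interface and synthetic data (the prototype construction and dimensions are hypothetical, not taken from the paper).

```python
import numpy as np

# Synthetic stand-ins: prototypes for 50 unseen classes and one image
# embedding constructed to lie near prototype 3, for demonstration.
rng = np.random.default_rng(1)
d, n_unseen = 512, 50
prototypes = rng.standard_normal((n_unseen, d))
image_emb = prototypes[3] + 0.1 * rng.standard_normal(d)

def zero_shot_predict(emb, protos):
    """Return the index of the most cosine-similar class prototype."""
    sims = protos @ emb / (np.linalg.norm(protos, axis=1) * np.linalg.norm(emb))
    return int(np.argmax(sims))

print(zero_shot_predict(image_emb, prototypes))  # 3
```

No prototype is updated at inference time, which is what lets the classifier handle classes it never trained on.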

Empirical Results and Comparisons

The model is evaluated on the CUB-200 dataset, where HDC-ZSC reaches 63.8% top-1 classification accuracy with a footprint of only 26.6 million trainable parameters. It achieves 4.3% and 9.9% higher accuracy than two state-of-the-art non-generative methods, which require 1.85× and 1.72× more parameters, respectively. Against computationally expensive generative methods, HDC-ZSC maintains competitive accuracy while greatly reducing the parameter count, underscoring its efficiency.

Concluding Remarks

The efficiency of HDC-ZSC makes a compelling case for its application in environments where computational resources are constrained, such as embedded systems. The paper suggests that future iterations of the model could explore hardware-accelerated versions, enabling training and inference at the edge, which would be a significant step for AI deployment in resource-limited scenarios. The authors acknowledge Junyuan Cui for preliminary technical studies that paved the way for HDC-ZSC's development.
