SphereReID: Deep Hypersphere Manifold Embedding for Person Re-Identification (1807.00537v1)

Published 2 Jul 2018 in cs.CV

Abstract: Many current successful Person Re-Identification(ReID) methods train a model with the softmax loss function to classify images of different persons and obtain the feature vectors at the same time. However, the underlying feature embedding space is ignored. In this paper, we use a modified softmax function, termed Sphere Softmax, to solve the classification problem and learn a hypersphere manifold embedding simultaneously. A balanced sampling strategy is also introduced. Finally, we propose a convolutional neural network called SphereReID adopting Sphere Softmax and training a single model end-to-end with a new warming-up learning rate schedule on four challenging datasets including Market-1501, DukeMTMC-reID, CHHK-03, and CUHK-SYSU. Experimental results demonstrate that this single model outperforms the state-of-the-art methods on all four datasets without fine-tuning or re-ranking. For example, it achieves 94.4% rank-1 accuracy on Market-1501 and 83.9% rank-1 accuracy on DukeMTMC-reID. The code and trained weights of our model will be released.

Citations (193)

View on Semantic Scholar

Summary

Analyzing SphereReID: Deep Hypersphere Manifold Embedding for Person Re-Identification

The research paper, "SphereReID: Deep Hypersphere Manifold Embedding for Person Re-Identification," proposes an advanced methodology aimed at enhancing the performance of person re-identification (ReID) tasks through innovative neural network architectures and loss functions. The central contribution lies in the development of the SphereReID network, which incorporates a novel loss function, termed Sphere Loss, designed to simultaneously solve classification problems and learn a hypersphere manifold embedding.

Methodological Innovations

SphereReID distinguishes itself by addressing the limitation of current ReID approaches that often do not consider the intrinsic structure of the feature embedding space. The research introduces Sphere Softmax, a modification of the conventional softmax function, which enhances feature discrimination by ensuring feature embeddings lie on a hypersphere manifold. This geometrical constraint is achieved through the normalization of feature and weight vectors, which ensures that classification is based solely on angular separation, thereby improving inter-class separability and intra-class compactness.

Key innovations include:

Sphere Loss: This loss function incorporates both feature and weight normalization with a scalar temperature parameter. It promotes an angular margin constraint, which enhances the angular discriminative power of learned embeddings.
Balanced Sampling Strategy: To mitigate the effects of class imbalance typically encountered in ReID datasets, a balanced sampling strategy is employed. This approach controls the number of samples per identity in each mini-batch, which helps in stabilizing the influence of each identity during training.
Warming-up Learning Rate Schedule: By initializing the network with a gradually increasing learning rate, the proposed strategy assists in achieving better convergence and initialization, leveraging a robust training framework without additional computational costs.

Empirical Performance

The SphereReID model is evaluated on four benchmark ReID datasets: Market-1501, DukeMTMC-reID, CUHK03, and CUHK-SYSU. The experimental results are noteworthy, with SphereReID surpassing existing state-of-the-art methods in terms of rank-1 accuracy across all datasets. Specifically, SphereReID achieves 94.4% rank-1 accuracy on the Market-1501 dataset and 83.9% on DukeMTMC-reID, without the requirement for additional fine-tuning or re-ranking processes.

Theoretical and Practical Implications

From a theoretical perspective, the mapping of feature vectors onto a hypersphere manifold introduces a new avenue for ensuring uniform distribution and discriminative separation of embeddings. This approach leverages principles from face recognition enhanced with angular constraints, thereby enriching the method's applicability and reliability in tasks requiring robust identification.

Practically, with its superior performance in standard ReID benchmarks, SphereReID provides a robust solution for real-world surveillance applications. The algorithms and strategies, including balanced sampling and warming-up, are generic enough to extend beyond ReID, potentially benefiting other identification tasks in multimedia retrieval and beyond.

Future Directions

Looking forward, the research opens avenues for further exploration into hypersphere manifold embeddings. Potential areas of research could include experimenting with additive margin variations to further improve class separation and exploring the applicability of Sphere Loss in other domains that require high dimensional classification. Additionally, integrating complementary modalities or attributes with SphereReID could further enhance its adaptability to complex scenarios.

In summary, the SphereReID architecture, with its novel loss function and sampling strategy, provides a compelling advance in the domain of person ReID, demonstrating impressive improvements in performance and setting a foundation for future exploration in deep metric learning.