Person Re-identification with Metric Learning using Privileged Information (1904.05005v1)

Published 10 Apr 2019 in cs.CV

Abstract: Despite the promising progress made in recent years, person re-identification remains a challenging task due to complex variations in human appearances from different camera views. This paper presents a logistic discriminant metric learning method for this challenging problem. Different with most existing metric learning algorithms, it exploits both original data and auxiliary data during training, which is motivated by the new machine learning paradigm - Learning Using Privileged Information. Such privileged information is a kind of auxiliary knowledge which is only available during training. Our goal is to learn an optimal distance function by constructing a locally adaptive decision rule with the help of privileged information. We jointly learn two distance metrics by minimizing the empirical loss penalizing the difference between the distance in the original space and that in the privileged space. In our setting, the distance in the privileged space functions as a local decision threshold, which guides the decision making in the original space like a teacher. The metric learned from the original space is used to compute the distance between a probe image and a gallery image during testing. In addition, we extend the proposed approach to a multi-view setting which is able to explore the complementation of multiple feature representations. In the multi-view setting, multiple metrics corresponding to different original features are jointly learned, guided by the same privileged information. Besides, an effective iterative optimization scheme is introduced to simultaneously optimize the metrics and the assigned metric weights. Experiment results on several widely-used datasets demonstrate that the proposed approach is superior to global decision threshold based methods and outperforms most state-of-the-art results.

Citations (165)

View on Semantic Scholar

Summary

The paper introduces a novel logistic discriminant metric learning approach that leverages privileged information to enhance person re-identification.
It employs a locally adaptive decision rule by learning dual metrics in both original and privileged feature spaces to improve CMC and mAP.
Experimental results on benchmarks like Market-1501 validate the method's effectiveness and potential for integration with deep learning.

An Analytical Overview of "Person Re-identification with Metric Learning using Privileged Information"

In the domain of computer vision, person re-identification (re-ID) has emerged as an essential task within video analytics, particularly for security and surveillance. The paper "Person Re-identification with Metric Learning using Privileged Information" presents a novel logistic discriminant metric learning approach designed to enhance person re-ID by utilizing privileged information—auxiliary knowledge available only during training. This research contributes to the ongoing enhancement of metric learning strategies by employing a locally adaptive decision rule, offering a new perspective within the paradigm of Learning Using Privileged Information (LUPI).

Methodology and Approach

The presented method steps away from conventional metric learning techniques that rely on solely original data with global decision thresholds, which may lack adaptability when faced with complex intra-class and inter-class variations. Instead, the paper introduces the incorporation of privileged information to refine the decision-making process during training. This privileged information acts like an auxiliary teacher, guiding the learning process to achieve more reliable distance metrics. Specifically, the method involves learning two simultaneous metrics: one in the original feature space and another using privileged information, thereby establishing a locally adaptive decision threshold.

The core of this approach is to minimize the empirical loss, which penalizes discrepancies between distances calculated within the original space and those in the privileged space. The method uses a logistic discriminant framework, which is further extended to a multi-view setting that combines multiple feature representations for a more comprehensive analysis.

Results and Implications

The experimental validation, conducted on well-established datasets like VIPeR, CUHK01, PRID450S, and Market-1501, confirmed the superiority of the proposed LDML+ and MVLDML+ methods over traditional methods that employ global decision thresholds. Significant improvements were demonstrated, particularly on larger datasets such as Market-1501, which includes complex inter-class and intra-class variations. The paper reports substantial advancements in terms of Cumulative Matching Characteristics (CMC) and Mean Average Precision (mAP) metrics across these datasets.

These experiments validate the hypothesis that utilizing privileged information can indeed act as an adaptive guide in the learning process, resulting in more robust and discriminative metrics. Multi-view learning, in particular, was shown to be advantageous when combining multiple feature representations.

Future Directions

The inclusion of privileged information in metric learning strategies opens new avenues for exploring how auxiliary data can be utilized more effectively across various domains. Future research could focus on optimizing the balance between original and privileged spaces to enhance model performance further, possibly through advanced feature fusion techniques or adaptive weighting schemes. Furthermore, extending this approach to deep learning frameworks could harness the scalability and feature extraction capabilities of neural networks while retaining the adaptive benefits of using privileged information.

Conclusion

This paper's approach presents a substantial contribution to the field of metric learning for person re-ID by leveraging the LUPI paradigm. The proposed methods bring forth a nuanced understanding of how auxiliary knowledge can substantially impact the effectiveness of learned metrics, providing a feasible pathway towards more adaptable and precise person re-identification systems. The work stands as a basis for fostering future research that seeks to integrate diverse sources of information within learning frameworks to tackle complex visual recognition tasks.

PDF Markdown