Multi-task Mid-level Feature Alignment Network for Unsupervised Cross-Dataset Person Re-Identification (1807.01440v2)

Published 4 Jul 2018 in cs.CV

Abstract: Most existing person re-identification (Re-ID) approaches follow a supervised learning framework, in which a large number of labelled matching pairs are required for training. Such a setting severely limits their scalability in real-world applications where no labelled samples are available during the training phase. To overcome this limitation, we develop a novel unsupervised Multi-task Mid-level Feature Alignment (MMFA) network for the unsupervised cross-dataset person re-identification task. Under the assumption that the source and target datasets share the same set of mid-level semantic attributes, our proposed model can be jointly optimised under the person's identity classification and the attribute learning task with a cross-dataset mid-level feature alignment regularisation term. In this way, the learned feature representation can be better generalised from one dataset to another which further improve the person re-identification accuracy. Experimental results on four benchmark datasets demonstrate that our proposed method outperforms the state-of-the-art baselines.

PDF Abstract

Multi-task Mid-level Feature Alignment for Unsupervised Person Re-Identification

The paper introduces a novel approach for tackling the unsupervised cross-dataset problem in person re-identification (Re-ID). Most existing solutions rely heavily on supervised methods, requiring large volumes of labeled data, primarily focusing on capturing the identity features of individual subjects. Such dependency poses challenges for scalability in real-world applications where labeled data may be scarcely available across numerous cameras. This paper addresses these limitations by presenting the Multi-task Mid-level Feature Alignment (MMFA) network, designed for unsupervised learning and adaptation of person Re-ID across different datasets.

Methodology Overview

The MMFA network utilizes the assumption that while different datasets may contain distinct identities, they often share mid-level semantic attributes — such as gender, age-group, or apparel color — across different individuals. Leveraging these shared attributes, the authors employ a domain adaptation strategy that aligns mid-level feature representations between source and target datasets without requiring identity overlap. The methodology innovatively integrates multi-task learning, combining identity classification with attribute recognition, while aligning the mid-level feature distributions to facilitate better transfer between datasets.

Technical Highlights

Mid-level Feature Alignment: The paper proposes using Maximum Mean Discrepancy (MMD) as a measure to quantify and minimize the distribution differences between mid-level features extracted from both source and target datasets. By reducing this disparity, the MMFA network can generalize learned features effectively across datasets.
Simultaneous Training: Unlike traditional methods that separate feature learning and adaptation into discrete steps, the MMFA employs a unified training process, balancing supervised identity and attribute classification with unsupervised domain adaptation. This single-step procedure is computationally efficient, reducing training time compared to other deep learning-based Re-ID approaches.
Extensive Performance Evaluations: The experiments span four major person Re-ID datasets, namely Market1501, DukeMTMC-reID, VIPeR, and PRID, with the MMFA method consistently outperforming various state-of-the-art unsupervised methods in both Rank-1 accuracy and mAP metrics.

Implications and Future Directions

The MMFA network paves a promising path for unsupervised cross-dataset person Re-ID, emphasizing the value of mid-level features in overcoming the inherent difficulties of deploying Re-ID systems at scale. By capitalizing on shared attributes, the methodology lifts existing constraints related to identity mismatch across datasets, thus broadening the horizon for unsupervised learning models.

Future research could explore adaptive frameworks that incorporate more complex mid-level features or refine the alignment methodology to enhance precision further. It opens the potential for employing these concepts in other computer vision domains where cross-dataset application and scalability remain pivotal.

This paper enriches the discourse in machine learning and AI, addressing a practical challenge with a sophisticated yet adaptable solution in the field of personal security and surveillance technology.

PDF Markdown Bookmark Chat (Pro)

Authors (4)

Shan Lin (67 papers)
Haoliang Li (67 papers)
Chang-Tsun Li (22 papers)
Alex Chichung Kot (7 papers)

Citations (182)

View on Semantic Scholar

Multi-task Mid-level Feature Alignment Network for Unsupervised Cross-Dataset Person Re-Identification (1807.01440v2)

Multi-task Mid-level Feature Alignment for Unsupervised Person Re-Identification

Methodology Overview

Technical Highlights

Implications and Future Directions

Related Papers