Joint Geometrical and Statistical Alignment for Visual Domain Adaptation (1705.05498v1)

Published 16 May 2017 in cs.CV

Abstract: This paper presents a novel unsupervised domain adaptation method for cross-domain visual recognition. We propose a unified framework that reduces the shift between domains both statistically and geometrically, referred to as Joint Geometrical and Statistical Alignment (JGSA). Specifically, we learn two coupled projections that project the source domain and target domain data into low dimensional subspaces where the geometrical shift and distribution shift are reduced simultaneously. The objective function can be solved efficiently in a closed form. Extensive experiments have verified that the proposed method significantly outperforms several state-of-the-art domain adaptation methods on a synthetic dataset and three different real world cross-domain visual recognition tasks.

Authors (3)
  1. Jing Zhang (731 papers)
  2. Wanqing Li (53 papers)
  3. Philip Ogunbona (19 papers)
Citations (478)

Summary

The paper "Joint Geometrical and Statistical Alignment for Visual Domain Adaptation" introduces an approach to unsupervised domain adaptation that addresses the discrepancy between visual domains. The proposed framework, Joint Geometrical and Statistical Alignment (JGSA), simultaneously reduces both geometrical and statistical shifts between the source and target domains, thereby improving cross-domain visual recognition.

Methodology Overview

The JGSA method stands out by learning two coupled projections that map the source and target domain data into low-dimensional subspaces. This mapping aims to minimize domain shifts while preserving the intrinsic data structures of both domains. Unlike data-centric methods that assume a unified transformation to align domain distributions, JGSA relaxes this constraint by focusing separately on the properties of each domain.
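
In symbols (chosen here for illustration, not necessarily the paper's notation), JGSA learns a source projection A and a target projection B rather than a single shared transform:

```latex
% Two coupled projections into a shared k-dimensional subspace.
z_s = A^{\top} x_s, \qquad z_t = B^{\top} x_t,
\qquad A, B \in \mathbb{R}^{d \times k}, \quad A \neq B \text{ in general},
```

with a coupling term (the subspace divergence described below) preventing the two projections from drifting arbitrarily far apart.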

Key Components

  1. Target Variance Maximization: By maximizing the variance of the target domain data in the learned subspace, JGSA preserves the target domain's intrinsic structure, so informative directions are not discarded for lack of target labels.
  2. Source Discriminative Information: The framework retains discriminative information in the source domain by maximizing between-class scatter while minimizing within-class scatter, in the style of linear discriminant analysis.
  3. Minimization of Distribution Divergence: Employing Maximum Mean Discrepancy (MMD), the method reduces both the marginal and the conditional distribution shifts between domains (see the sketch after this list).
  4. Subspace Divergence Minimization: By penalizing the divergence between the source and target subspaces, JGSA keeps the two projections close, so the domains are aligned geometrically as well as statistically.
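
As a concrete reference for the MMD term in component 3, here is a minimal NumPy sketch of the empirical marginal MMD between the projected domains under a linear kernel. The function name and the simple mean-difference form are illustrative assumptions; the paper's estimator also includes class-conditional terms:

```python
import numpy as np

def marginal_mmd(Xs, Xt, A, B):
    """Squared marginal MMD (linear kernel) between projected domains.

    Xs: (ns, d) source features, Xt: (nt, d) target features.
    A, B: (d, k) source and target projection matrices.
    Returns ||mean(Xs A) - mean(Xt B)||^2.
    """
    mu_s = (Xs @ A).mean(axis=0)  # source mean in the k-dim subspace
    mu_t = (Xt @ B).mean(axis=0)  # target mean in the k-dim subspace
    diff = mu_s - mu_t
    return float(diff @ diff)

# Toy usage: d = 20 features, k = 5 subspace dimensions.
rng = np.random.default_rng(0)
Xs = rng.normal(size=(100, 20))
Xt = rng.normal(loc=0.5, size=(80, 20))
A = B = np.linalg.qr(rng.normal(size=(20, 5)))[0]  # orthonormal columns
print(marginal_mmd(Xs, Xt, A, B))
```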

The above components are integrated into a single optimization problem that can be solved efficiently in closed form, as sketched below. Moreover, JGSA extends to nonlinear scenarios through kernel methods, which broadens its applicability.
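
A minimal sketch of that solution style: with the variance and scatter terms collected into a "numerator" matrix P and the divergence terms into a "denominator" matrix Q (hypothetical names, assuming a trace-ratio formulation over the stacked projections), the optimal projections fall out of a generalized eigenproblem:

```python
import numpy as np
from scipy.linalg import eigh

def solve_projections(P, Q, d, k):
    """Maximize Tr(W^T P W) / Tr(W^T Q W) via P w = lambda Q w.

    P, Q: (2d, 2d) symmetric matrices (Q positive definite); W stacks
    the source projection A on top of the target projection B.
    """
    vals, vecs = eigh(P, Q)                   # generalized eigenproblem
    W = vecs[:, np.argsort(vals)[::-1][:k]]   # top-k eigenvectors
    return W[:d], W[d:]                       # A (source), B (target)

# Toy usage: d = 20, k = 5, random symmetric P and well-conditioned Q.
rng = np.random.default_rng(0)
M = rng.normal(size=(40, 40))
P = M + M.T
Q = np.eye(40) + 1e-2 * (M @ M.T)  # symmetric positive definite
A, B = solve_projections(P, Q, d=20, k=5)
print(A.shape, B.shape)  # (20, 5) (20, 5)
```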

Experimental Evaluation

JGSA's efficacy is demonstrated on a synthetic dataset and three real-world cross-domain recognition tasks:

  • Object Recognition: Evaluations on the Office+Caltech datasets, using both SURF descriptors and DeCAF features, show that JGSA consistently outperforms several state-of-the-art methods, including TCA, JDA, and SA.
  • Digit Recognition: On the USPS-to-MNIST and MNIST-to-USPS transfer tasks, the proposed method achieves superior accuracy, handling the substantial distribution shift between the two datasets.
  • Action Recognition: Cross-dataset evaluations on RGB-D-based action datasets further showcase JGSA's capability to manage diverse domain challenges effectively.

Implications and Future Directions

By integrating both geometrical and statistical properties within the domain adaptation process, JGSA offers a promising avenue for more robust cross-domain learning applications. Its flexibility in handling nonlinear shifts through kernelization further enhances its practical relevance.

From a theoretical perspective, JGSA advances understanding of how domain-specific and shared features can be leveraged for optimal alignment. Future work could explore its integration with more complex neural architectures or its scalability to larger datasets.

The findings suggest significant potential for applications in which labeled data in a new domain are scarce. Continued development of domain adaptation techniques such as JGSA should yield more effective and efficient machine learning solutions across varying domains.