Graphonomy: Universal Human Parsing via Graph Transfer Learning (1904.04536v1)

Published 9 Apr 2019 in cs.CV

Abstract: Prior highly-tuned human parsing models tend to fit towards each dataset in a specific domain or with discrepant label granularity, and can hardly be adapted to other human parsing tasks without extensive re-training. In this paper, we aim to learn a single universal human parsing model that can tackle all kinds of human parsing needs by unifying label annotations from different domains or at various levels of granularity. This poses many fundamental learning challenges, e.g. discovering underlying semantic structures among different label granularity, performing proper transfer learning across different image domains, and identifying and utilizing label redundancies across related tasks. To address these challenges, we propose a new universal human parsing agent, named "Graphonomy", which incorporates hierarchical graph transfer learning upon the conventional parsing network to encode the underlying label semantic structures and propagate relevant semantic information. In particular, Graphonomy first learns and propagates compact high-level graph representation among the labels within one dataset via Intra-Graph Reasoning, and then transfers semantic information across multiple datasets via Inter-Graph Transfer. Various graph transfer dependencies (\eg, similarity, linguistic knowledge) between different datasets are analyzed and encoded to enhance graph transfer capability. By distilling universal semantic graph representation to each specific task, Graphonomy is able to predict all levels of parsing labels in one system without piling up the complexity. Experimental results show Graphonomy effectively achieves the state-of-the-art results on three human parsing benchmarks as well as advantageous universal human parsing performance.

Citations (169)

View on Semantic Scholar

Summary

The paper introduces a novel graph transfer learning framework that unifies varied label granularity in human parsing.
It employs intra-graph reasoning and inter-graph transfer modules to bridge semantic disparities across multiple datasets.
Quantitative evaluations on benchmarks reveal improved Mean IoU and robust performance compared to traditional specialized models.

Insights into Graphonomy: Universal Human Parsing via Graph Transfer Learning

The paper, "Graphonomy: Universal Human Parsing via Graph Transfer Learning," addresses a significant challenge in the field of human parsing: the development of a universal model that can handle varying label granularities across different datasets without extensive re-training. Traditional models in human parsing tend to be highly specialized, often fitting precisely to specific datasets and domain tasks, which limits their adaptability and necessitates redundant efforts when transitioning to new tasks. The authors propose a novel framework named Graphonomy, which leverages graph transfer learning to unify and process diverse label annotations from multiple datasets within a single model.

The central innovation in this work is the incorporation of hierarchical graph transfer learning to develop a universal parsing model. Specifically, the framework utilizes two main modules: Intra-Graph Reasoning and Inter-Graph Transfer. The Intra-Graph Reasoning module creates a semantic graph representation within a dataset by propagating semantic information through a graph structure that encapsulates semantic relationships, such as body part structure relations. Inter-Graph Transfer further enhances this model by learning and transferring semantic correlations between nodes in graphs from different datasets, thereby effectively bridging the semantic discrepancy among datasets.

The implementation benefits significantly from geometric deep learning techniques, such as graph convolutional networks, which enable the handling of structured data more effectively. Notably, the paper highlights several strategies for defining the graph transfer dependencies, including handcraft relations, learnable matrix weights, and linguistic knowledge embedded via word embeddings. The results indicate that leveraging feature-level and semantic similarity metrics—particularly using word embeddings—facilitates better knowledge transfer across datasets.

Quantitative evaluation of Graphonomy on three prominent human parsing benchmarks—PASCAL-Person-Part, ATR, and CIHP datasets—demonstrates the model's efficacy. The proposed method achieves state-of-the-art results on these benchmarks. Specifically, Graphonomy achieves a Mean IoU improvement over previous models, illustrating its robustness and ability to generalize across varied datasets with diversified label sets. Furthermore, the universal application of Graphonomy—when trained jointly across datasets—outperformed traditional multi-task learning approaches, underlining its capability to handle multiple parsing tasks simultaneously with consistency and reliability.

The theoretical implications of this work are substantial. By effectively addressing label discrepancy through graph-based semantic transfer, the paper underscores the potential of integrating structured knowledge into machine learning models, which may generalize beyond human parsing into other domains of computer vision that encounter similar challenges with diverse annotations and tasks. On a practical level, the efficiency gains from reducing data re-labeling efforts and redundant model specialization are noteworthy.

Future research avenues could explore integrating more complex semantic relationships and leveraging additional external knowledge bases to further enhance model robustness. Additionally, investigating the extension of Graphonomy to other domains such as general semantic segmentation tasks could provide valuable insights into this promising paradigm of universal model development in deep learning.

Overall, the Graphonomy framework is a significant step towards creating a more flexible, unified approach to tackle human parsing challenges, suggesting broader applicability in various fields requiring structured data understanding and effective knowledge transfer.

PDF Markdown

Related Papers

GitHub

GitHub - Gaoyiminggithub/Graphonomy: Graphonomy: Universal Human Parsing via Graph Transfer Learning (306 stars)