
Few-Shot Learning with Graph Neural Networks (1711.04043v3)

Published 10 Nov 2017 in stat.ML and cs.LG

Abstract: We propose to study the problem of few-shot learning with the prism of inference on a partially observed graphical model, constructed from a collection of input images whose label can be either observed or not. By assimilating generic message-passing inference algorithms with their neural-network counterparts, we define a graph neural network architecture that generalizes several of the recently proposed few-shot learning models. Besides providing improved numerical performance, our framework is easily extended to variants of few-shot learning, such as semi-supervised or active learning, demonstrating the ability of graph-based models to operate well on 'relational' tasks.

Citations (1,192)

Summary

  • The paper introduces a graph-based framework that recasts few-shot learning as a message-passing inference problem, achieving high efficiency and competitive performance.
  • It demonstrates improved performance on benchmarks like Omniglot and Mini-Imagenet with fewer parameters compared to existing methods.
  • The framework’s flexibility extends to semi-supervised and active learning, highlighting its utility for diverse relational reasoning tasks.

Few-Shot Learning with Graph Neural Networks

The paper "Few-Shot Learning with Graph Neural Networks" by Victor Garcia and Joan Bruna addresses the challenge of few-shot learning through an innovative application of Graph Neural Networks (GNNs). The authors propose a framework where the few-shot learning task is formulated as an inference problem on a partially observed graphical model. This approach leverages message-passing algorithms to propagate label information, thereby generalizing several existing few-shot learning models.

In this work, the authors present a graph-based few-shot learning model that improves performance while maintaining simplicity and efficiency. The proposed GNN is trained end-to-end and can be easily extended to semi-supervised and active learning setups, highlighting the flexibility and effectiveness of graph-based methods in relational tasks.
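To make the few-shot setup concrete, the sketch below samples one N-way K-shot "episode": N classes are drawn, K labeled support examples per class are observed, and the remaining query examples must be classified. This is a generic illustration of the task format, not code from the paper; the `dataset` dict mapping class names to example lists is a hypothetical stand-in for Omniglot or Mini-Imagenet.

```python
import random

def sample_episode(dataset, n_way=5, k_shot=1, n_query=1, seed=None):
    """Sample one N-way K-shot episode.

    dataset : hypothetical {class_name: [examples]} mapping.
    Returns (support, query): support holds k_shot labeled examples per
    class; query holds the held-out examples to classify.
    """
    rng = random.Random(seed)
    classes = rng.sample(sorted(dataset), n_way)
    support, query = [], []
    for label, cls in enumerate(classes):
        picks = rng.sample(dataset[cls], k_shot + n_query)
        support += [(x, label) for x in picks[:k_shot]]
        query += [(x, label) for x in picks[k_shot:]]
    return support, query
```

In the paper's graph view, every support and query example becomes a node of the same graph, with the support labels attached as (observed) node features.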

Contributions and Methodology

The main contributions of this paper are as follows:

  • Casting Few-Shot Learning as Graph Inference: The authors frame few-shot learning as a supervised message-passing task within a GNN architecture. This approach allows the model to propagate label information from labeled to unlabeled nodes effectively.
  • Numerical Performance: The introduced GNN model achieves state-of-the-art performance on benchmarks such as Omniglot and Mini-Imagenet, often with fewer parameters than competing models.
  • Applications Beyond Few-Shot Learning: The framework is naturally extended to semi-supervised and active learning tasks with minimal changes to the training procedure.
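The message-passing idea in the first contribution can be sketched as follows. Each node carries an image embedding concatenated with a label one-hot (zeros for unlabeled nodes); a layer builds a soft adjacency from feature similarity and aggregates neighbor features through it. This is a simplified illustration: the paper learns the edge metric with a small MLP over absolute feature differences, whereas here a fixed Gaussian kernel is used for brevity, and `W` is treated as a given weight matrix rather than a trained parameter.

```python
import numpy as np

def message_passing_step(X, W):
    """One simplified graph message-passing update.

    X : (n_nodes, d_in) node features (embedding + label one-hot)
    W : (d_in, d_out) weight matrix (stand-in for a learned parameter)
    """
    # Pairwise squared distances between node features.
    diff = X[:, None, :] - X[None, :, :]
    dist2 = np.sum(diff ** 2, axis=-1)
    # Row-normalized similarity kernel -> soft adjacency (rows sum to 1).
    A = np.exp(-dist2)
    A = A / A.sum(axis=1, keepdims=True)
    # Aggregate neighbor features, apply a linear map and a ReLU.
    return np.maximum(A @ X @ W, 0.0)
```

Stacking a few such steps lets label information diffuse from labeled to unlabeled nodes; a final linear layer on each query node's features then produces class logits.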

Numerical Results and Experimental Setup

Omniglot

On the Omniglot dataset, the GNN achieves impressive results in various few-shot learning setups, including 5-way and 20-way, 1-shot and 5-shot tasks. The model matches or exceeds the performance of alternative methods like Matching Networks, Prototypical Networks, and Temporal Convolutional Meta Learners (TCML). For instance, in the 20-way 1-shot task, the GNN slightly outperforms TCML with fewer parameters, demonstrating the model's efficiency and effectiveness.

Mini-Imagenet

Similarly, the results on Mini-Imagenet show significant improvements. When comparing the authors' metric-learning baseline paired with K-nearest neighbors (KNN) against the full GNN architecture, the latter improves accuracy by around 2.39% in the 5-shot 5-way setup, validating the benefits of propagating information through the graph structure. Although Mini-Imagenet's greater complexity poses a harder challenge, the GNN still achieves competitive results.

Semi-Supervised and Active Learning

The paper also explores semi-supervised learning by evaluating performance in scenarios where only a fraction of the data is labeled. The authors observe consistent performance improvements when incorporating unlabeled data into the model. Even with only 20% of samples labeled, the GNN matches the performance of models trained on 40% labeled data, underscoring the effectiveness of semi-supervised learning with GNNs.

In the active learning scenario, the model queries the most informative unlabeled samples to label. This learning strategy enhances the model's performance compared to random sampling, particularly noticeable in the more complex Mini-Imagenet dataset, where the active learning criterion yields a significant improvement over the random baseline.
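The query step described above can be sketched as an attention over the unlabeled nodes: the network scores each candidate, and the highest-scoring node's label is requested. The code below is a hedged illustration, assuming the per-node scalar scores are already computed (the paper derives them from early-layer node features); during training the paper samples from the attention distribution, while at evaluation the top node is queried greedily.

```python
import numpy as np

def select_query(scores, unlabeled_mask, greedy=True, rng=None):
    """Pick one unlabeled node whose label to request.

    scores         : (n_nodes,) scalar attention logits (assumed given)
    unlabeled_mask : (n_nodes,) bool, True where the label is unknown
    """
    # Mask out labeled nodes, then softmax over the rest.
    logits = np.where(unlabeled_mask, scores, -np.inf)
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    if greedy:
        return int(np.argmax(probs))       # evaluation: top node
    rng = rng or np.random.default_rng()
    return int(rng.choice(len(probs), p=probs))  # training: sample
```

Once the queried label is revealed, it is written into that node's one-hot features and the forward pass continues, letting the new information propagate to the remaining nodes.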

Theoretical and Practical Implications

The theoretical implication of this work is the validation of GNNs as a robust tool for embedding and inference in few-shot learning tasks. The graph-based approach generalizes various existing methods, providing a unified framework that adapts across different learning paradigms. This versatility is crucial for developing models that can seamlessly transition between various types of learning tasks, including few-shot, semi-supervised, and active learning.

From a practical perspective, the development of efficient GNNs opens avenues for applications in diverse domains requiring robust relational reasoning with limited data. The proposed methods can be instrumental in fields such as image recognition, bioinformatics, and natural language processing, where relational structures are inherent and leveraging them can dramatically improve learning outcomes.

Future Directions

Future research could aim to scale GNNs to handle larger graphs, incorporating hierarchical and coarsening approaches to manage millions of nodes efficiently. Moreover, extending active learning to enable more nuanced query strategies, such as question-asking or reinforcement learning in non-stationary environments, holds promise for enhancing the robustness and versatility of GNNs further.

Conclusion

The paper "Few-Shot Learning with Graph Neural Networks" demonstrates the efficacy of GNNs in addressing few-shot, semi-supervised, and active learning challenges. The presented model not only achieves competitive numerical performance but also provides a flexible and efficient framework for leveraging relational information inherent in many tasks. This work lays the groundwork for future advancements in graph-based learning methods and their applications in various complex learning scenarios.
