Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Self Paced Adversarial Training for Multimodal Few-shot Learning (1811.09192v1)

Published 22 Nov 2018 in cs.CV, cs.LG, and cs.MM

Abstract: State-of-the-art deep learning algorithms yield remarkable results in many visual recognition tasks. However, they still fail to provide satisfactory results in scarce data regimes. To a certain extent this lack of data can be compensated by multimodal information. Missing information in one modality of a single data point (e.g. an image) can be made up for in another modality (e.g. a textual description). Therefore, we design a few-shot learning task that is multimodal during training (i.e. image and text) and single-modal during test time (i.e. image). In this regard, we propose a self-paced class-discriminative generative adversarial network incorporating multimodality in the context of few-shot learning. The proposed approach builds upon the idea of cross-modal data generation in order to alleviate the data sparsity problem. We improve few-shot learning accuracies on the finegrained CUB and Oxford-102 datasets.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Frederik Pahde (13 papers)
  2. Oleksiy Ostapenko (10 papers)
  3. Patrick Jähnichen (4 papers)
  4. Tassilo Klein (27 papers)
  5. Moin Nabi (44 papers)
Citations (20)

Summary

We haven't generated a summary for this paper yet.