Meta-learning Pathologies from Radiology Reports using Variance Aware Prototypical Networks (2210.13979v2)

Published 22 Oct 2022 in cs.LG and cs.CL

Abstract: Large pretrained Transformer-based language models like BERT and GPT have changed the landscape of NLP. However, fine-tuning such models still requires a large number of training examples for each target task, so annotating multiple datasets and training these models on various downstream tasks becomes time-consuming and expensive. In this work, we propose a simple extension of Prototypical Networks for few-shot text classification. Our main idea is to replace the class prototypes with Gaussians and introduce a regularization term that encourages the examples to cluster near the appropriate class centroids. Experimental results show that our method outperforms various strong baselines on 13 public and 4 internal datasets. Furthermore, we use the class distributions as a tool for detecting potential out-of-distribution (OOD) data points during deployment.
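
The idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the diagonal-covariance assumption, the squared-distance form of the regularizer, the log-likelihood threshold for OOD detection, and all function names (`gaussian_prototypes`, `log_likelihoods`, `compactness_penalty`, `ood_flags`) are assumptions chosen for illustration. In practice the embeddings would come from a Transformer encoder; here they are plain arrays.

```python
import numpy as np

def gaussian_prototypes(support, labels, n_classes, eps=1e-6):
    """Summarize each class by a mean and a diagonal variance (assumed form)."""
    means = np.stack([support[labels == c].mean(axis=0) for c in range(n_classes)])
    variances = np.stack([support[labels == c].var(axis=0) + eps for c in range(n_classes)])
    return means, variances

def log_likelihoods(queries, means, variances):
    """Diagonal-Gaussian log-density of every query under every class: shape (Q, C)."""
    diff = queries[:, None, :] - means[None, :, :]            # (Q, C, D)
    mahalanobis = np.sum(diff ** 2 / variances[None], axis=-1)  # (Q, C)
    log_norm = np.sum(np.log(2.0 * np.pi * variances), axis=-1)  # (C,)
    return -0.5 * (mahalanobis + log_norm[None, :])

def classify(queries, means, variances):
    """Assign each query to the class with the highest log-likelihood."""
    return log_likelihoods(queries, means, variances).argmax(axis=1)

def compactness_penalty(support, labels, means):
    """Hypothetical regularizer: mean squared distance of examples to their own centroid."""
    return float(np.mean(np.sum((support - means[labels]) ** 2, axis=1)))

def ood_flags(queries, means, variances, threshold):
    """Flag a query as potentially OOD if no class explains it above the threshold."""
    return log_likelihoods(queries, means, variances).max(axis=1) < threshold

# Toy 2-D "embeddings": class 0 around the origin, class 1 around (10, 10).
square = np.array([[-1.0, -1.0], [1.0, 1.0], [-1.0, 1.0], [1.0, -1.0]])
support = np.concatenate([square, square + 10.0])
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])

means, variances = gaussian_prototypes(support, labels, n_classes=2)
queries = np.array([[0.5, 0.5], [10.0, 9.5], [100.0, 100.0]])

predictions = classify(queries, means, variances)
flags = ood_flags(queries, means, variances, threshold=-20.0)
penalty = compactness_penalty(support, labels, means)
```

In this sketch the first two queries fall near a class centroid and are classified accordingly, while the third lies far from both Gaussians and is flagged. The threshold of `-20.0` is arbitrary; a real deployment would need to calibrate it on held-out data.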

Authors (5)
  1. Arijit Sehanobish (20 papers)
  2. Kawshik Kannan (1 paper)
  3. Nabila Abraham (5 papers)
  4. Anasuya Das (3 papers)
  5. Benjamin Odry (5 papers)