Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

How to Pre-Train Your Model? Comparison of Different Pre-Training Models for Biomedical Question Answering (1911.00712v1)

Published 2 Nov 2019 in cs.CL and cs.LG

Abstract: Using deep learning models on small scale datasets would result in overfitting. To overcome this problem, the process of pre-training a model and fine-tuning it to the small scale dataset has been used extensively in domains such as image processing. Similarly for question answering, pre-training and fine-tuning can be done in several ways. Commonly reading comprehension models are used for pre-training, but we show that other types of pre-training can work better. We compare two pre-training models based on reading comprehension and open domain question answering models and determine the performance when fine-tuned and tested over BIOASQ question answering dataset. We find open domain question answering model to be a better fit for this task rather than reading comprehension model.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Sanjay Kamath (1 paper)
  2. Brigitte Grau (3 papers)
  3. Yue Ma (126 papers)
Citations (7)