
Incremental Adaptation Strategies for Neural Network Language Models (1412.6650v4)

Published 20 Dec 2014 in cs.NE, cs.CL, and cs.LG

Abstract: It is today acknowledged that neural network language models outperform back-off language models in applications like speech recognition or statistical machine translation. However, training these models on large amounts of data can take several days. We present efficient techniques to adapt a neural network language model to new data. Instead of training a completely new model or relying on mixture approaches, we propose two new methods: continued training on resampled data or insertion of adaptation layers. We present experimental results in a CAT (computer-aided translation) environment where the post-edits of professional translators are used to improve an SMT system. Both methods are very fast and achieve significant improvements without overfitting the small adaptation data.
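The adaptation-layer strategy can be illustrated with a short sketch. The following PyTorch code is a hypothetical illustration, not the authors' implementation: the model class, layer sizes, and training loop are assumptions chosen for exposition. The idea it demonstrates is the one named in the abstract: keep the generic model's parameters frozen, insert one extra hidden layer, and fine-tune only that layer on the small adaptation set.

```python
# Hypothetical sketch of adaptation-layer insertion for a feed-forward
# neural network LM. All names, dimensions, and hyperparameters are
# illustrative assumptions, not the paper's actual configuration.
import torch
import torch.nn as nn

class FeedForwardLM(nn.Module):
    """Toy n-gram feed-forward LM: embed a fixed context window,
    project through one hidden layer, predict the next word."""
    def __init__(self, vocab_size, context_size=4, emb_dim=128, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.hidden = nn.Linear(context_size * emb_dim, hidden_dim)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, context_ids):                     # (batch, context_size)
        e = self.embed(context_ids).flatten(1)          # (batch, context*emb)
        h = torch.tanh(self.hidden(e))                  # (batch, hidden)
        return self.out(h)                              # (batch, vocab) logits

class AdaptedLM(nn.Module):
    """Wrap a pre-trained LM, freeze it, and insert a new adaptation layer
    between the hidden layer and the output layer."""
    def __init__(self, base: FeedForwardLM, hidden_dim=256):
        super().__init__()
        self.base = base
        for p in self.base.parameters():                # freeze generic model
            p.requires_grad = False
        self.adapt = nn.Linear(hidden_dim, hidden_dim)  # new adaptation layer
        nn.init.eye_(self.adapt.weight)                 # start close to identity
        nn.init.zeros_(self.adapt.bias)

    def forward(self, context_ids):
        e = self.base.embed(context_ids).flatten(1)
        h = torch.tanh(self.base.hidden(e))
        h = torch.tanh(self.adapt(h))                   # adapted representation
        return self.base.out(h)

def adapt_on_post_edits(model: AdaptedLM, contexts, targets, epochs=5, lr=1e-3):
    """A few quick passes over the small adaptation set (e.g. translator
    post-edits); only the adaptation layer receives gradient updates."""
    params = [p for p in model.parameters() if p.requires_grad]
    opt = torch.optim.SGD(params, lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss = loss_fn(model(contexts), targets)
        loss.backward()
        opt.step()
    return model
```

The paper's other strategy, continued training on resampled data, would instead leave all parameters trainable and run a few additional epochs on a data mixture in which the new in-domain sentences are oversampled relative to the large generic corpus.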

Authors (4)
  1. Aram Ter-Sarkisov (10 papers)
  2. Holger Schwenk (35 papers)
  3. Fethi Bougares (18 papers)
  4. Loic Barrault (4 papers)
Citations (15)
