Sequence to sequence pretraining for a less-resourced Slovenian language (2207.13988v2)

Published 28 Jul 2022 in cs.CL

Abstract: Large pretrained language models have recently conquered the area of natural language processing. As an alternative to the predominant masked language modelling introduced in BERT, the T5 model introduced a more general training objective, namely sequence to sequence transformation, which includes masked language modelling but more naturally fits text generation tasks such as machine translation, summarization, question answering, text simplification, dialogue systems, etc. Monolingual variants of T5 models have been limited to well-resourced languages, while the massively multilingual T5 model supports 101 languages. In contrast, we trained two different sized T5-type sequence to sequence models for the morphologically rich Slovene language with far fewer resources and analyzed their behavior on 11 tasks. On classification tasks, the SloT5 models mostly lag behind the monolingual Slovene SloBERTa model, but they are useful for generative tasks.
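The text-to-text formulation described in the abstract (masked language modelling recast as span infilling, alongside ordinary generation tasks) can be illustrated with a minimal sketch using Hugging Face transformers. The checkpoint name "cjvt/t5-sl-small" is assumed here as a placeholder for a released SloT5 model and is not stated on this page.

# Minimal sketch: querying a T5-style Slovene checkpoint for span infilling.
# The model id below is an assumed placeholder, not confirmed by this page.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "cjvt/t5-sl-small"  # assumed identifier for the small SloT5 model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# T5 pretraining replaces spans with sentinel tokens and trains the decoder
# to reconstruct them, so masked language modelling becomes text-to-text:
#   input:  "Ljubljana je <extra_id_0> Slovenije."
#   target: "<extra_id_0> glavno mesto <extra_id_1>"
text = "Ljubljana je <extra_id_0> Slovenije."
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(outputs[0], skip_special_tokens=False))

The same text-to-text interface covers the generative tasks listed in the abstract (translation, summarization, simplification); only the input text and expected target change, and classification tasks are likewise cast as generating a label string.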

Citations (13)
