
Morphological Segmentation Inside-Out (1911.04916v2)

Published 12 Nov 2019 in cs.CL

Abstract: Morphological segmentation has traditionally been modeled with non-hierarchical models, which yield flat segmentations as output. In many cases, however, proper morphological analysis requires hierarchical structure -- especially in the case of derivational morphology. In this work, we introduce a discriminative, joint model of morphological segmentation along with the orthographic changes that occur during word formation. To the best of our knowledge, this is the first attempt to approach discriminative segmentation with a context-free model. Additionally, we release an annotated treebank of 7454 English words with constituency parses, encouraging future research in this area.
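The contrast between flat and hierarchical segmentation can be illustrated with a small sketch. It is not the paper's model: the word "unlockable", the nested-tuple tree encoding, and the helper names `flatten` and `bracket` are assumptions chosen only for illustration. It shows two different constituency structures that collapse to the same flat segmentation.

```python
from typing import List, Tuple, Union

# Hypothetical encoding: a tree is a morpheme string or a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]


def flatten(tree: Tree) -> List[str]:
    """Return the flat segmentation (the leaves, left to right)."""
    if isinstance(tree, str):
        return [tree]
    left, right = tree
    return flatten(left) + flatten(right)


def bracket(tree: Tree) -> str:
    """Render the hierarchical structure as a bracketed string."""
    if isinstance(tree, str):
        return tree
    left, right = tree
    return f"[{bracket(left)} {bracket(right)}]"


# Two readings of "unlockable" (illustrative, not taken from the treebank).
not_lockable = ("un", ("lock", "able"))      # "not able to be locked"
can_be_unlocked = (("un", "lock"), "able")   # "able to be unlocked"

for tree in (not_lockable, can_be_unlocked):
    print(bracket(tree), "->", "+".join(flatten(tree)))
```

Both trees flatten to un+lock+able, so a flat segmenter cannot tell the two readings apart, whereas the bracketed forms keep the distinction.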

Authors (3)
  1. Ryan Cotterell (226 papers)
  2. Arun Kumar (78 papers)
  3. Hinrich Schütze (250 papers)
Citations (16)
