Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish (2207.11782v1)

Published 24 Jul 2022 in cs.CL

Abstract: In this study, we aim to offer linguistically motivated solutions to resolve the issues of the lack of representation of null morphemes, highly productive derivational processes, and syncretic morphemes of Turkish in the BOUN Treebank without diverging from the Universal Dependencies framework. In order to tackle these issues, new annotation conventions were introduced by splitting certain lemmas and employing the MISC (miscellaneous) tab in the UD framework to denote derivation. Representational capabilities of the re-annotated treebank were tested on a LSTM-based dependency parser and an updated version of the BoAT Tool is introduced.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Büşra Marşan (2 papers)
  2. Salih Furkan Akkurt (2 papers)
  3. Muhammet Şen (1 paper)
  4. Merve Gürbüz (1 paper)
  5. Onur Güngör (15 papers)
  6. Şaziye Betül Özateş (5 papers)
  7. Suzan Üsküdarlı (4 papers)
  8. Arzucan Özgür (24 papers)
  9. Tunga Güngör (15 papers)
  10. Balkız Öztürk (2 papers)
Citations (6)