Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing (2210.09537v2)
Abstract: Complex feature extractors are widely employed to build text representations. However, these complex feature extractors make NLP systems prone to overfitting, especially when the downstream training datasets are relatively small, as is the case for several discourse parsing tasks. We therefore propose an alternative lightweight neural architecture that removes the multiple complex feature extractors and uses only learnable self-attention modules to indirectly exploit pre-trained language models, so as to maximally preserve their generalizability. Experiments on three common discourse parsing tasks show that, powered by recent pre-trained language models, the lightweight architecture consisting of only two self-attention layers obtains much better generalizability and robustness. Meanwhile, it achieves comparable or even better system performance with fewer learnable parameters and less processing time.
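To make the described architecture concrete, the following is a minimal sketch (not the authors' released code) of the idea the abstract outlines: a lightweight head with two learnable self-attention layers stacked on top of frozen representations from a pre-trained language model. The class name, hidden size, head count, and label count are illustrative assumptions.

```python
# Minimal sketch, assuming PyTorch: two self-attention layers over frozen
# pre-trained language-model (PLM) token states, followed by a linear
# classifier. All names and dimensions here are illustrative, not the paper's.
from typing import Optional

import torch
import torch.nn as nn


class LightweightHead(nn.Module):
    """Lightweight parsing head: two self-attention layers, no extra feature extractors."""

    def __init__(self, hidden_size: int = 768, num_heads: int = 8, num_labels: int = 20):
        super().__init__()
        self.attn1 = nn.MultiheadAttention(hidden_size, num_heads, batch_first=True)
        self.attn2 = nn.MultiheadAttention(hidden_size, num_heads, batch_first=True)
        self.classifier = nn.Linear(hidden_size, num_labels)

    def forward(
        self,
        plm_states: torch.Tensor,                      # (batch, seq_len, hidden_size) from a frozen PLM
        key_padding_mask: Optional[torch.Tensor] = None,  # (batch, seq_len), True at padding positions
    ) -> torch.Tensor:
        # First learnable self-attention layer over the frozen PLM states.
        x, _ = self.attn1(plm_states, plm_states, plm_states,
                          key_padding_mask=key_padding_mask)
        # Second self-attention layer; these two layers are the only learned encoder parameters.
        x, _ = self.attn2(x, x, x, key_padding_mask=key_padding_mask)
        return self.classifier(x)  # per-token label logits


if __name__ == "__main__":
    # Toy usage with random tensors standing in for PLM outputs.
    head = LightweightHead()
    states = torch.randn(2, 16, 768)
    logits = head(states)
    print(logits.shape)  # torch.Size([2, 16, 20])
```

Because the pre-trained model is used only as a frozen representation provider, the number of learnable parameters stays small, which is the mechanism the abstract credits for the improved generalizability and reduced processing time.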