Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

NeCo@ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment (2309.05500v1)

Published 11 Sep 2023 in cs.CL and cs.AI

Abstract: In recent years, natural language processing has gained significant popularity in various sectors, including the legal domain. This paper presents NeCo Team's solutions to the Vietnamese text processing tasks provided in the Automated Legal Question Answering Competition 2023 (ALQAC 2023), focusing on legal domain knowledge acquisition for low-resource languages through data enrichment. Our methods for the legal document retrieval task employ a combination of similarity ranking and deep learning models, while for the second task, which requires extracting an answer from a relevant legal article in response to a question, we propose a range of adaptive techniques to handle different question types. Our approaches achieve outstanding results on both tasks of the competition, demonstrating the potential benefits and effectiveness of question answering systems in the legal field, particularly for low-resource languages.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Hai-Long Nguyen (6 papers)
  2. Dieu-Quynh Nguyen (1 paper)
  3. Hoang-Trung Nguyen (4 papers)
  4. Thu-Trang Pham (1 paper)
  5. Huu-Dong Nguyen (1 paper)
  6. Thach-Anh Nguyen (1 paper)
  7. Thi-Hai-Yen Vuong (13 papers)
  8. Ha-Thanh Nguyen (33 papers)
Citations (2)