2000 character limit reached
VnCoreNLP: A Vietnamese Natural Language Processing Toolkit (1801.01331v2)
Published 4 Jan 2018 in cs.CL
Abstract: We present an easy-to-use and fast toolkit, namely VnCoreNLP---a Java NLP annotation pipeline for Vietnamese. Our VnCoreNLP supports key NLP tasks including word segmentation, part-of-speech (POS) tagging, named entity recognition (NER) and dependency parsing, and obtains state-of-the-art (SOTA) results for these tasks. We release VnCoreNLP to provide rich linguistic annotations to facilitate research work on Vietnamese NLP. Our VnCoreNLP is open-source and available at: https://github.com/vncorenlp/VnCoreNLP
- Thanh Vu (59 papers)
- Dat Quoc Nguyen (55 papers)
- Dai Quoc Nguyen (26 papers)
- Mark Dras (38 papers)
- Mark Johnson (46 papers)