Fast and Accurate Neural CRF Constituency Parsing

Published 9 Aug 2020 in cs.CL | (2008.03736v1)

Abstract: Estimating probability distribution is one of the core issues in the NLP field. However, in both deep learning (DL) and pre-DL eras, unlike the vast applications of linear-chain CRF in sequence labeling tasks, very few works have applied tree-structure CRF to constituency parsing, mainly due to the complexity and inefficiency of the inside-outside algorithm. This work presents a fast and accurate neural CRF constituency parser. The key idea is to batchify the inside algorithm for loss computation by direct large tensor operations on GPU, and meanwhile avoid the outside algorithm for gradient computation via efficient back-propagation. We also propose a simple two-stage bracketing-then-labeling parsing approach to improve efficiency further. To improve the parsing performance, inspired by recent progress in dependency parsing, we introduce a new scoring architecture based on boundary representation and biaffine attention, and a beneficial dropout strategy. Experiments on PTB, CTB5.1, and CTB7 show that our two-stage CRF parser achieves new state-of-the-art performance on both settings of w/o and w/ BERT, and can parse over 1,000 sentences per second. We release our code at https://github.com/yzhangcs/crfpar.

Citations (84)

Summary

  • The paper introduces a novel neural CRF constituency parser that accelerates parsing with GPU-batched inside algorithm computations.
  • It employs a two-stage process using span boundary representation paired with biaffine attention to optimize syntactic analysis.
  • Experimental results on Penn and Chinese Treebanks show the model processes over 1,000 sentences per second while outperforming previous state-of-the-art methods.

An Analysis of Neural CRF Constituency Parsing Methods

The paper, "Fast and Accurate Neural CRF Constituency Parsing," presents a significant contribution to NLP, specifically to syntactic parsing. Constituency parsing, a cornerstone NLP task, seeks to build a hierarchical tree that describes the syntactic structure of a given sentence. Historically, the task was dominated by probabilistic context-free grammar (PCFG) models and their various extensions. This work introduces a neural conditional random field (CRF) constituency parser that substantially improves parsing efficiency while matching or exceeding state-of-the-art accuracy.

Methodology Overview

The paper proposes a series of optimizations and architectural choices to address the inefficiencies of earlier CRF-based constituency parsers. A key innovation is batching the inside algorithm, which is integral to these parsers but notoriously resource-intensive, so that its computations run as large tensor operations on the GPU. This yields substantial speed improvements without sacrificing parsing accuracy. The approach also sidesteps the complexity of the outside algorithm for gradient computation, relying instead on efficient back-propagation.
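The batching idea can be illustrated with a small sketch. The NumPy code below is a minimal illustration, not the authors' PyTorch implementation; the function name and the zeroth-order span scoring are assumptions for the example. It computes the CRF log-partition with the inside algorithm, filling all spans of the same width in one vectorized step rather than looping over individual chart cells — the same idea the paper uses to turn the inside recursion into large GPU tensor operations:

```python
import numpy as np

def batched_inside(scores):
    """Log-partition log Z of a span-based CRF over binary trees.

    scores[i, j] holds the score of span (i, j) for 0 <= i < j <= n.
    All spans of equal width w are computed in one vectorized step,
    instead of iterating over every (i, j) cell individually.
    """
    n = scores.shape[0] - 1
    inside = np.full((n + 1, n + 1), -np.inf)
    idx = np.arange(n)
    inside[idx, idx + 1] = scores[idx, idx + 1]   # width-1 spans: no split point
    for w in range(2, n + 1):
        i = np.arange(n - w + 1)                  # every left boundary at once
        j = i + w
        ks = i[:, None] + np.arange(1, w)         # all split points, shape (spans, w-1)
        # combine left sub-span (i, k) with right sub-span (k, j)
        parts = inside[i[:, None], ks] + inside[ks, j[:, None]]
        inside[i, j] = scores[i, j] + np.logaddexp.reduce(parts, axis=1)
    return inside[0, n]

# With all-zero scores every tree has weight 1, so Z simply counts the
# binary trees over n words, i.e. the Catalan number C(n-1).
print(np.exp(batched_inside(np.zeros((4, 4)))))
```

On GPU, the per-width step becomes a single batched tensor operation over all sentences and spans at once, which is where the speedup comes from.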

The parsing model is divided into a two-stage process: bracketing followed by labeling. This contrasts with traditional one-stage parsing approaches and is shown to improve efficiency while maintaining accuracy. The scoring architecture is redesigned around span boundary representations combined with biaffine attention, which outperforms earlier span representations that relied on boundary difference vectors.
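The boundary-plus-biaffine scoring admits an equally compact sketch. In the snippet below, `left` and `right` stand in for the left- and right-boundary representations the model would produce (here just random vectors; the names, dimensions, and single shared biaffine matrix are illustrative assumptions), and one matrix product scores every candidate span at once:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 6, 8                                  # sentence length, boundary-repr size
left = rng.standard_normal((n + 1, d))       # left-boundary representations
right = rng.standard_normal((n + 1, d))      # right-boundary representations
W = rng.standard_normal((d + 1, d + 1))      # biaffine weights (+1 for bias dims)

# Appending a constant 1 lets W also capture the linear terms, in the
# style of biaffine attention from neural dependency parsing.
left_b = np.concatenate([left, np.ones((n + 1, 1))], axis=1)
right_b = np.concatenate([right, np.ones((n + 1, 1))], axis=1)

# scores[i, j] = left_b[i] @ W @ right_b[j]: one matmul scores all spans.
scores = left_b @ W @ right_b.T
assert np.isclose(scores[2, 5], left_b[2] @ W @ right_b[5])
```

Because every span score falls out of a single dense product, this pairs naturally with the batched inside computation above it in the pipeline.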

Experimental Findings

Extensive experiments on the Penn Treebank (PTB) and the Chinese Treebank (CTB5.1 and CTB7) demonstrate the effectiveness of these innovations. A key result is the parser's capability to process over 1,000 sentences per second, a throughput significantly surpassing previous models. More critically, the parser achieves superior performance across all benchmark datasets, outperforming existing state-of-the-art models in both English and Chinese settings.

Implications and Future Perspectives

The implications of this work are twofold. Practically, the improvements in parsing speed and accuracy mean that the proposed model is an attractive option for real-time applications that require syntactic analysis as a foundational step. Theoretically, the successful integration of CRF principles with modern neural network architectures opens new avenues for research, particularly in other structured prediction tasks within NLP.

Future work could explore further refinements to the biaffine attention mechanism and extend these methods to other languages and parsing tasks. Additionally, leveraging pre-trained contextual embeddings such as BERT has already shown promise, suggesting that integrating the parser with other pre-trained language models could further enhance performance.

In conclusion, the paper makes substantial strides in neural CRF constituency parsing by addressing efficiency bottlenecks and improving accuracy. The proposed techniques not only advance the field of syntactic parsing but also provide a robust framework for future research and development in NLP applications.
