Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Graph Pre-training for AMR Parsing and Generation (2203.07836v4)

Published 15 Mar 2022 in cs.CL

Abstract: Abstract meaning representation (AMR) highlights the core semantic information of text in a graph structure. Recently, pre-trained LLMs (PLMs) have advanced tasks of AMR parsing and AMR-to-text generation, respectively. However, PLMs are typically pre-trained on textual data, thus are sub-optimal for modeling structural knowledge. To this end, we investigate graph self-supervised training to improve the structure awareness of PLMs over AMR graphs. In particular, we introduce two graph auto-encoding strategies for graph-to-graph pre-training and four tasks to integrate text and graph information during pre-training. We further design a unified framework to bridge the gap between pre-training and fine-tuning tasks. Experiments on both AMR parsing and AMR-to-text generation show the superiority of our model. To our knowledge, we are the first to consider pre-training on semantic graphs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Xuefeng Bai (34 papers)
  2. Yulong Chen (32 papers)
  3. Yue Zhang (618 papers)
Citations (87)