
Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler (2309.05086v2)

Published 10 Sep 2023 in cs.CL and cs.AI

Abstract: We propose a neuralized undirected graphical model called Neural-Hidden-CRF to solve the weakly-supervised sequence labeling problem. Under the umbrella of probabilistic undirected graph theory, the proposed Neural-Hidden-CRF embedded with a hidden CRF layer models the variables of word sequence, latent ground truth sequence, and weak label sequence with the global perspective that undirected graphical models particularly enjoy. In Neural-Hidden-CRF, we can capitalize on the powerful LLM BERT or other deep models to provide rich contextual semantic knowledge to the latent ground truth sequence, and use the hidden CRF layer to capture the internal label dependencies. Neural-Hidden-CRF is conceptually simple and empirically powerful. It obtains new state-of-the-art results on one crowdsourcing benchmark and three weak-supervision benchmarks, including outperforming the recent advanced model CHMM by 2.80 F1 points and 2.23 F1 points in average generalization and inference performance, respectively.
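As a rough illustration of what "capturing internal label dependencies" means, the sketch below runs Viterbi decoding over a generic linear-chain CRF. It is not the paper's hidden CRF, which additionally models the weak label sequence; it only shows how per-token emission scores (which a model such as BERT would supply) combine with label-to-label transition scores. The function name `viterbi_decode`, the label set, and all scores are made up for this example.

```python
def viterbi_decode(emissions, transitions):
    """Return the highest-scoring label path and its score.

    emissions[t][k]   : score of label k at token t (assumed to come from an encoder)
    transitions[i][j] : score of moving from label i to label j
    """
    num_labels = len(transitions)
    # Best score of any path ending in each label at the first token.
    scores = list(emissions[0])
    backpointers = []
    for t in range(1, len(emissions)):
        new_scores, pointers = [], []
        for j in range(num_labels):
            # Best previous label for landing on label j at token t.
            best_prev = max(range(num_labels),
                            key=lambda i: scores[i] + transitions[i][j])
            pointers.append(best_prev)
            new_scores.append(scores[best_prev] + transitions[best_prev][j]
                              + emissions[t][j])
        scores = new_scores
        backpointers.append(pointers)
    # Trace the best path backwards from the best final label.
    last = max(range(num_labels), key=lambda k: scores[k])
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, scores[last]


# Hypothetical labels: 0 = O, 1 = B, 2 = I. The O -> I transition is penalized.
emissions = [[2.0, 0.5, 0.0],
             [0.0, 1.0, 1.2],
             [0.0, 0.2, 1.5]]
transitions = [[0.0, 0.0, -10.0],
               [0.0, 0.0, 1.0],
               [0.0, 0.0, 1.0]]
path, score = viterbi_decode(emissions, transitions)
print(path, score)
```

In this toy input, picking the best label per token independently would give O, I, I, which violates the tagging scheme; the transition scores steer the joint decoding to O, B, I instead.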

Authors (6)
  1. Zhijun Chen (17 papers)
  2. Hailong Sun (23 papers)
  3. Wanhao Zhang (1 paper)
  4. Chunyi Xu (1 paper)
  5. Qianren Mao (13 papers)
  6. Pengpeng Chen (4 papers)
Citations (4)
