Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Sequence Modeling via Segmentations (1702.07463v7)

Published 24 Feb 2017 in stat.ML and cs.LG

Abstract: Segmental structure is a common pattern in many types of sequences such as phrases in human languages. In this paper, we present a probabilistic model for sequences via their segmentations. The probability of a segmented sequence is calculated as the product of the probabilities of all its segments, where each segment is modeled using existing tools such as recurrent neural networks. Since the segmentation of a sequence is usually unknown in advance, we sum over all valid segmentations to obtain the final probability for the sequence. An efficient dynamic programming algorithm is developed for forward and backward computations without resorting to any approximation. We demonstrate our approach on text segmentation and speech recognition tasks. In addition to quantitative results, we also show that our approach can discover meaningful segments in their respective application contexts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Chong Wang (308 papers)
  2. Yining Wang (91 papers)
  3. Po-Sen Huang (30 papers)
  4. Abdelrahman Mohamed (59 papers)
  5. Dengyong Zhou (20 papers)
  6. Li Deng (76 papers)
Citations (44)

Summary

We haven't generated a summary for this paper yet.