- The paper proposes an unsupervised method using BERT embeddings for topic segmentation in meeting transcripts, addressing the data scarcity problem of supervised approaches.
- The unsupervised BERT-based method reduced segmentation error by 15.5% relative to existing unsupervised methods and by 26.6% relative to supervised baselines on meeting datasets such as ICSI and AMI.
- It enhances the TextTiling process by using BERT-based semantic similarity scores for segmentation, improving indexing and understanding of meeting transcripts.
Unsupervised Artificial Neural Networks for Business Meeting Topic Segmentation
The research paper "Reti Neurali Artificiali Non Supervisionate per la Suddivisione di Riunioni Aziendali in Argomenti" ("Unsupervised Artificial Neural Networks for Segmenting Business Meetings into Topics") explores an innovative approach to topic segmentation of business meeting transcripts. Because traditional supervised methods are hampered by the scarcity of quality annotated data, the paper proposes a robust unsupervised strategy built on pre-trained neural architectures, specifically BERT embeddings.
Overview
Business meetings, abundant in today's corporate environment, produce extensive transcripts thanks to the growing convenience of automatic speech recognition (ASR) systems. The paper addresses the critical task of topic segmentation: dividing these texts into coherent blocks. Such segmentation improves indexing for subsequent retrieval and helps users grasp a meeting's content without an exhaustive manual review.
The researchers introduce an unsupervised model based on BERT embeddings, reporting a 15.5% relative reduction in error rate compared to prevailing unsupervised methods. They further report a 26.6% relative improvement over contemporary supervised models when applied to datasets such as the ICSI Meeting Corpus and AMI Meeting Corpus.
Methodology
The paper delineates a semantic representation framework leveraging BERT and Sentence-BERT models. Sentence embeddings are computed by max-pooling the token representations from BERT's penultimate layer, a choice intended to capture robust semantics while filtering out the linguistic noise typical of spoken content.
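The pooling step can be sketched as follows. The token vectors here are toy NumPy arrays standing in for BERT's penultimate-layer outputs; the function name and dimensions are illustrative, not taken from the paper:

```python
import numpy as np

def pool_sentence_embedding(token_vectors: np.ndarray) -> np.ndarray:
    """Max-pool token-level vectors (tokens x dim) into one sentence vector.

    In the paper's setup the token vectors would come from BERT's
    penultimate layer; here they are plain arrays for illustration.
    """
    return token_vectors.max(axis=0)

# Toy example: 3 tokens, 4-dimensional vectors.
tokens = np.array([
    [0.1, 0.9, -0.2,  0.0],
    [0.5, 0.3,  0.4, -1.0],
    [-0.3, 0.7, 0.8,  0.2],
])
sentence_vec = pool_sentence_embedding(tokens)
print(sentence_vec)  # [0.5 0.9 0.8 0.2]
```

Max-pooling keeps, per dimension, the strongest activation across tokens, which is one way to make the sentence vector less sensitive to filler words in spoken language.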
Conventional supervised models relying on BiLSTM architectures necessitate extensive labeled datasets, which limits their efficacy in the domain of noisy meeting transcripts. Instead, the proposed approach circumvents this by adopting an entirely unsupervised mechanism that does not demand labeled training data. The method builds upon the foundational TextTiling process, enhancing its capabilities by utilizing BERT-based semantic similarity scores rather than simple word frequency metrics.
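The enhanced TextTiling pipeline described above can be sketched as follows: sentence embeddings replace word-frequency vectors, each gap between sentences is scored by the cosine similarity of its neighbouring windows, and depth-score peaks become topic boundaries. The window size, thresholding rule, and all names are illustrative assumptions, not the paper's exact hyperparameters:

```python
import numpy as np

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def segment(sent_embs, window=2, threshold=None):
    """TextTiling-style segmentation over sentence embeddings.

    For each gap, score = cosine similarity between the mean embedding of
    up to `window` sentences before and after the gap; depth scores that
    exceed `threshold` at a local maximum become boundaries.
    """
    n = len(sent_embs)
    gaps = []
    for i in range(1, n):
        left = sent_embs[max(0, i - window):i].mean(axis=0)
        right = sent_embs[i:min(n, i + window)].mean(axis=0)
        gaps.append(cosine(left, right))
    # Depth score: how far a gap dips below the peaks on either side.
    depths = []
    for i, g in enumerate(gaps):
        lpeak = max(gaps[:i + 1])
        rpeak = max(gaps[i:])
        depths.append((lpeak - g) + (rpeak - g))
    if threshold is None:
        # Cutoff in the spirit of TextTiling's mean/std heuristic.
        threshold = float(np.mean(depths) + np.std(depths) / 2)
    # A boundary before sentence i+1 when its depth is a thresholded local max.
    return [i + 1 for i, d in enumerate(depths)
            if d >= threshold
            and (i == 0 or d >= depths[i - 1])
            and (i == len(depths) - 1 or d >= depths[i + 1])]

# Toy transcript: three sentences about one topic, then three about another.
embs = np.array([[1.0, 0.0], [0.9, 0.1], [1.0, 0.05],
                 [0.0, 1.0], [0.1, 0.9], [0.0, 1.0]])
print(segment(embs))  # boundary before sentence 3 -> [3]
```

Swapping cosine similarity of contextual embeddings for raw word overlap is the core of the paper's enhancement: two sentences can share almost no vocabulary yet still score as topically continuous.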
Evaluation and Results
The paper evaluates the proposed model against both naive baselines (Random and Even segmentation) and supervised models trained on textually divergent datasets such as Wikipedia. On the standard metrics Pk and WindowDiff, the BERT-based embeddings achieve discernibly lower error rates. The tabulated results present a compelling case, showing the method's improvement over TextTiling and supervised systems on both the AMI and ICSI datasets.
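Pk, one of the two reported metrics, can be computed as in the minimal reimplementation below. The boundary-string encoding and the default choice of k follow common practice for this metric (e.g. as in NLTK) rather than anything specific to this paper:

```python
def pk(reference, hypothesis, k=None):
    """Pk: probability that a random window of k positions is judged
    inconsistently (same segment vs. different segments) by the
    reference and the hypothesis. Lower is better.

    Segmentations are strings where '1' marks a position that starts
    a new segment and '0' marks a continuation.
    """
    n = len(reference)
    if k is None:
        # Conventional choice: half the mean reference segment length.
        k = max(1, round(n / (reference.count('1') + 1) / 2))
    errors = 0
    for i in range(n - k + 1):
        ref_has_boundary = '1' in reference[i:i + k]
        hyp_has_boundary = '1' in hypothesis[i:i + k]
        errors += ref_has_boundary != hyp_has_boundary
    return errors / (n - k + 1)

# Hypothesis places one boundary a sentence late; only a few windows disagree.
score = pk("0100010000", "0010010000", k=3)
print(score)  # 0.125
```

WindowDiff, the second metric, refines this by comparing boundary *counts* inside each window instead of mere presence, which penalizes near-miss and over-segmentation errors more evenly.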
Implications and Future Directions
The implications of this research extend both theoretically and practically within AI-driven language processing fields. By demonstrating an effective unsupervised methodology, this paper opens avenues for more adaptable AI models that can operate with minimal data dependence. Future research could aim at incorporating multimodal signals such as speaker characteristics and agenda items, and explore related tasks like abstractive meeting summarization.
In conclusion, this paper provides a substantial contribution to the field of computational linguistics, notably advancing the accuracy and applicability of unsupervised topic segmentation within complex, real-world data environments. The robust approach outlined, centered around BERT embeddings, further encourages exploration into enriched semantic signals and cross-domain applicability.