Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
166 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors (2010.15258v2)

Published 28 Oct 2020 in cs.SD, cs.LG, and eess.AS

Abstract: Human subjective evaluation is the gold standard to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. The conventional and widely used metrics require a reference clean speech signal, which is unavailable in real recordings. The no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community. One of the biggest use cases of these perceptual objective metrics is to evaluate noise suppression algorithms. This paper introduces a multi-stage self-teaching based perceptual objective metric that is designed to evaluate noise suppressors. The proposed method generalizes well in challenging test conditions with a high correlation to human ratings.

Citations (259)

Summary

  • The paper introduces DNSMOS, a metric designed to evaluate noise suppressors without requiring a clean reference signal.
  • It utilizes perceptual modeling techniques to estimate speech quality directly from degraded audio samples.
  • Empirical results show a strong correlation between DNSMOS scores and subjective human assessments, confirming its practical effectiveness.

Overview of ICASSP 2020 Proceedings Manuscript Guidelines

The document outlines comprehensive instructions for authors preparing manuscripts for the ICASSP 2020 proceedings. It serves to standardize the format and presentation of scholarly work submitted to this prominent conference, ensuring consistency across contributed papers. This document is crucial for authors aiming to have their work included in the ICASSP 2020 compendium, focusing on aspects from formatting and typesetting to figure placement and prior work discussions.

Formatting and Style Specifications

The document emphasizes strict adherence to precise formatting requirements. It delineates technical specifications including manuscript dimensions, font choice, and layout format. Notably, authors must conform to a two-column layout with defined dimensions, maintain specified margins, utilize Times-Roman font, and ensure text fully justifies within columns. The emphasis on printed material consistency ensures uniformity across all papers presented at the conference.

The document specifies the protocol for typographic elements, including major headings, subheadings, and pagination details, encouraging a professional and streamlined presentation. Headings such as "Introduction" and "References" must follow predefined casing and placement rules to aid efficient navigation of the manuscript by readers. Authors are advised against incorporating manual page numbers, as these will be managed in the final proceedings assembly.

Inclusion and Presentation of Illustrations

The guidelines provide a detailed methodology for handling illustrations, graphs, and photographs that accompany technical manuscripts. Authors are directed to position illustrations preferably at the top of the columns and ensure captions and numbering for each figure to promote clarity. This meticulous attention to graphical representation is critical for the transmission of complex experimental results and analysis in a coherent manner.

A noteworthy inclusion is illustrative best practices for accommodating figures within a LaTeX document, addressing common challenges authors face in scientific typesetting. This technical insight supports authors in achieving high-quality visual representation in their submissions, which is imperative for the accurate and effective communication of research findings.

Discourse on Prior Work

A significant aspect emphasized in the guidelines is the required discussion of prior work and its relation to the new contributions made in the paper. The document highlights the importance of situating new research within the existing body of literature, differentiating current findings from past efforts and delineating innovative features. This retrospective analysis is designed to offer readers a comprehensive context for the presented work and to illustrate its novel aspects.

Implications and Future Work

The ICASSP 2020 manuscript guidelines not only function as an instructional document but also embody the greater objective of advancing coherent and scholarly communication within the signal processing community. By imposing rigorous formatting and content criteria, the guidelines facilitate the dissemination of high-quality research and contribute to the ongoing development and sharing of knowledge in this dynamic field.

As AI and signal processing domains continue to evolve, it is plausible that future developments may necessitate updates to guidelines such as these to accommodate new advances in typesetting technologies, digital media formats, and collaboration tools. Continuing evolution and adaptation of these guidelines will remain critical to maintaining authoritative and effective presentation standards in academic publishing.