
Deep Griffin-Lim Iteration (1903.03971v1)

Published 10 Mar 2019 in cs.SD, cs.LG, and eess.AS

Abstract: This paper presents a novel method for reconstructing phase from a given amplitude spectrogram alone by combining a signal-processing-based approach with a deep neural network (DNN). Retrieving a time-domain signal from its amplitude spectrogram requires the corresponding phase. One popular phase reconstruction method is the Griffin-Lim algorithm (GLA), which exploits the redundancy of the short-time Fourier transform. However, GLA often requires many iterations and produces low-quality signals because it lacks prior knowledge of the target signal. To address these issues, we propose an architecture that stacks sub-blocks, each consisting of two GLA-inspired fixed layers and a DNN. The number of stacked sub-blocks is adjustable, so performance can be traded off against computational load according to the requirements of the application. The effectiveness of the proposed method is investigated by reconstructing phases from amplitude spectrograms of speech.
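For readers unfamiliar with GLA, the following is a minimal sketch of the classical Griffin-Lim iteration, not the DNN-based architecture proposed in the paper. It alternates two steps: a consistency step (inverse STFT followed by STFT) and an amplitude step (keep the estimated phase, restore the given amplitude). The function names `stft`, `istft`, and `griffin_lim`, the Hann window, the hop size, and the random phase initialization are all assumptions made for illustration; none of them come from the abstract.

```python
import numpy as np

def stft(x, win, hop):
    """One-sided STFT of a real signal; rows are frames (illustrative helper)."""
    n = len(win)
    frames = 1 + (len(x) - n) // hop
    X = np.empty((frames, n // 2 + 1), dtype=complex)
    for t in range(frames):
        X[t] = np.fft.rfft(win * x[t * hop : t * hop + n])
    return X

def istft(X, win, hop, length):
    """Least-squares inverse STFT via weighted overlap-add (illustrative helper)."""
    n = len(win)
    x = np.zeros(length)
    norm = np.zeros(length)
    for t in range(X.shape[0]):
        seg = np.fft.irfft(X[t], n)
        x[t * hop : t * hop + n] += win * seg
        norm[t * hop : t * hop + n] += win ** 2
    # Guard against division by zero where the window weights vanish.
    return x / np.maximum(norm, 1e-8)

def griffin_lim(amplitude, win, hop, length, n_iter=50, seed=0):
    """Classical GLA: alternate consistency and amplitude projections."""
    rng = np.random.default_rng(seed)
    # Assumption: start from uniformly random phase.
    X = amplitude * np.exp(2j * np.pi * rng.random(amplitude.shape))
    for _ in range(n_iter):
        # Consistency step: project onto spectrograms of actual signals.
        C = stft(istft(X, win, hop, length), win, hop)
        # Amplitude step: keep the phase of C, restore the given amplitude.
        X = amplitude * np.exp(1j * np.angle(C))
    return istft(X, win, hop, length)
```

A typical call would compute `A = np.abs(stft(x, np.hanning(256), 64))` for some signal `x` and then run `griffin_lim(A, np.hanning(256), 64, len(x))`. Because each iteration needs one inverse and one forward STFT, the cost grows linearly with `n_iter`, which is the computational load the abstract refers to.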

Authors (5)
  1. Yoshiki Masuyama (30 papers)
  2. Kohei Yatabe (39 papers)
  3. Yuma Koizumi (39 papers)
  4. Yasuhiro Oikawa (14 papers)
  5. Noboru Harada (48 papers)
Citations (55)
