
Autoencoder Based Architecture For Fast & Real Time Audio Style Transfer (1812.07159v2)

Published 18 Dec 2018 in cs.SD, cs.LG, eess.AS, and stat.ML

Abstract: Recently, there has been great interest in the field of audio style transfer, where a stylized audio is generated by imposing the style of a reference audio on the content of a target audio. We improve on the current approaches which use neural networks to extract the content and the style of the audio signal and propose a new autoencoder based architecture for the task. This network generates a stylized audio for a content audio in a single forward pass. The proposed network architecture proves to be advantageous over the quality of audio produced and the time taken to train the network. The network is experimented on speech signals to confirm the validity of our proposal.

Authors (5)
  1. Dhruv Ramani (4 papers)
  2. Samarjit Karmakar (4 papers)
  3. Anirban Panda (3 papers)
  4. Asad Ahmed (2 papers)
  5. Pratham Tangri (2 papers)
Citations (4)
