Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PixInWav: Residual Steganography for Hiding Pixels in Audio (2106.09814v1)

Published 17 Jun 2021 in cs.MM, cs.SD, and eess.AS

Abstract: Steganography comprises the mechanics of hiding data in a host media that may be publicly available. While previous works focused on unimodal setups (e.g., hiding images in images, or hiding audio in audio), PixInWav targets the multimodal case of hiding images in audio. To this end, we propose a novel residual architecture operating on top of short-time discrete cosine transform (STDCT) audio spectrograms. Among our results, we find that the residual audio steganography setup we propose allows independent encoding of the hidden image from the host audio without compromising quality. Accordingly, while previous works require both host and hidden signals to hide a signal, PixInWav can encode images offline -- which can be later hidden, in a residual fashion, into any audio signal. Finally, we test our scheme in a lab setting to transmit images over airwaves from a loudspeaker to a microphone verifying our theoretical insights and obtaining promising results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Margarita Geleta (7 papers)
  2. Cristina Punti (1 paper)
  3. Kevin McGuinness (76 papers)
  4. Jordi Pons (36 papers)
  5. Cristian Canton (1 paper)
  6. Xavier Giro-i-Nieto (69 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.