
Text-to-Image-to-Text Translation using Cycle Consistent Adversarial Networks (1808.04538v1)

Published 14 Aug 2018 in cs.LG, cs.CL, cs.CV, and stat.ML

Abstract: Text-to-image translation has been an active area of research in recent years. The ability of a network to learn the meaning of a sentence and generate an accurate image that depicts it shows the ability of the model to think more like humans. Popular methods for text-to-image translation use Generative Adversarial Networks (GANs) to generate high-quality images from text input, but the generated images don't always reflect the meaning of the sentence given to the model as input. We address this issue by using a captioning network to caption the generated images and exploiting the distance between ground-truth captions and generated captions to improve the network further. We show extensive comparisons between our method and existing methods.
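
The idea in the abstract, penalizing the generator when a caption produced from its image drifts from the ground-truth caption, can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which distance is used, so token-level cross-entropy is assumed here, and the names `caption_distance`, `generator_loss`, and `cycle_weight` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one list of vocabulary logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def caption_distance(caption_logits, target_tokens):
    """Mean token-level cross-entropy between the captioner's predictions
    on a generated image and the ground-truth caption.

    Assumption: cross-entropy stands in for the unspecified "distance".
    caption_logits: one list of vocabulary logits per caption position.
    target_tokens: ground-truth token ids, one per caption position.
    """
    if len(caption_logits) != len(target_tokens):
        raise ValueError("caption length mismatch")
    nll = 0.0
    for logits, token in zip(caption_logits, target_tokens):
        nll -= math.log(softmax(logits)[token])
    return nll / len(target_tokens)


def generator_loss(adversarial_loss, caption_logits, target_tokens,
                   cycle_weight=1.0):
    """Adversarial term plus a weighted caption-consistency term.

    cycle_weight is a hypothetical hyperparameter balancing the two terms.
    """
    return adversarial_loss + cycle_weight * caption_distance(
        caption_logits, target_tokens)
```

Under these assumptions, a generated image whose caption matches the input sentence yields a small consistency term, while one that drifts from the sentence adds to the generator's loss.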

Authors (2)
  1. Satya Krishna Gorti (9 papers)
  2. Jeremy Ma (4 papers)
Citations (27)
