Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fine-Grained Image Generation from Bangla Text Description using Attentional Generative Adversarial Network (2109.11749v1)

Published 24 Sep 2021 in cs.CV

Abstract: Generating fine-grained, realistic images from text has many applications in the visual and semantic realm. Considering that, we propose Bangla Attentional Generative Adversarial Network (AttnGAN) that allows intensified, multi-stage processing for high-resolution Bangla text-to-image generation. Our model can integrate the most specific details at different sub-regions of the image. We distinctively concentrate on the relevant words in the natural language description. This framework has achieved a better inception score on the CUB dataset. For the first time, a fine-grained image is generated from Bangla text using attentional GAN. Bangla has achieved 7th position among 100 most spoken languages. This inspires us to explicitly focus on this language, which will ensure the inevitable need of many people. Moreover, Bangla has a more complex syntactic structure and less natural language processing resource that validates our work more.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Md Aminul Haque Palash (5 papers)
  2. Aditi Dhali (2 papers)
  3. Faria Afrin (3 papers)
  4. MD Abdullah Al Nasim (27 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.