Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Survey of AI Text-to-Image and AI Text-to-Video Generators (2311.06329v1)

Published 10 Nov 2023 in cs.CV, cs.AI, cs.CL, cs.LG, and eess.IV

Abstract: Text-to-Image and Text-to-Video AI generation models are revolutionary technologies that use deep learning and NLP techniques to create images and videos from textual descriptions. This paper investigates cutting-edge approaches in the discipline of Text-to-Image and Text-to-Video AI generations. The survey provides an overview of the existing literature as well as an analysis of the approaches used in various studies. It covers data preprocessing techniques, neural network types, and evaluation metrics used in the field. In addition, the paper discusses the challenges and limitations of Text-to-Image and Text-to-Video AI generations, as well as future research directions. Overall, these models have promising potential for a wide range of applications such as video production, content creation, and digital marketing.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (1)
  1. Aditi Singh (19 papers)
Citations (11)