Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation (2303.08137v1)

Published 14 Mar 2023 in cs.CV and cs.GR

Abstract: Controllable layout generation aims at synthesizing plausible arrangement of element bounding boxes with optional constraints, such as type or position of a specific element. In this work, we try to solve a broad range of layout generation tasks in a single model that is based on discrete state-space diffusion models. Our model, named LayoutDM, naturally handles the structured layout data in the discrete representation and learns to progressively infer a noiseless layout from the initial input, where we model the layout corruption process by modality-wise discrete diffusion. For conditional generation, we propose to inject layout constraints in the form of masking or logit adjustment during inference. We show in the experiments that our LayoutDM successfully generates high-quality layouts and outperforms both task-specific and task-agnostic baselines on several layout tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Naoto Inoue (15 papers)
  2. Kotaro Kikuchi (8 papers)
  3. Edgar Simo-Serra (25 papers)
  4. Mayu Otani (32 papers)
  5. Kota Yamaguchi (20 papers)
Citations (81)

Summary

We haven't generated a summary for this paper yet.