Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Real-time Localized Photorealistic Video Style Transfer (2010.10056v1)

Published 20 Oct 2020 in cs.CV

Abstract: We present a novel algorithm for transferring artistic styles of semantically meaningful local regions of an image onto local regions of a target video while preserving its photorealism. Local regions may be selected either fully automatically from an image, through using video segmentation algorithms, or from casual user guidance such as scribbles. Our method, based on a deep neural network architecture inspired by recent work in photorealistic style transfer, is real-time and works on arbitrary inputs without runtime optimization once trained on a diverse dataset of artistic styles. By augmenting our video dataset with noisy semantic labels and jointly optimizing over style, content, mask, and temporal losses, our method can cope with a variety of imperfections in the input and produce temporally coherent videos without visual artifacts. We demonstrate our method on a variety of style images and target videos, including the ability to transfer different styles onto multiple objects simultaneously, and smoothly transition between styles in time.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Xide Xia (13 papers)
  2. Tianfan Xue (62 papers)
  3. Zheng Sun (92 papers)
  4. Abby Chang (1 paper)
  5. Brian Kulis (33 papers)
  6. Jiawen Chen (24 papers)
  7. Wei-Sheng Lai (29 papers)
Citations (28)

Summary

We haven't generated a summary for this paper yet.