
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention (2210.12381v3)

Published 22 Oct 2022 in cs.CV

Abstract: Transformers have recently been integrated into style transfer for their proficiency in establishing long-range dependencies, albeit at the expense of weaker local modeling. This paper introduces the Strips Window Attention Transformer (S2WAT), a novel hierarchical vision transformer designed for style transfer. S2WAT computes attention in windows of different shapes to capture both short- and long-range dependencies. These dependencies are then merged by the "Attn Merge" strategy, which adaptively determines spatial weights based on their relevance to the target. Extensive experiments on representative datasets show the proposed method's effectiveness compared to state-of-the-art (SOTA) transformer-based and other approaches. The code and pre-trained models are available at https://github.com/AlienZhang1996/S2WAT.
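
The idea in the abstract, attention computed inside windows of several shapes and then merged with weights that depend on relevance to the target, can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it is one plausible reading under stated assumptions: three window shapes (horizontal strips, vertical strips, small squares), single-head attention with no learned projections, and a merge that weights each candidate by a softmax over its dot product with the input token. All function names and the `strip` and `square` parameters are hypothetical.

```python
import math

def window_ids(height, width, win_h, win_w):
    """Give every (row, col) token the id of the win_h x win_w window it falls in."""
    cols = math.ceil(width / win_w)
    return [[(r // win_h) * cols + (c // win_w) for c in range(width)]
            for r in range(height)]

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def window_attention(tokens, ids):
    """Scaled dot-product self-attention restricted to tokens sharing a window id.
    Assumption: queries, keys and values are the raw tokens (no projections)."""
    height, width, dim = len(tokens), len(tokens[0]), len(tokens[0][0])
    groups = {}
    for r in range(height):
        for c in range(width):
            groups.setdefault(ids[r][c], []).append((r, c))
    out = [[None] * width for _ in range(height)]
    for members in groups.values():
        for r, c in members:
            q = tokens[r][c]
            scores = [dot(q, tokens[rr][cc]) / math.sqrt(dim) for rr, cc in members]
            weights = softmax(scores)
            out[r][c] = [sum(w * tokens[rr][cc][d]
                             for w, (rr, cc) in zip(weights, members))
                         for d in range(dim)]
    return out

def attn_merge(target, candidates):
    """Assumed merge rule: at each position, weight every candidate by the
    softmax of its dot product with the target token, then sum."""
    height, width, dim = len(target), len(target[0]), len(target[0][0])
    merged = [[None] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            weights = softmax([dot(target[r][c], cand[r][c]) for cand in candidates])
            merged[r][c] = [sum(w * cand[r][c][d] for w, cand in zip(weights, candidates))
                            for d in range(dim)]
    return merged

def strips_window_block(tokens, strip=2, square=2):
    """Run the three window shapes and merge them (hypothetical sizes)."""
    height, width = len(tokens), len(tokens[0])
    horizontal = window_attention(tokens, window_ids(height, width, strip, width))
    vertical = window_attention(tokens, window_ids(height, width, height, strip))
    local = window_attention(tokens, window_ids(height, width, square, square))
    return attn_merge(tokens, [horizontal, vertical, local])
```

Calling `strips_window_block(grid)` on an `H x W` grid of feature vectors (nested lists) returns a grid of the same shape. The strips span the full width or height and so carry long-range context along one axis, while the square windows stay local; the merge lets each position lean on whichever candidate best matches its input token.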
