Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SketchINR: A First Look into Sketches as Implicit Neural Representations (2403.09344v1)

Published 14 Mar 2024 in cs.CV and cs.AI

Abstract: We propose SketchINR, to advance the representation of vector sketches with implicit neural models. A variable length vector sketch is compressed into a latent space of fixed dimension that implicitly encodes the underlying shape as a function of time and strokes. The learned function predicts the $xy$ point coordinates in a sketch at each time and stroke. Despite its simplicity, SketchINR outperforms existing representations at multiple tasks: (i) Encoding an entire sketch dataset into a fixed size latent vector, SketchINR gives $60\times$ and $10\times$ data compression over raster and vector sketches, respectively. (ii) SketchINR's auto-decoder provides a much higher-fidelity representation than other learned vector sketch representations, and is uniquely able to scale to complex vector sketches such as FS-COCO. (iii) SketchINR supports parallelisation that can decode/render $\sim$$100\times$ faster than other learned vector representations such as SketchRNN. (iv) SketchINR, for the first time, emulates the human ability to reproduce a sketch with varying abstraction in terms of number and complexity of strokes. As a first look at implicit sketches, SketchINR's compact high-fidelity representation will support future work in modelling long and complex sketches.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (66)
  1. Deepwriting: Making digital ink editable via deep generative modeling. In CHI, 2018.
  2. Cose: Compositional stroke embeddings. In NeurIPS, 2020.
  3. Abstracting sketches through simple primitives. In ECCV, 2022.
  4. What sketch explainability really means for downstream tasks. In CVPR, 2024a.
  5. Doodle your 3d: From abstract freehand sketches to precise 3d shapes. In CVPR, 2024b.
  6. Pixelor: A competitive sketching ai agent. so you think you can beat me? In SIGGRAPH Asia, 2020.
  7. Vectorization and rasterization: Self-supervised learning for sketch and handwriting. In CVPR, 2021.
  8. Sens: Part-aware sketch-based implicit neural shape modeling. arXiv preprint arXiv:2306.06088, 2023.
  9. Jack E Bresenham. Algorithm for computer control of a digital plotter. IBM Systems journal, 1965.
  10. Deepsvg: A hierarchical generative network for vector graphics animation. In NeurIPS, 2020.
  11. Qualcomm Innovation Center. Ai model efficiency toolkit (aimet). https://github.com/quic/aimet, 2023.
  12. Neural ordinary differential equations. In NeurIPS, 2018.
  13. Fs-coco: Towards understanding of freehand sketches of common objects in context. In ECCV, 2022.
  14. What can human sketches do for object detection? In CVPR, 2023.
  15. Livesketch: Query perturbations for guided sketch-based visual search. In CVPR, 2019.
  16. Béziersketch: A generative model for scalable vector sketches. In ECCV, 2020.
  17. Sketchode: Learning neural sketch representation in continuous time. In ICLR, 2021a.
  18. Cloud2curve: Generation and vectorization of parametric sketches. In CVPR, 2021b.
  19. Chirodiff: Modelling chirographic data with diffusion models. In ICLR, 2023.
  20. A practical guide to splines. Applied Mathematical Sciences, 1978.
  21. Hawq: Hessian aware quantization of neural networks with mixed-precision. In ICCV, 2019.
  22. Implicit Functions and Solution Mappings. Springer-Verlag, 2014.
  23. Algorithms for the reduction of the number of points required to represent a digitized line or its caricature. Cartographica: the international journal for geographic information and geovisualization, 1973.
  24. How do humans sketch objects? ACM TOG, 2012.
  25. Gauss-newton approximation to bayesian learning. In ICNN, 1997.
  26. Synthesizing programs for images using reinforced adversarial learning. In ICML, 2018.
  27. Creative sketch generation. In ICLR, 2021.
  28. A neural representation of sketch drawings. In ICLR, 2018.
  29. Hmq: Hardware friendly mixed precision quantization block for cnns. In ECCV, 2020.
  30. Spaghetti: Editing implicit shapes through part aware generation. ACM TOG, 2022.
  31. Aaron Hertzmann. Why do line drawings work? a realism hypothesis. Perception, 2020.
  32. Sketch-a-classifier: Sketch-based photo classifier generation. In CVPR, 2018.
  33. Sketch-a-segmenter: Sketch-based photo segmenter generation. IEEE TIP, 2020.
  34. Shap-e: Generating conditional 3d implicit functions. arXiv preprint arXiv:2305.02463, 2023.
  35. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013.
  36. Kurt Koffka. Principles of Gestalt psychology. Routledge, 2013.
  37. Picture that sketch: Photorealistic image generation from abstract sketches. In CVPR, 2023.
  38. Free2cad: Parsing freehand drawings into cad commands. ACM TOG, 2022.
  39. Sketch-bert: Learning sketch bidirectional encoder representation from transformers by self-supervised learning of sketch gestalt. In CVPR, 2020.
  40. Microsoft coco: Common objects in context. In ECCV, 2014.
  41. Deflocnet: Deep image editing via flexible low-level controls. In CVPR, 2021.
  42. An efficient algorithm for robust curve fitting using cubic bezier curves. In ICIC, 2010.
  43. Occupancy networks: Learning 3d reconstruction in function space. In CVPR, 2019.
  44. Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 2021.
  45. Jorge J. Moŕe. The levenberg-marquardt algorithm. In Numerical Analysis, 1976.
  46. Learning deep sketch abstraction. In CVPR, 2018.
  47. Single image super-resolution via a dual interactive implicit neural network. In WACV, 2023.
  48. Deepsdf: Learning continuous signed distance functions for shape representation. In CVPR, 2019.
  49. On the spectral bias of neural networks. In ICML, 2019.
  50. Using generative models for handwritten digit recognition. IEEE TPAMI, 1996.
  51. Sketchformer: Transformer-based representation for sketched structure. In CVPR, 2020.
  52. Clip for all things zero-shot sketch-based image retrieval, fine-grained or not. In CVPR, 2023.
  53. The sketchy database: learning to retrieve badly drawn bunnies. ACM TOG, 2016.
  54. Where to draw the line? PLOS One, 2021.
  55. Implicit neural representations for image compression. In ECCV, 2022.
  56. Mixed precision dnns: All you need is a good parametrization. In ICLR, 2020.
  57. Clipasso: Semantically-aware object sketching. ACM TOG, 2022.
  58. Clipascene: Scene sketching with different types and levels of abstraction. In ICCV, 2023.
  59. Sketchembednet: Learning novel concepts by imitating drawings. In ICML, 2021.
  60. Diffsketcher: Text guided vector sketch synthesis through latent diffusion models. In NeurIPS, 2023.
  61. Sketchmate: Deep hashing for million-scale human sketch retrieval. In CVPR, 2018.
  62. Free-form image inpainting with gated convolution. In ICCV, 2019.
  63. Sketch-a-net that beats humans. In BMVC, 2015.
  64. Sketch me that shoe. In CVPR, 2016.
  65. Adding conditional control to text-to-image diffusion models. In ICCV, 2021.
  66. Fast b-spline curve fitting by l-bfgs. Computer Aided Geometric Design, 2012.
Citations (6)

Summary

We haven't generated a summary for this paper yet.

Youtube Logo Streamline Icon: https://streamlinehq.com