What Sketch Explainability Really Means for Downstream Tasks (2403.09480v1)
Abstract: In this paper, we explore the unique modality of sketch for explainability, emphasising the profound impact of human strokes compared to conventional pixel-oriented studies. Beyond explanations of network behavior, we discern the genuine implications of explainability across diverse downstream sketch-related tasks. We propose a lightweight and portable explainability solution -- a seamless plugin that integrates effortlessly with any pre-trained model, eliminating the need for re-training. Demonstrating its adaptability, we present four applications: highly studied retrieval and generation, and completely novel assisted drawing and sketch adversarial attacks. The centrepiece to our solution is a stroke-level attribution map that takes different forms when linked with downstream tasks. By addressing the inherent non-differentiability of rasterisation, we enable explanations at both coarse stroke level (SLA) and partial stroke level (P-SLA), each with its advantages for specific downstream tasks.
- Sanity checks for saliency maps. In NeurIPS, 2018.
- Debugging tests for model explanations. arXiv preprint arXiv:2011.05429, 2020.
- Advances in adversarial attacks and defenses in computer vision: A survey. arXiv preprint arXiv:2108.00401, 2021.
- Prompting for discovery: Flexible sense-making for ai art-making with dreamsheets. arXiv preprint arXiv:2310.09985, 2023.
- Towards better understanding of gradient-based attribution methods for deep neural networks. arXiv preprint arXiv:1711.06104, 2017a.
- A unified view of gradient-based attribution methods for deep neural networks. In NIPSW, 2017b.
- On the robustness of semantic segmentation models to adversarial attacks. In CVPR, 2018.
- On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PloS one, 2015.
- Sketchinr: A first look into sketches as implicit neural representations. In CVPR, 2024a.
- Doodle your 3d: From abstract freehand sketches to precise 3d shapes. In CVPR, 2024b.
- Pixelor: A competitive sketching ai agent. so you think you can sketch? ACM TOG, 2020a.
- Sketch less for more: On-the-fly fine-grained sketch based image retrieval. In CVPR, 2020b.
- More photos are all you need: Semi-supervised learning for fine-grained sketch based image retrieval. In CVPR, 2021.
- Doodle it yourself: Class incremental learning by drawing a few sketches. In CVPR, 2022a.
- Sketching without worrying: Noise-tolerant sketch-based image retrieval. In CVPR, 2022b.
- B-cos: Networks: Alignment is all we need for interpretability. In CVPR, 2022.
- Approximating cnns with bag-of-local-features models works surprisingly well on imagenet. arXiv preprint arXiv:1904.00760, 2019.
- Look and think twice: Capturing top-down visual attention with feedback convolutional neural networks. In ICCV, 2015.
- This looks like that: deep learning for interpretable image recognition. In NeurIPS, 2019.
- Deepfacedrawing: Deep generation of face images from sketches. ACM TOG, 2020a.
- Sketchygan: Towards diverse and realistic sketch to image synthesis. In ICCV, 2018.
- One-shot adversarial attacks on visual tracking with dual attention. In CVPR, 2020b.
- Partially does it: Towards scene-level fg-sbir with partial input. In CVPR, 2022a.
- Fs-coco: Towards understanding of freehand sketches of common objects in context. In ECCV, 2022b.
- Garment ideation: Iterative view-aware sketch-based garment modeling. In 3DV, 2022c.
- Livesketch: Query perturbations for guided sketch-based visual search. In CVPR, 2019.
- Real time image saliency for black box classifiers. In NeurIPS, 2017.
- Béziersketch: A generative model for scalable vector sketches. In ECCV, 2020.
- Doodle to search: Practical zero-shot sketch-based image retrieval. In CVPR, 2019.
- How important is a neuron? arXiv preprint arXiv:1805.12233, 2018.
- Sharcs: Shared concept space for explainable multimodal learning. arXiv preprint arXiv:2307.00316, 2023.
- Semantically tied paired cycle consistency for zero-shot sketch-based image retrieval. In CVPR, 2019.
- How do humans sketch objects? ACM TOG, 2012.
- Visualizing higher-layer features of a deep network. Technical Report, Univeristé de Montréal, 2009.
- What i cannot predict, i do not understand: A human-centered evaluation framework for explainability methods. arXiv preprint arXiv:2112.04417, 2021.
- Understanding deep networks via extremal perturbations and smooth masks. In ICCV, 2019.
- Interpretable explanations of black boxes by meaningful perturbation. In ICCV, 2017.
- Sketchycoco: Image generation from freehand scene sketches. In CVPR, 2020.
- Sketch2mesh: Reconstructing and editing 3d shapes from sketches. In CVPR, 2021.
- A neural representation of sketch drawings. In ICLR, 2018.
- Deep residual learning for image recognition. In CVPR, 2016.
- Segmentations-leak: Membership inference attacks and defenses in semantic image segmentation. In ECCV, 2020.
- Aaron Hertzmann. Why do line drawings work? a realism hypothesis. Perception, 2020.
- Supportingreferenceimageryfordigitaldrawing. In ICCVW, 2021.
- Sketch-a-segmenter: Sketch-based photo segmenter generation. IEEE TIP, 2020.
- Study of ratling scales for subjective quality assessment of high definition video. IEEE TBC, 2010.
- Categorical reparameterization with gumbel-softmax. In ICLR, 2017.
- Fooling detection alone is not enough: Adversarial attack against multiple object tracking. In ICLR, 2020.
- Bayesian safe policy learning with chance constrained optimization: Application to military security assessment during the vietnam war. arXiv preprint arXiv:2307.08840, 2023.
- Scaling up gans for text-to-image synthesis. In CVPR, 2023.
- Xrai: Better attributions through regions. In ICCV, 2019.
- Guided integrated gradients: An adaptive path method for removing noise. In CVPR, 2021.
- A style-based generator architecture for generative adversarial networks. In CVPR, 2019.
- Hive: Evaluating the human interpretability of visual explanations. In ECCV, 2022.
- The (un) reliability of saliency methods. In Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, 2019.
- Picture that sketch: Photorealistic image generation from abstract sketches. In CVPR, 2023.
- Principles of explanatory debugging to personalize interactive machine learning. In IUI, 2015a.
- Principles of explanatory debugging to personalize interactive machine learning. In IUI, 2015b.
- An evaluation of the human-interpretability of explanation. arXiv preprint arXiv:1902.00006, 2019.
- Photo-sketching: Inferring contour drawings from images. In WACV, 2019.
- Towards transferable targeted attack. In CVPR, 2020.
- Zero-shot everything sketch-based image retrieval, and in explainable style. In CVPR, 2023.
- Deflocnet: Deep image editing via flexible low-level controls. In CVPR, 2021.
- Deep sketch hashing: Fast free-hand sketch-based image retrieval. In CVPR, 2017.
- T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models. arXiv preprint arXiv:2302.08453, 2023.
- Neural prototype trees for interpretable fine-grained image recognition. In CVPR, 2021.
- Multifaceted feature visualization: Uncovering the different types of features learned by each neuron in deep neural networks. ICMLW, 2016.
- Feature visualization. Distill, 2017.
- Generalising fine-grained sketch-based image retrieval. In CVPR, 2019.
- Rise: Randomized input sampling for explanation of black-box models. arXiv preprint arXiv:1806.07421, 2018.
- Adversarial imaging pipelines. In CVPR, 2021.
- Sketchxai: A first look at explainability for human sketches. In CVPR, 2023.
- Learning transferable visual models from natural language supervision. In ICML, 2021.
- Towards better understanding attribution methods. In CVPR, 2022.
- “why should i trust you?” explaining the predictions of any classifier. In KDD, 2016.
- High-resolution image synthesis with latent diffusion. In CVPR, 2022.
- Cynthia Rudin. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 2019.
- Stylemeup: Towards style-agnostic sketch-based image retrieval. In CVPR, 2021.
- Clip for all things zero-shot sketch-based image retrieval, fine-grained or not. In CVPR, 2023.
- Evaluating the visualization of what a deep neural network has learned. IEEE TNNLS, 2016.
- The sketchy database: learning to retrieve badly drawn bunnies. ACM TOG, 2016.
- Pronimal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
- Grad-cam: Visual explanations from deep networks via gradient-based localisation. In ICCV, 2017a.
- Grad-cam: Visual explanations from deep networks via gradient-based localization. In ICCV, 2017b.
- Do input gradients highlight discriminative features? In NeurIPS, 2021.
- Image information and visual quality. IEEE TIP, 2006.
- Zero-shot sketch-image hashing. In ICCV, 2018.
- Learning important features through propagating activation differences. In ICML, 2017.
- Deep inside convolutional networks: Visualising image classification models and saliency maps. In ICLRW, 2014.
- Do users benefit from interpretable vision? a user study, baseline, and dataset. arXiv preprint arXiv:2204.11642, 2022.
- Smoothgrad: removing noise by adding noise. arXiv preprint arXiv:1706.03825, 2017.
- Striving for simplicity: The all convolutional net. arXiv preprint arXiv:1412.6806, 2014.
- Striving for simplicity: The all convolutional net. In ICLRW, 2015.
- Axiomatic attribution for deep networks. In ICML, 2017.
- Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199, 2013a.
- Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199, 2013b.
- Diversity can be transferred: Output diversification for white- and black-box attacks. In NeurIPS, 2020.
- Sketch-guided object localization in natural images. In ECCV, 2020.
- Physically realizable adversarial examples for lidar object detection. In CVPR, 2020.
- Evaluation of post-hoc interpretability methods in time-series classification. Nature Machine Intelligence, 2023.
- Sketch your own gan. In CVPR, 2021.
- Learning and using the arrow of time. In CVPR, 2018.
- On the safety of interpretable machine learning: A maximum deviation approach. In NeurIPS, 2022.
- Sketchhairsalon: deep sketch-based hair image synthesis. ACM TOG, 2021.
- Sketchmate: Deep hashing for million-scale human sketch retrieval. In CVPR, 2018.
- Domain disentangled generative adversarial network for zero-shot sketch-based 3d shape retrieval. AAAI, 2022.
- Interactive liquid splash modeling by user sketches. ACM TOG, 2020.
- Semantic hierarchy emerges in deep generative representations for scene synthesis. IJCV, 2021.
- On the (in) fidelity and sensitivity of explanations. In NeurIPS, 2019.
- A zero-shot framework for sketch based image retrieval. In ECCV, 2018.
- Animating portrait line drawings from a single face photo and a speech signal. In ACM SIGGRAPH, 2022.
- Free-form image inpainting with gated convolution. In CVPR, 2019.
- Sketch-a-net that beats humans. In BMVC, 2015.
- Sketch me that shoe. In CVPR, 2016.
- Visualizing and understanding convolutional networks. In ECCV, 2014.
- Sketchedit: Mask-free local image manipulation with partial sketches. In CVPR, 2022.
- Towards adversarially robust object detection. In ICCV, 2019.
- Sketchnet: Sketch classification with web images. In CVPR, 2016.
- Top-down neural attention by excitation backprop. IJCV, 2018.
- Adding conditional control to text-to-image diffusion models. In ICCV, 2023.
- Sketch2model: View-aware 3d modeling from single free-hand sketches. In CVPR, 2021.
- Learning deep features for discriminative localization. In CVPR, 2016.
- Lg-gan: Label guided adversarial network for flexible targeted attack of point cloud-based deep networks. In CVPR, 2020.
- Sketchyscene: Richly-annotated scene sketches. In ECCV, 2018.