Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Learning for Robust and Explainable Models in Computer Vision (2403.18674v1)

Published 27 Mar 2024 in cs.CV

Abstract: Recent breakthroughs in machine and deep learning (ML and DL) research have provided excellent tools for leveraging enormous amounts of data and optimizing huge models with millions of parameters to obtain accurate networks for image processing. These developments open up tremendous opportunities for using AI in the automation and human assisted AI industry. However, as more and more models are deployed and used in practice, many challenges have emerged. This thesis presents various approaches that address robustness and explainability challenges for using ML and DL in practice. Robustness and reliability are the critical components of any model before certification and deployment in practice. Deep convolutional neural networks (CNNs) exhibit vulnerability to transformations of their inputs, such as rotation and scaling, or intentional manipulations as described in the adversarial attack literature. In addition, building trust in AI-based models requires a better understanding of current models and developing methods that are more explainable and interpretable a priori. This thesis presents developments in computer vision models' robustness and explainability. Furthermore, this thesis offers an example of using vision models' feature response visualization (models' interpretations) to improve robustness despite interpretability and robustness being seemingly unrelated in the related research. Besides methodological developments for robust and explainable vision models, a key message of this thesis is introducing model interpretation techniques as a tool for understanding vision models and improving their design and robustness. In addition to the theoretical developments, this thesis demonstrates several applications of ML and DL in different contexts, such as medical imaging and affective computing.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (326)
  1. Defense Advanced Research Projects Agency. Broad agency announcement: Explainable artificial intelligence (XAI). https://www.darpa.mil/attachments/DARPA-BAA-16-53.pdf, 2016. Online; accessed 3 December 2022.
  2. Intelligent driver drowsiness detection for traffic safety based on multi CNN deep model and facial subsampling. IEEE Transactions on Intelligent Transportation Systems, 23(10):19743–19752, 2021.
  3. Threat of adversarial attacks on deep learning in computer vision: A survey. IEEE Access, 6:14410–14430, 2018.
  4. Local gabor binary patterns from three orthogonal planes for automatic facial expression recognition. In Proceedings of the Humaine Association Conference on Affective Computing and Intelligent Interaction, pages 356–361. IEEE, 2013. ISSN: 2156-8111.
  5. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. Journal of big Data, 8(1):1–74, 2021.
  6. Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Medical Informatics and Decision Making, 20(1):310, 2020.
  7. Evaluation methodology for respiratory signal extraction from clinical cone-beam CT (CBCT) using data-driven methods. International Journal of Integrated Engineering, 13(5):1–8, 2021.
  8. Support vector regression of sparse dictionary-based features for view-independent action unit intensity estimation. In Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), pages 854–859. IEEE, IEEE, 2017.
  9. Using radial basis function neural networks for continuous and discrete pain estimation from bio-physiological signals. In Friedhelm Schwenker, Hazem M. Abbas, Neamat El Gayar, and Edmondo Trentin, editors, IAPR Workshop on Artificial Neural Networks in Pattern Recognition, volume 9896, pages 269–284. Springer, Springer, 2016.
  10. Continuous multimodal human affect estimation using echo state networks. In Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, AVEC ’16, pages 67–74. Association for Computing Machinery, 2016.
  11. PrepNet: A convolutional auto-encoder to homogenize CT scans for cross-dataset medical image analysis. In 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), pages 1–7. IEEE, 2021.
  12. Mitigation of motion-induced artifacts in cone beam computed tomography using deep convolutional neural networks. Medical Physics, 50(10):6228–6242, 2023.
  13. Efficient deep cnns for cross-modal automated computer vision under time and space constraints. In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD). ZHAW Zürcher Hochschule für Angewandte Wissenschaften, September 2019.
  14. Radial basis function networks for convolutional neural networks to learn similarity distance metric and improve interpretability. IEEE Access, 8:123087–123097, 2020.
  15. Trace and detect adversarial attacks on CNNs using feature response maps. In Luca Pancioni, Friedhelm Schwenker, and Edmondo Trentin, editors, IAPR Workshop on Artificial Neural Networks in Pattern Recognition, Lecture Notes in Computer Science, pages 346–358. Springer, Springer, 2018.
  16. Two to trust: Automl for safe modelling and interpretable deep learning for robustness. In Trustworthy AI - Integrating Learning, Optimization and Reasoning, pages 268–275, Cham, 2021. Springer International Publishing.
  17. Michael R Anderberg. Cluster Analysis for Applications, volume 19. Elsevier, 2014.
  18. Simultaneous algebraic reconstruction technique (SART): A superior implementation of the ART algorithm. Ultrasonic Imaging, 6(1):81–94, 1984.
  19. Provable bounds for learning some deep representations. In Proceedings of the 31th International Conference on Machine Learning, ICML 2014, Beijing, China, 21-26 June 2014, volume 32 of JMLR Workshop and Conference Proceedings, pages 584–592. JMLR.org, 2014.
  20. Explainable artificial intelligence for autonomous driving: A comprehensive overview and field guide for future research directions. Computing Research Repository (CoRR), abs/2104.10972, 2021.
  21. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations, ICLR, 2015.
  22. Are transformers more robust than CNNs? In M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan, editors, Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), volume 34, pages 26831–26843. Curran Associates, Inc., 2021.
  23. AI fairness 360: An extensible toolkit for detecting and mitigating algorithmic bias. IBM Journal of Research and Development, 63(4):1–15, 2019.
  24. Benchmark analysis of representative deep neural network architectures. IEEE Access, 6:64270–64277, 2018.
  25. CT artifacts: causes and reduction techniques. Imaging in Medicine, 4(2):229–240, 2012.
  26. VisualBackProp: Efficient visualization of CNNs for autonomous driving. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), pages 4701–4708. IEEE, IEEE, 2018.
  27. End to end learning for self-driving cars. Computing Research Repository (CoRR), abs/1604.07316, 2016.
  28. Man is to computer programmer as woman is to homemaker? debiasing word embeddings. In D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett, editors, Proceedings of the Advances in Neural Information Processing Systems (NIPS), volume 29. Curran Associates, Inc., 2016.
  29. Leo Breiman. Random forests. Machine Learning, 45(1):5–32, 2001.
  30. Decision-based adversarial attacks: Reliable attacks against black-box machine learning models. In 6th International Conference on Learning Representations, ICLR. OpenReview.net, 2018.
  31. Multivariable functional interpolation and adaptive networks, complex systems, vol. 2. Complex Systems, 2, 1988.
  32. Language models are few-shot learners. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), volume 33, pages 1877–1901. Curran Associates, Inc., 2020.
  33. Analysis of explainers of black box deep neural networks for computer vision: A survey. Machine Learning and Knowledge Extraction, 3(4):966–989, 2021.
  34. Thorsten M. Buzug. Computed Tomography: From Photon Statistics to Modern Cone-Beam CT. Springer, 2008.
  35. VGGFace2: A dataset for recognising faces across pose and age. In Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pages 67–74. IEEE, IEEE, 2018.
  36. Interpretability of deep learning models: A survey of results. In Proceedings of the IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), pages 1–6. IEEE, IEEE, 2017.
  37. An efficient blood vessel detection algorithm for retinal images using local entropy thresholding. In Proceedings of the International Symposium on Circuits and Systems ISCAS., volume 5, pages V–21–V–24. IEEE, 2003.
  38. AirNet: Fused analytical and iterative reconstruction with deep neural network regularization for sparse‐data CT. Medical Physics, 47(7):2916–2930, 2020.
  39. 4d-AirNet: a temporally-resolved CBCT slice reconstruction method synergizing analytical and iterative method with deep learning. Physics in Medicine & Biology, 65(17):175020, 2020.
  40. Practical identification of NARMAX models using radial basis functions. International Journal of Control, 52(6):1327–1350, 1990.
  41. When deep learning meets metric learning: Remote sensing image scene classification via learning discriminative CNNs. IEEE Transactions on Geoscience and Remote Sensing, 56(5):2811–2821, 2018.
  42. Multi-column deep neural networks for image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 3642–3649. IEEE, 2012.
  43. A committee of neural networks for traffic sign classification. In The 2011 International Joint Conference on Neural Networks, pages 1918–1921. IEEE, 2011. ISSN: 2161-4407.
  44. Houdini: fooling deep structured visual and speech recognition models with adversarial examples. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, pages 6980–6990. Curran Associates Inc., 2017.
  45. Group equivariant convolutional networks. In Proceedings of the 33nd International Conference on Machine Learning (ICML), pages 2990–2999. PMLR, 2016.
  46. Support-vector networks. Machine Learning, 20(3):273–297, 1995.
  47. Jonathan Crabbé and Mihaela van der Schaar. Label-free explainability for unsupervised models. In Proceedings of the International Conference on Machine Learning, ICML, pages 4391–4420, 2022.
  48. Kate Crawford. Artificial intelligence’s white guy problem. https://www.nytimes.com/2016/06/26/opinion/sunday/artificial-intelligences-white-guy-problem.html, 2022. Online; accessed 7 December 2022.
  49. Gabriela Csurka. A comprehensive survey on domain adaptation for visual applications. In Gabriela Csurka, editor, Domain Adaptation in Computer Vision Applications, Advances in Computer Vision and Pattern Recognition, pages 1–35. Springer, 2017.
  50. AutoAugment: Learning augmentation strategies from data. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 113–123. IEEE, 2019.
  51. Radiotherapy respiratory motion management in hepatobiliary and pancreatic malignancies: a systematic review of patient factors influencing effectiveness of motion reduction with abdominal compression. Acta Oncologica, 61(7):833–841, 2022. PMID: 35611555.
  52. The electrodermal system. In John T. Cacioppo, Louis G. Tassinary, and Gary G. Berntson, editors, Handbook of Psychophysiology, pages 217–243. Cambridge University Press, 4 edition, 2017.
  53. The mahalanobis distance. Chemometrics and intelligent laboratory systems, 50(1):1–18, 2000.
  54. Classification of mammograms using convolutional neural network based feature extraction. In International Conference on Information and Communication Technology for Develoment for Africa, pages 89–98. Springer, September 2017.
  55. Classification of mammograms using texture and cnn based extracted features. Journal of Biomimetics, Biomaterials and Biomedical Engineering, 42:79–97, 8 2019.
  56. COVAREP — a collaborative voice analysis repository for speech technologies. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 960–964. IEEE, 2014. ISSN: 2379-190X.
  57. The numerical solution of fokker–planck equation with radial basis functions (RBFs) based on the meshless technique of kansaś approach and galerkin method. Engineering Analysis with Boundary Elements, 47:38–63, 2014.
  58. ImageNet: A large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255. IEEE, 2009.
  59. ArcFace: Additive angular margin loss for deep face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4685–4694. IEEE, 2019.
  60. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186. Association for Computational Linguistics, 2019.
  61. Evaluating reconstruction algorithms for respiratory motion guided acquisition. Physics in Medicine & Biology, 65(17):175009, 2020.
  62. Learning convolutional neural networks in presence of concept drift. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), pages 1–8. IEEE, IEEE, 2019.
  63. Denoising prior driven deep neural network for image restoration. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(10):2305–2318, 2018.
  64. Explainable artificial intelligence: A survey. In Proceedings of the 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pages 0210–0215. IEEE, IEEE, 2018.
  65. An image is worth 16x16 words: Transformers for image recognition at scale. International Conference on Learning Representations, 2021.
  66. A systematic review of robustness in deep learning for computer vision: Mind the gap? Computing Research Repository (CoRR), abs/2112.00639, 2021.
  67. A fast neural beamformer for antenna arrays. In IEEE International Conference on Communications. Conference Proceedings. ICC 2002 (Cat. No.02CH37333), volume 1, pages 139–144. IEEE, IEEE, 2002.
  68. A guide to convolution arithmetic for deep learning. Computing Research Repository (CoRR), abs/1605.07277, 2016.
  69. Neural architecture search: A survey. The Journal of Machine Learning Research, 20(1):1997–2017, 2019.
  70. Evaluation of image quality for different kV cone-beam CT acquisition and reconstruction methods in the head and neck region. Acta Oncologica, 50(6):908–917, 2011.
  71. Forward and cross-scatter estimation in dual source CT using the deep scatter estimation (DSE). In Hilde Bosmans, Guang-Hong Chen, and Taly Gilat Schmidt, editors, Medical Imaging 2019: Physics of Medical Imaging, volume 10948, page 24. International Society for Optics and Photonics, SPIE, 2019.
  72. Visualizing higher-layer features of a deep network. Technical Report 3, University of Montreal, 2009. Online; accessed 7 December 2022.
  73. Learning SO(3) equivariant representations with spherical CNNs. International Journal of Computer Vision, 128(3):588–600, 2019.
  74. Detecting adversarial samples from artifacts. Computing Research Repository (CoRR), abs/1703.00410, 2017.
  75. Practical cone-beam algorithm. Journal of the Optical Society of America A, 1(6):612–619, 1984.
  76. Efficient and robust automated machine learning. In C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett, editors, Proceedings of the Advances in Neural Information Processing Systems (NIPS, volume 28. Curran Associates, Inc., 2015.
  77. Using meta-learning to initialize bayesian optimization of hyperparameters. In Proceedings of the International Conference on Meta-learning and Algorithm Selection - Volume 1201, MLAS’14, pages 3–10. Citeseer, CEUR-WS.org, 2014.
  78. Roger Fletcher. Practical Methods of Optimization. John Wiley & Sons, Ltd, 2000.
  79. Richard Franke. A critical comparison of some methods for interpolation of scattered data. Technical report, NAVAL POSTGRADUATE SCHOOL MONTEREY CA, 1979.
  80. Dennis Gabor. Theory of communication. part 1: The analysis of information. Journal of the Institution of Electrical Engineers - Part III: Radio and Communication Engineering, 93(26):429–441, 1946.
  81. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In Maria-Florina Balcan and Kilian Q. Weinberger, editors, Proceedings of the 33nd International Conference on Machine Learning (ICML), volume 48 of JMLR Workshop and Conference Proceedings, pages 1050–1059. JMLR.org, 2016.
  82. Improvements in CBCT image quality using a novel iterative reconstruction algorithm: A clinical evaluation. Advances in Radiation Oncology, 4(2):390–400, 2019.
  83. Image style transfer using convolutional neural networks. In Proceedings fo the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2414–2423. IEEE, 2016.
  84. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. Computing Research Repository (CoRR), abs/1811.12231, 2018.
  85. Explaining explanations: An overview of interpretability of machine learning. In 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA), pages 80–89. IEEE, 2018.
  86. Metal artifact reduction in CT: Where are we after four decades? IEEE Access, 4:5826–5849, 2016.
  87. How (not) to measure bias in face recognition networks. In Frank-Peter Schilling and Thilo Stadelmann, editors, Proceedings of the IAPR Workshop on Artificial Neural Networks in Pattern Recognition, Lecture Notes in Computer Science, pages 125–137. Springer, Springer International Publishing, 2020.
  88. Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org.
  89. Explaining and harnessing adversarial examples. In Proceeding of International Conference on Learning Representations. arXiv, 2015.
  90. European union regulations on algorithmic decision-making and a “right to explanation”. AI Magazine, 38(3):50–57, 2016.
  91. Safire: Sinogram affirmed iterative reconstruction. https://cdn0.scrvt.com/39b415fb07de4d9656c7b516d8e2d907/1800000000306520/d80046026fd1/ct_SAFIRE_White_Paper_1800000000306520.pdf, 2012. Online; accessed 6 December 2022.
  92. A global taxonomy of interpretable AI: unifying the terminology for the technical and social sciences. Artificial Intelligence Review, pages 1–32, 2022.
  93. On the (statistical) detection of adversarial examples. Computing Research Repository (CoRR), abs/1702.06280, 2017.
  94. Analysis of the AutoML challenge series 2015–2018, 2017.
  95. Deep residual learning for compressed sensing CT reconstruction via persistent homology analysis. Computing Research Repository (CoRR), abs/1611.06391, 2016.
  96. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778. IEEE, 2016.
  97. MRF-based deformable registration and ventilation estimation of lung CT. IEEE Transactions on Medical Imaging, 32(7):1239–1248, 2013.
  98. Deep learning-based simultaneous multi-phase deformable image registration of sparse 4d-cbct. Medical Physics, 49(6):e325–e326, 2022.
  99. Kashmir Hill. Wrongfully accused by an algorithm. https://www.nytimes.com/2020/06/24/technology/facial-recognition-arrest.html, 2020. Online; accessed 6 December 2022.
  100. Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
  101. Deep metric learning using triplet network. In Aasa Feragen, Marcello Pelillo, and Marco Loog, editors, International Workshop on Similarity-Based Pattern Recognition, volume 9370, pages 84–92. Springer, Springer, 2015.
  102. Unsupervised learning and simulation for complexity management in business operations. Applied data science: lessons learned for the data-driven business, pages 313–331, 2019.
  103. Godfrey Hounsfield. Method of and apparatus for examining a body by radiation such as x or gamma radiation. Technical report, Originating Research Org. not identified, 1975.
  104. MobileNets: Efficient convolutional neural networks for mobile vision applications. Computing Research Repository (CoRR), abs/1704.04861, 2017.
  105. Squeeze-and-excitation networks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7132–7141. IEEE, 2018.
  106. Deep transfer metric learning. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 325–333. IEEE, 2015.
  107. A lightweight privacy-preserving CNN feature extraction framework for mobile sensing. IEEE Transactions on Dependable and Secure Computing, 18(3):1441–1455, 2019.
  108. Lung nodule detection in CT using 3d convolutional neural networks. In Proceedings of the IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), pages 379–383. IEEE, 2017.
  109. A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability. Computer Science Review, 37:100270, 2020.
  110. Efficient parallel inflated 3d convolution architecture for action recognition. IEEE Access, 8:45753–45765, 2020.
  111. Lietransformer: Equivariant self-attention for lie groups. In Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, pages 4533–4543. PMLR, 2021.
  112. Herbert Jaeger. The “echo state” approach to analysing and training recurrent neural networks-with an erratum note. Bonn, Germany: German National Research Center for Information Technology GMD Technical Report, 148(34):13, 2001.
  113. Flat-panel cone-beam computed tomography for image-guided radiation therapy. International Journal of Radiation Oncology*Biology*Physics, 53(5):1337–1349, 2002.
  114. Using the iterative kV CBCT reconstruction on the varian halcyon linear accelerator for radiation therapy-planning CT datasets: A feasibility study. International Journal of Radiation Oncology*Biology*Physics, 105(1):E719–E720, 2019.
  115. Deep convolutional neural network for inverse problems in imaging. IEEE Transactions on Image Processing, 26(9):4509–4522, 2017.
  116. Adaptive confidence learning for the personalization of pain intensity estimation systems. Evolving Systems, 8(1):71–83, 2017.
  117. Methods for person-centered continuous pain intensity assessment from bio-physiological channels. IEEE Journal of Selected Topics in Signal Processing, 10(5):854–864, 2016.
  118. Multimodal data fusion for person-independent, continuous estimation of pain intensity. In Lazaros Iliadis and Chrisina Jayne, editors, Proceedings of the International Conference on Engineering Applications of Neural Networks (EANN), Communications in Computer and Information Science, pages 275–285. Springer, 2015.
  119. Stefan Kaczmarz. Angenäherte auflösung von systemen linearer gleichungen. Bulletin International de l’ Académie Polonaise des Sciences et des Lettres, 35:355–357, 1937.
  120. Geometric robustness of deep networks: Analysis and improvement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4441–4449. IEEE, 2018.
  121. On a generalized gaussian radial basis function: Analysis and applications. Engineering Analysis with Boundary Elements, 112:46–57, 2020.
  122. Andrej Karpathy. A recipe for training neural networks. http://karpathy.github.io/2019/04/25/recipe/, 2019. Online; accessed 7 December 2022.
  123. Trustworthy artificial intelligence: A review. ACM Computing Surveys (CSUR), 55(2):39:1–39:38, 2022.
  124. The s-transform using a new window to improve frequency and time resolutions. Signal, image and Video processing, 8(3):533–541, 2014.
  125. High resolution iterative CT reconstruction using graphics hardware. In Proceedings of the IEEE Nuclear Science Symposium Conference Record (NSS/MIC), pages 4035–4040, Oct 2009. ISSN: 1082-3654.
  126. Roland Kehrein. The prosody of authentic emotions. In Proceedings of the International Conference on Speech Prosody, pages 423–426, 2002.
  127. Multimodal fusion including camera photoplethysmography for pain recognition. In 2017 International Conference on Companion Technology (ICCT), pages 1–4, September 2017.
  128. Pain recognition with camera photoplethysmography. In 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA), pages 1–5, November 2017.
  129. Stochastic estimation of the maximum of a regression function. The Annals of Mathematical Statistics, 23(3):462–466, 1952.
  130. Transparency and accountability in AI decision support: Explaining and visualizing convolutional neural networks for text information. Decision Support Systems, 134:113302, 2020.
  131. Learning not to learn: Training deep neural networks with biased data. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9004–9012. IEEE, 2019.
  132. Combining ordered subsets and momentum for accelerated X-ray CT image reconstruction. IEEE Transactions on Medical Imaging, 34(1):167–178, Jan 2015.
  133. Early clinical experience with varian halcyon v2 linear accelerator: Dual‐isocenter IMRT planning and delivery with portal dosimetry for gynecological cancer treatments. Journal of Applied Clinical Medical Physics, 20(11):111–120, 2019.
  134. Fusion architectures for multimodal cognitive load recognition. In Friedhelm Schwenker and Stefan Scherer, editors, Multimodal Pattern Recognition of Social Signals in Human-Computer-Interaction (MPRSS), volume 10183, pages 36–47. Springer, 2016.
  135. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR), 2015.
  136. A U-Nets cascade for sparse view computed tomography. In Florian Knoll, Andreas Maier, and Daniel Rueckert, editors, Proceedings of the First International Workshop on Machine Learning for Medical Image Reconstruction (MLMIR) Held in Conjunction with MICCAI, Lecture Notes in Computer Science, pages 91–99. Springer, 2018.
  137. 3D object representations for fine-grained categorization. In ICCV Workshops, pages 554–561, 2013.
  138. Arvind Krishna. IBM CEO’s letter to congress on racial justice reform. https://www.ibm.com/policy/facial-recognition-sunset-racial-justice-reforms/, 2020. Online; accessed 6 December 2022.
  139. Alex Krizhevsky. Learning multiple layers of features from tiny images. https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf, 2009. Online; accessed 3 December 2022.
  140. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (NIPS), volume 25, pages 1097–1105, 2012.
  141. Miroslav Kubat. Decision trees can initialize radial-basis function networks. IEEE Transactions on Neural Networks, 9(5):813–821, 1998.
  142. A detailed review of feature extraction in image processing systems. In Proceedings of the Fourth International Conference on Advanced Computing & Communication Technologies, pages 5–12. IEEE, 2014. ISSN: 2327-0659.
  143. A dataset and a technique for generalized nuclear segmentation for computational pathology. IEEE Transactions on Medical Imaging, 36(7):1550–1560, 2017.
  144. Adversarial examples in the physical world. In Proceeding of International Conference on Learning Representations. arXiv, 2016.
  145. Survey on semantic segmentation using deep learning techniques. Neurocomputing, 338:321–348, 2019.
  146. Deep learning. Nature, 521(7553):436–444, 2015.
  147. Backpropagation Applied to Handwritten Zip Code Recognition. Neural computation, 1(4):541–551, December 1989.
  148. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
  149. Illumination-aware faster r-CNN for robust multispectral pedestrian detection. Pattern Recognition, 85:161–171, 2019.
  150. Hyperband: A novel bandit-based approach to hyperparameter optimization. The Journal of Machine Learning Research, 18(185):1–52, 2018.
  151. Adversarial examples detection in deep networks with convolutional filter statistics. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 5775–5783. IEEE, 2017.
  152. Epileptic seizure detection in EEG signals using sparse multiscale radial basis function networks and the fisher vector approach. Knowledge-Based Systems, 164:96–106, 2019.
  153. Detecting adversarial image examples in deep neural networks with adaptive noise reduction. IEEE Transactions on Dependable and Secure Computing, 18(1):72–85, 2018.
  154. ADN: Artifact disentanglement network for unsupervised metal artifact reduction. IEEE Transactions on Medical Imaging, 39(3):634–643, 2020.
  155. Relaxed conditions for radial-basis function networks to be universal approximators. Neural Networks, 16(7):1019–1028, 2003.
  156. Fast autoaugment. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Proceedings of the Advances in Neural Information Processing Systems (NIPS), volume 32. Curran Associates, Inc., 2019.
  157. DuDoNet: Dual domain network for CT metal artifact reduction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10504–10513. IEEE, 2019.
  158. Roberta: A robustly optimized BERT pretraining approach. Computing Research Repository (CoRR), abs/1907.11692, 2019.
  159. CNN architectures for large-scale audio classification. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 131–135. IEEE, 2017.
  160. Towards automated deep learning: Analysis of the AutoDL challenge series 2019. In Proceedings of the NeurIPS Competition and Demonstration Track, pages 242–252. PMLR, PMLR, 2020.
  161. Steve Lohr. Facial recognition is accurate, if you’re a white guy. https://www.nytimes.com/2018/02/09/technology/facial-recognition-race-artificial-intelligence.html, 2018. Online; accessed 6 December 2022.
  162. Decoupled weight decay regularization. In Proceedings of the 7th International Conference on Learning Representations (ICLR), pages 1–18, 2019.
  163. David G Lowe. Object recognition from local scale-invariant features. In Proceedings of the Seventh IEEE International Conference on Computer Vision, volume 2, pages 1150–1157. IEEE, IEEE, 1999.
  164. David G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91–110, 2004.
  165. SafetyNet: Detecting and rejecting adversarial examples robustly. In IEEE International Conference on Computer Vision (ICCV), pages 446–454. IEEE, 2017.
  166. Cine cardiac MRI motion artifact reduction using a recurrent neural network. IEEE Transactions on Medical Imaging, 40(8):2170–2181, 2021.
  167. U-DuDoNet: Unpaired dual-domain network for CT metal artifact reduction. In Marleen de Bruijne, Philippe C. Cattin, Stéphane Cotin, Nicolas Padoy, Stefanie Speidel, Yefeng Zheng, and Caroline Essert, editors, Medical Image Computing and Computer Assisted Intervention (MICCAI), Lecture Notes in Computer Science, pages 296–306. Springer International Publishing, 2021.
  168. Encoding metal mask projection for metal artifact reduction in computed tomography. In Proceedings of the 23rd International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), pages 147–157. Springer, Springer-Verlag, 2020.
  169. Ryan Mac. Facebook apologizes after a.i. puts ‘primates’ label on video of black men. https://www.nytimes.com/2021/09/03/technology/facebook-ai-race-primates.html, 2021. Online; accessed 6 December 2022.
  170. Towards deep learning models resistant to adversarial attacks. In Proceedings of the 6th International Conference on Learning Representations (ICLR). OpenReview.net, 2018.
  171. Learning with known operators reduces maximum error bounds. Nature Machine Intelligence, 1(8):373–380, 2019.
  172. Deep scatter estimation (DSE): feasibility of using a deep convolutional neural network for real-time x-ray scatter prediction in cone-beam CT. SPIE Medical Imaging, 10573:56, 2018.
  173. Fine-grained visual classification of aircraft. Computing Research Repository (CoRR), abs/1306.5151, 2013.
  174. Evaluation and clinical application of a commercially available iterative reconstruction algorithm for CBCT-based IGRT. Technology in Cancer Research & Treatment, 18, 2019. PMID: 30803367.
  175. Interpretable models for granger causality using self-explaining neural networks. In Proceedings of the 9th International Conference on Learning Representations (ICLR), 2021.
  176. Rotation equivariant vector field networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), pages 5058–5067. IEEE, 2017.
  177. Deep face recognition: A survey. In 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pages 471–478. IEEE, 2018.
  178. Deep face recognition: A survey. In Proceedings of the 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pages 471–478. IEEE, IEEE, 2018.
  179. A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6):1–35, 2021.
  180. Learning neural models for end-to-end clustering. In Artificial Neural Networks in Pattern Recognition, pages 126–138, Cham, 2018. Springer International Publishing.
  181. MagNet: A two-pronged defense against adversarial examples. In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, pages 135–147. ACM, 2017.
  182. Face recognition with radial basis function (RBF) neural networks. IEEE Transactions on Neural Networks, 13(3):697–710, 2002.
  183. On detecting adversarial perturbations. In 5th International Conference on Learning Representations, ICLR. OpenReview.net, 2017.
  184. Universal adversarial perturbations against semantic image segmentation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), pages 2774–2783. IEEE, 2017.
  185. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In Proceedings of the Fourth International Conference on 3D Vision (3DV), pages 565–571. IEEE, 2016.
  186. Fast learning in networks of locally-tuned processing units. Neural computation, 1(2):281–294, 1989.
  187. Universal adversarial perturbations. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 86–94. IEEE, 2017.
  188. DeepFool: A simple and accurate method to fool deep neural networks. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2574–2582. IEEE, 2016.
  189. Voxceleb: A large-scale speaker identification dataset. Comput. Speech Lang., 60:2616–2620, 2017.
  190. Yu Nesterov. Smooth minimization of non-smooth functions. Mathematical Programming, 103(1):127–152, 2005.
  191. Single-shot 3d hand pose estimation using radial basis function networks trained on synthetic data. Pattern Analysis and Applications, 23(1):415–428, 2020.
  192. Using maximum entropy for text classification. In Proceedings of the IJCAI-99 workshop on machine learning for information filtering, volume 1, pages 61–67, 1999.
  193. Automated flower classification over a large number of classes. In Sixth Indian Conference on Computer Vision, Graphics & Image Processing, pages 722–729. IEEE, 2008.
  194. Feature visualization. Distill, 2(11):e7, 2017.
  195. The building blocks of interpretability. Distill, 3(3):e10, 2018.
  196. Automating biomedical data science through tree-based pipeline optimization. In Giovanni Squillero and Paolo Burelli, editors, Proceedings of the European Conference on the Applications of Evolutionary Computation, Lecture Notes in Computer Science, pages 123–137. Springer, Springer, 2016.
  197. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22(10):1345–1359, 2009.
  198. Transferability in machine learning: from phenomena to black-box attacks using adversarial samples. Computing Research Repository (CoRR), abs/1605.07277, 2016.
  199. CT sinogram‐consistency learning for metal‐induced beam hardening correction. Medical Physics, 45(12):5376–5384, 2018.
  200. Cats and dogs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3498–3505. IEEE, 2012. ISSN: 1063-6919.
  201. PyTorch: an imperative style, high-performance deep learning library. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Proceedings of the 33rd International Conference on Neural Information Processing Systems (NeurIPS), pages 8024–8035. Curran Associates, Inc., 2019.
  202. Iterative image reconstruction in image-guided radiation therapy. https://patentscope.wipo.int/search/en/detail.jsf?docId=WO2017066248, November 16 2021. US Patent 11,173,324.
  203. Deep learning methods for image guidance in radiation therapy. In Frank-Peter Schilling and Thilo Stadelmann, editors, Proceedings of the 9th IAPR Artificial Neural Networks in Pattern Recognition Workshop (ANNPR), volume 12294 of Lecture Notes in Computer Science, pages 3–22. Springer, 2020.
  204. Convolutional network based motion artifact reduction in cone-beam CT. In AAPM annual meeting 2019, e-Poster, volume 46, pages E340–E341. WILEY , USA, 2019.
  205. Visually guided inspiration breath-hold facilitated with nasal high flow therapy in locally advanced lung cancer. Acta Oncologica, 60(5):567–574, 2021. PMID: 33295823.
  206. Roger Penrose. A generalized inverse for matrices. Mathematical Proceedings of the Cambridge Philosophical Society, 51(3):406–413, 1955.
  207. Reducing residual-motion artifacts in iterative 3d CBCT reconstruction in image-guided radiation therapy. Medical Physics, 48(10):6497–6507, 2021.
  208. Networks for approximation and learning. Proceedings of the IEEE, 78(9):1481–1497, 1990.
  209. The circumplex model of affect: An integrative approach to affective neuroscience, cognitive development, and psychopathology. Development and Psychopathology, 17(3), 2005.
  210. Searching for activation functions. Computing Research Repository (CoRR), abs/1710.05941, 2017.
  211. Foolbox: A python toolbox to benchmark the robustness of machine learning models. In Reliable Machine Learning in the Wild Workshop. JMLR, 2017.
  212. On the interpretability of artificial intelligence in radiology: Challenges and opportunities. Radiology: Artificial Intelligence, 2(3):e190043, 2020.
  213. ImageNet-21K pretraining for the masses. Computing Research Repository (CoRR), abs/2104.10972, 2021.
  214. Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data. Pattern Recognition Letters, 66:22–30, 2015.
  215. Introducing the RECOLA multimodal corpus of remote collaborative and affective interactions. In Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), pages 1–8. IEEE, 2013.
  216. A stochastic approximation method. The Annals of Mathematical Statistics, 22(3):400–407, 1951.
  217. Face recognition: Too bias, or not too bias? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 1–10. IEEE, 2020.
  218. Group equivariant stand-alone self-attention for vision. In Proceedings of the 9th International Conference on Learning Representations (ICLR). OpenReview.net, 2021.
  219. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention (MICCAI). Springer, 2015.
  220. Why, who, what, when and how about explainability in human-agent systems. In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS ’20, pages 2161–2164. International Foundation for Autonomous Agents and Multiagent Systems, 2020.
  221. Dictionaries for sparse representation modeling. Proceedings of the IEEE, 98(6):1045–1057, 2010.
  222. Cynthia Rudin. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5):206–215, 2019.
  223. ImageNet large scale visual recognition challenge. International Journal of Computer Vision, 115(3):211–252, 2015.
  224. MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4510–4520. IEEE, 2018.
  225. Analyzing and increasing the reliability of convolutional neural networks on GPUs. IEEE Transactions on Reliability, 68(2):663–677, 2018.
  226. Convolutional neural networks in medical image understanding: a survey. Evolutionary Intelligence, 15(1):1–22, 2021.
  227. APAC: Augmented PAttern classification with neural networks. Computing Research Repository (CoRR), abs/1505.03229, 2015.
  228. On the information bottleneck theory of deep learning. Journal of Statistical Mechanics: Theory and Experiment, 2019(12):124020, 2019.
  229. Jürgen Schmidhuber. Deep learning in neural networks: An overview. Neural Networks, 61:85–117, 2015.
  230. Simulation-based deep artifact correction with convolutional neural networks for limited angle artifacts. Zeitschrift für Medizinische Physik, 29(2):150–161, 2019.
  231. Artefacts in CBCT: a review. Dentomaxillofacial Radiology, 40(5):265–273, 2011.
  232. Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing, 45(11):2673–2681, 1997.
  233. Initialisation of radial basis function networks using classification trees. Neural Network World, 10(3):473–482, 2000.
  234. Three learning phases for radial-basis-function networks. Neural Networks, 14(4):439–458, 2001.
  235. Similarities of LVQ and RBF learning-a survey of learning rules and the application to the classification of signals from high-resolution electrocardiography. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, volume 1, pages 646–651. IEEE, IEEE, 1994.
  236. Practices for engineering trustworthy machine learning applications. In Proceedings of the IEEE/ACM 1st Workshop on AI Engineering - Software Engineering for AI (WAIN), pages 97–100. IEEE, IEEE, 2021.
  237. Sensitive loss: Improving accuracy and fairness of face representations with discrimination-aware deep learning. Artificial Intelligence, 305:103682, 2022.
  238. Self-attention with relative position representations. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, volume 2, pages 464–468. Association for Computational Linguistics, 2018.
  239. Intraclass correlations: Uses in assessing rater reliability. Psychological Bulletin, 86(2):420–428, 1979.
  240. Opening the black box of deep neural networks via information. Computing Research Repository (CoRR), abs/1703.00810, 2017.
  241. Best practices for convolutional neural networks applied to visual document analysis. In Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR)., volume 1, pages 958–963. IEEE Comput. Soc, 2003.
  242. Very deep convolutional networks for large-scale image recognition. In 5th International Conference on Learning Representations, ICLR. OpenReview.net, 2015.
  243. John R Smith. Ibm research releases ‘diversity in faces’ dataset to advance study of fairness in facial recognition systems. https://www.ibm.com/blogs/research/2019/01/diversity-in-faces/, 2019. Online; accessed 7 December 2022.
  244. A large dataset of real patients CT scans for COVID-19 identification, 2020. Type: dataset.
  245. Deep metric learning via lifted structured feature embedding. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 4004–4012. IEEE, 2016.
  246. Striving for simplicity: The all convolutional net. In Proceedings of the 3rd International Conference on Learning Representations, {ICLR}. OpenReview.net, 2015.
  247. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research (JMLR), 15(56):1929–1958, 2014.
  248. Highway networks. Computing Research Repository (CoRR), abs/1505.00387, 2015.
  249. Deep learning in the wild. In Luca Pancioni, Friedhelm Schwenker, and Edmondo Trentin, editors, Proceedings of the IAPR Workshop on Artificial Neural Networks in Pattern Recognition (ANNPR), Lecture Notes in Computer Science, pages 17–38. Springer, Springer, 2018.
  250. Beyond ImageNet: Deep learning in industrial practice. In Martin Braschler, Thilo Stadelmann, and Kurt Stockinger, editors, Applied Data Science, pages 205–232. Springer, 2019.
  251. Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. Journal of Global Optimization, 11(4):341–359, 1997.
  252. Segmenter: Transformer for semantic segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 7242–7252. IEEE, 2021.
  253. RoFormer: Enhanced transformer with rotary position embedding. Computing Research Repository (CoRR), abs/2104.09864, 2021.
  254. One pixel attack for fooling deep neural networks. IEEE Transactions on Evolutionary Computation, 23(5):828–841, 2019.
  255. Learning a convolutional neural network for non-uniform motion blur removal. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 769–777. IEEE, 2015.
  256. Deep learning face representation by joint identification-verification. In Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K.Q. Weinberger, editors, Proceedings of the Advances in Neural Information Processing Systems NIPS, volume 27. Curran Associates, Inc., 2014.
  257. Deeply learned face representations are sparse, selective, and robust. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2892–2900. IEEE, 2015.
  258. Sparsifying neural network connections for face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 4856–4864. IEEE, 2016.
  259. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1–9. IEEE, 2015.
  260. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2818–2826. IEEE, 2016.
  261. Intriguing properties of neural networks. In Proceeding of International Conference on Learning Representations. arXiv, 2014.
  262. MnasNet: Platform-aware neural architecture search for mobile. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2815–2823. IEEE, 2019.
  263. Efficientnet: Rethinking model scaling for convolutional neural networks. In Kamalika Chaudhuri and Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML, volume 97 of Proceedings of Machine Learning Research, pages 6105–6114. PMLR, 2019.
  264. Multi-modal pain intensity recognition based on the senseemotion database. IEEE Transactions on Affective Computing, 12(3):743–760, 2021.
  265. Multimodal deep denoising convolutional autoencoders for pain intensity classification based on physiological signals. In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods, pages 289–296. SCITEPRESS - Science and Technology Publications, 2020.
  266. Robert Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1):267–288, 1996.
  267. A survey on explainable artificial intelligence (XAI): Toward medical XAI. IEEE Transactions on Neural Networks and Learning Systems, 32(11):4793–4813, 2021.
  268. Training data-efficient image transformers & distillation through attention. In Proceedings of the 38th International Conference on Machine Learning (ICML), pages 10347–10357. PMLR, 2021.
  269. Adversarial training and robustness for multiple perturbations. In Proceedings of the 33rd International Conference on Neural Information Processing Systems (NeurIPS), pages 5866–5876. Curran Associates Inc., 2019.
  270. Design patterns for resource-constrained automated deep-learning methods. AI, 1(4):510–538, 2020.
  271. Automated machine learning in practice: State of the art and recent results. In Proceedings of the 6th Swiss Conference on Data Science (SDS), pages 31–36. IEEE, IEEE, 2019.
  272. Heang K. Tuy. An inversion formula for cone-beam reconstruction. SIAM Journal on Applied Mathematics, 43(3):546–552, 1983.
  273. Instance normalization: The missing ingredient for fast stylization. Computing Research Repository (CoRR), abs/1607.08022, 2016.
  274. AVEC 2016: Depression, mood, and emotion recognition workshop and challenge. In Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, pages 3–10. ACM, 2016.
  275. FERA 2017 - addressing head pose in the third facial expression recognition and analysis challenge. In Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), pages 839–847. IEEE, IEEE, 2017.
  276. Laurens Van der Maaten and Geoffrey Hinton. Visualizing non-metric similarities in multiple maps. Machine Learning, 87(1):33–55, 2012.
  277. Joaquin Vanschoren. Meta-learning. In Frank Hutter, Lars Kotthoff, and Joaquin Vanschoren, editors, Automated Machine Learning: Methods, Systems, Challenges, The Springer Series on Challenges in Machine Learning, pages 35–61. Springer, 2019.
  278. Attention is all you need. In I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors, Proceedings of the Advances in Neural Information Processing Systems (NIPS), volume 30. Curran Associates, Inc., 2017.
  279. Making machine learning models interpretable. In Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), pages 163–172, 2012.
  280. Fairness definitions explained. In Proceedings of the IEEE/ACM International Workshop on Software Fairness, pages 1–7. IEEE, ACM, 2018.
  281. The biovid heat pain database data for the advancement and systematic validation of an automated pain recognition system. In Proceedings of the IEEE International Conference on Cybernetics (CYBCO), pages 128–131. IEEE, 2013.
  282. Regularization of neural networks using DropConnect. In Proceedings of the 30th International Conference on Machine Learning, pages 1058–1066. PMLR, 2013.
  283. PFW: A face database in the wild for studying face identification and verification in uncontrolled environment. In Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, pages 356–360, 2008. ISSN: 0730-6512.
  284. CosFace: Large margin cosine loss for deep face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5265–5274. IEEE, 2018.
  285. InDuDoNet: An interpretable dual domain network for CT metal artifact reduction. In Proceedings of the 24th International Conference Medical Image Computing and Computer Assisted Intervention (MICCAI), pages 107–118. Springer, Springer-Verlag, 2021.
  286. Deep metric learning with angular loss. In IEEE International Conference on Computer Vision (ICCV), pages 2612–2620. IEEE, 2017.
  287. Deep visual domain adaptation: A survey. Neurocomputing, 312:135–153, 2018.
  288. Racial faces in the wild: Reducing racial bias by information maximization adaptation network. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 692–702. IEEE, 2019.
  289. Medical image segmentation using deep learning: A survey. IET Image Processing, 16(5):1243–1267, 2020.
  290. Multiple sclerosis identification by 14-layer convolutional neural network with batch normalization, dropout, and stochastic pooling. Frontiers in Neuroscience, 12:818, 2018.
  291. IDOL-net: An interactive dual-domain parallel network for CT metal artifact reduction. IEEE Transactions on Radiation and Plasma Medical Sciences, 6(8):874–885, 2022.
  292. Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4):600–612, 2004.
  293. Metal artifact reduction using iterative CBCT reconstruction algorithm for head and neck radiation therapy: A phantom and clinical study. European Journal of Radiology, 132, 2020.
  294. Bias, awareness, and ignorance in deep-learning-based face recognition. AI and Ethics, 2(3):509–522, 2022.
  295. Learning steerable filters for rotation equivariant CNNs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 849–858. IEEE, 2018.
  296. Caltech-UCSD birds 200. Report or Paper CNS-TR-2010-001, California Institute of Technology, 2010.
  297. CNNs on surfaces using rotation-equivariant features. ACM Transactions on Graphics, 39(4):92–1, 2020.
  298. A survey of unsupervised deep domain adaptation. ACM Transactions on Intelligent Systems and Technology (TIST), 11(5):1–46, 2020.
  299. Deep cosine metric learning for person re-identification. In IEEE Winter Conference on Applications of Computer Vision (WACV), pages 748–756. IEEE, IEEE, 2018.
  300. Why we should have seen that coming: comments on microsoft’s tay “experiment,” and wider implications. The ORBIT Journal, 1(2):1–12, 2017.
  301. CBAM: Convolutional block attention module. In Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, and Yair Weiss, editors, Proceedings of the European conference on Computer Vision (ECCV), Lecture Notes in Computer Science, pages 3–19. Springer, 2018.
  302. Deep learning computed tomography: Learning projection-domain weights from image domain in limited angle problems. IEEE Transactions on Medical Imaging, 37(6):1454–1463, 2018.
  303. Distance metric learning with application to clustering with side-information. In Advances in Neural Information Processing Systems, volume 15, pages 521–528. MIT Press, 2003.
  304. Xuehan Xiong and Fernando De la Torre. Supervised descent method and its applications to face alignment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 532–539, 2013. ISSN: 1063-6919.
  305. FairGAN: Fairness-aware generative adversarial networks. In Proceedings of the IEEE International Conference on Big Data (Big Data), pages 570–575. IEEE, IEEE, 2018.
  306. Feature squeezing: Detecting adversarial examples in deep neural networks. In Proceedings 2018 Network and Distributed System Security Symposium. Internet Society, 2018.
  307. Fast and accurate image super resolution by deep CNN with skip connection and network in network. In Derong Liu, Shengli Xie, Yuanqing Li, Dongbin Zhao, and El-Sayed M. El-Alfy, editors, Proceedings of the International Conference on Neural Information Processing (NIPS, volume 10635, pages 217–225. Springer, Springer, 2017.
  308. Deep lesion graphs in the wild: Relationship learning and organization of significant radiology image findings in a diverse large-scale lesion database. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9261–9270. IEEE, 2018.
  309. Unbox the black-box for the medical explainable AI via multi-modal and multi-centre data fusion: A mini-review, two showcases and beyond. Information Fusion, 77:29–52, 2022.
  310. Facial expression recognition based on facial action unit. In Proceedings of the Tenth International Green and Sustainable Computing Conference (IGSC), pages 1–6. IEEE, 2019.
  311. COVID-CT-dataset: A CT scan dataset about COVID-19. Computing Research Repository (CoRR), abs/2003.13865, 2020.
  312. Initial evaluation of a novel cone-beam CT-based semi-automated online adaptive radiotherapy system for head and neck cancer treatment - a timing and automation quality study. Cureus, 12(8):e9660, 2020.
  313. Exploring racial bias within face recognition via per-subject adversarially-enabled data augmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 83–92. IEEE, 2020.
  314. Explainability of deep vision-based autonomous driving systems: Review and challenges. International Journal of Computer Vision, 130(10):2425–2452, 2022.
  315. Visualizing and understanding convolutional networks. In David Fleet, Tomas Pajdla, Bernt Schiele, and Tinne Tuytelaars, editors, Proceedings of the European Conference on Computer Vision (ECCV), Lecture Notes in Computer Science, pages 818–833. Springer International Publishing, 2014.
  316. Efficient convolutions for real-time semantic segmentation of 3d point clouds. In Proceedings of the International Conference on 3D Vision (3DV), pages 399–408. IEEE, 2018.
  317. Visual interpretability for deep learning: a survey. Frontiers of Information Technology & Electronic Engineering, 19(1):27–39, 2018.
  318. Convolutional neural network based metal artifact reduction in x-ray computed tomography. IEEE Transactions on Medical Imaging, 37(6):1370–1381, 2018.
  319. Road extraction by deep residual U-Net. IEEE Geoscience and Remote Sensing Letters, 15(5):749–753, 2018.
  320. A sparse-view CT reconstruction method based on combination of DenseNet and deconvolution. IEEE Transactions on Medical Imaging, 37(6):1407–1417, 2018.
  321. Prediction of interfacial interactions related with membrane fouling in a membrane bioreactor based on radial basis function artificial neural network (ANN). Bioresource Technology, 282:262–268, 2019.
  322. Statistical debugging of sampled programs. In S. Thrun, L. Saul, and B. Schölkopf, editors, Proceedings of the Advances in Neural Information Processing Systems (NIPS), volume 16. MIT Press, 2003.
  323. Pose-independent facial action unit intensity regression based on multi-task deep transfer learning. In Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), pages 872–877. IEEE, IEEE, 2017.
  324. Multi-label CNN based pedestrian attribute learning for soft biometrics. In International Conference on Biometrics (ICB), pages 535–540. IEEE, 2015.
  325. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), pages 2242–2251. IEEE, 2017.
  326. Learning transferable architectures for scalable image recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8697–8710. IEEE, 2018.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com