Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms (2401.01200v1)

Published 2 Jan 2024 in cs.CV and cs.AI

Abstract: Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) using most image and clinical data of the lesion. These sources of information present limitations due to their inability to provide information of the molecular structure of the lesion. NIR spectroscopy may provide an alternative source of information to automated CAD of skin lesions. The most commonly used techniques and classification algorithms used in spectroscopy are Principal Component Analysis (PCA), Partial Least Squares - Discriminant Analysis (PLS-DA), and Support Vector Machines (SVM). Nonetheless, there is a growing interest in applying the modern techniques of machine and deep learning (MDL) to spectroscopy. One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate the best performance obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions in vivo using NIR spectral data.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Data augmentation using principal component resampling for image recognition by deep learning, in: Artificial Intelligence and Soft Computing: 19th International Conference, ICAISC 2020, Zakopane, Poland, October 12-14, 2020, Proceedings, Part II 19, Springer. pp. 39–48.
  2. Optuna: A next-generation hyperparameter optimization framework, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Association for Computing Machinery, New York, NY, USA. p. 2623–2631.
  3. Finding reduced Raman spectroscopy fingerprint of skin samples for melanoma diagnosis through machine learning. Artificial Intelligence in Medicine 120, 102161.
  4. Standard normal variate transformation and de-trending of near-infrared diffuse reflectance spectra. Applied Spectroscopy 43, 772–777.
  5. Algorithms for hyper-parameter optimization. Advances in Neural Information Processing Systems 24.
  6. A training algorithm for optimal margin classifiers, in: Proceedings of the 5th Annual Workshop on Computational Learning Theory, pp. 144–152.
  7. Pattern recognition in chemometrics. Chemometrics and Intelligent Laboratory Systems 149, 90–96.
  8. SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16, 321–357.
  9. Xgboost: A scalable tree boosting system. Knowledge Disc. Data Mining , 785–794.
  10. Stochastic gradient boosting. Computational Statistics & Data Analysis 38, 367–378.
  11. Melanoma diagnosis by Raman spectroscopy and neural networks: Structure alterations in proteins and lipids in intact cancer tissue. Journal of Investigative Dermatology 122, 443–449.
  12. Deep Learning. MIT Press. http://www.deeplearningbook.org.
  13. Generative adversarial nets. Advances in Neural Information Processing Systems 27.
  14. Diagnosis of dementias using partial least squares discriminant analysis. dementia geriatric cognit disorders. Journal of Big Data 6(2), 83–8.
  15. Catboost for big data: an interdisciplinary review. Journal of Big Data 7, 2196–1115.
  16. LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30.
  17. A smartphone based application for skin cancer classification using deep learning with clinical images and lesion information. arXiv preprint arXiv:2104.14353 .
  18. Deep convolutional neural networks for Raman spectrum recognition: a unified solution. The Analyst 142, 4067–4074.
  19. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 30 (NIP 2017) , 4765–4774.
  20. One-dimensional convolutional neural networks for spectroscopic signal regression. Journal of Chemometrics 32, e2977.
  21. Near-infrared spectroscopy for dermatological applications. Vibrational Spectroscopy 28, 53–58.
  22. Tutorial: multivariate classification for vibrational spectroscopy in biological samples. Nature Protocols 15, 2143–2162.
  23. Rectified linear units improve restricted Boltzmann machines. International Conference on Machine Learning , 807–814.
  24. The influence of training sample size on the accuracy of deep learning models for the prediction of soil properties with near-infrared spectroscopy data. SOIL 6, 565–578.
  25. The impact of patient clinical information on automated skin cancer detection. Computers in Biology and Medicine 116, 103545.
  26. PAD-UFES-20: a skin lesion dataset composed of patient data and clinical images collected from smartphones. Data in Brief 32, 106221.
  27. Deep adversarial data augmentation for biomedical spectroscopy: Application to modelling raman spectra of bone. Chemometrics and Intelligent Laboratory Systems 228, 104634.
  28. Catboost: unbiased boosting with categorical features. arXiv:1706.09516.
  29. Applications of machine learning in spectroscopy. Applied Spectroscopy Reviews 56, 733–763.
  30. Learning representations by back-propagating errors. Nature 323, 533–536.
  31. Noise in biological Raman spectroscopy, in: International Conference on Noise and Fluctuations (ICNF), IEEE. pp. 1–6.
  32. Infrared spectroscopy: fundamentals and applications. John Wiley & Sons.
  33. Quantitative analysis modeling of infrared spectroscopy based on ensemble convolutional neural networks. Chemometrics and Intelligent Laboratory Systems 181, 1–10.
  34. Application of XGBoost algorithm in the detection of SARS-CoV-2 using Raman spectroscopy. Journal of Physics: Conference Series 1775, 012007.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (15)
Citations (5)

Summary

We haven't generated a summary for this paper yet.