Assessing the Performance of 1D-Convolution Neural Networks to Predict Concentration of Mixture Components from Raman Spectra (2306.16621v1)
Abstract: An emerging application of Raman spectroscopy is monitoring the state of chemical reactors during biologic drug production. Raman shift intensities scale linearly with the concentrations of chemical species and thus can be used to analytically determine real-time concentrations using non-destructive light irradiation in a label-free manner. Chemometric algorithms are used to interpret Raman spectra produced from complex mixtures of bioreactor contents as a reaction evolves. Finding the optimal algorithm for a specific bioreactor environment is challenging due to the lack of freely available Raman mixture datasets. The RaMix Python package addresses this challenge by enabling the generation of synthetic Raman mixture datasets with controllable noise levels to assess the utility of different chemometric algorithm types for real-time monitoring applications. To demonstrate the capabilities of this package and compare the performance of different chemometric algorithms, 48 datasets of simulated spectra were generated using the RaMix Python package. The four tested algorithms include partial least squares regression (PLS), a simple neural network, a simple convolutional neural network (simple CNN), and a 1D convolutional neural network with a ResNet architecture (ResNet). The performance of the PLS and simple CNN model was found to be comparable, with the PLS algorithm slightly outperforming the other models on 83\% of the data sets. The simple CNN model outperforms the other models on large, high noise datasets, demonstrating the superior capability of convolutional neural networks compared to PLS in analyzing noisy spectra. These results demonstrate the promise of CNNs to automatically extract concentration information from unprocessed, noisy spectra, allowing for better process control of industrial drug production. Code for this project is available at github.com/DexterAntonio/RaMix.
- Therapeutic antibodies: successes, limitations and hopes for the future: Therapeutic antibodies: an update. 157(2):220–233.
- Recent advances in the field of anti-cancer immunotherapy. 3:280–288.
- The process defines the product: what really matters in biosimilar design and production? 56:iv14–iv29.
- Challenges of glycosylation analysis and control: an integrated approach to producing optimal and consistent therapeutic drugs. 21(5):740–765.
- Toward biotherapeutic product real-time quality monitoring. 39(3):289–305.
- Raman spectroscopy as a process analytical technology for pharmaceutical manufacturing and bioprocessing. 409(3):637–649.
- Challenges and opportunities in biopharmaceutical manufacturing control. 110:106–114.
- Development of novel bioreactor control systems based on smart sensors and actuators. 8:7.
- Quantitative monitoring of yeast fermentation using raman spectroscopy. 406(20):4911–4919.
- Optimization of raman-spectrum baseline correction in biological application. 141(8):2396–2404.
- Modified PCA and PLS: Towards a better classification in raman spectroscopy–based biological applications. 34(4):1–10.
- Deep convolutional neural networks for raman spectrum recognition: A unified solution.
- Cross decomposition — scikit-learn 0.24.2 documentation.
- A review of deep learning approaches for image analysis. In 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), pages 709–714. IEEE.
- Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.
- Tensorflow: Large-scale machine learning on heterogeneous distributed systems, 2015.
- Applications of raman spectroscopy in biopharmaceutical manufacturing: A short review. 71(6):1085–1116. Publisher: SAGE Publications Ltd STM.
- Michael S. Lineshapes in IR and raman spectroscopy: A primer. 30(11):42–46. Publisher: MJH Life Sciences.
- Baseline correction for raman spectra using an improved asymmetric least squares method. 6(12):4402–4407. Publisher: The Royal Society of Chemistry.
- Baseline correction using adaptive iteratively reweighted penalized least squares. 135(5):1138–1146. Publisher: The Royal Society of Chemistry.