Interpretable Mechanistic Representations for Meal-level Glycemic Control in the Wild (2312.03344v1)
Abstract: Diabetes encompasses a complex landscape of glycemic control that varies widely among individuals. However, current methods do not faithfully capture this variability at the meal level. On the one hand, expert-crafted features lack the flexibility of data-driven methods; on the other hand, learned representations tend to be uninterpretable which hampers clinical adoption. In this paper, we propose a hybrid variational autoencoder to learn interpretable representations of CGM and meal data. Our method grounds the latent space to the inputs of a mechanistic differential equation, producing embeddings that reflect physiological quantities, such as insulin sensitivity, glucose effectiveness, and basal glucose levels. Moreover, we introduce a novel method to infer the glucose appearance rate, making the mechanistic model robust to unreliable meal logs. On a dataset of CGM and self-reported meals from individuals with type-2 diabetes and pre-diabetes, our unsupervised representation discovers a separation between individuals proportional to their disease severity. Our embeddings produce clusters that are up to 4x better than naive, expert, black-box, and pure mechanistic features. Our method provides a nuanced, yet interpretable, embedding space to compare glycemic control within and across individuals, directly learnable from in-the-wild data.
- American Diabetes Association Professional Practice Committee. 6. Glycemic Targets: Standards of Medical Care in Diabetes—2022. Diabetes Care, 45(Supplement_1):S83–S96, December 2021. ISSN 0149-5992. 10.2337/dc22-S006. URL https://doi.org/10.2337/dc22-S006.
- Exaggerated Hyperglycemia After A Pizza Meal in Well-Controlled Diabetes. Diabetes Care, 16(4):578–580, April 1993. ISSN 0149-5992. 10.2337/diacare.16.4.578. URL https://doi.org/10.2337/diacare.16.4.578.
- Mechanistic models versus machine learning, a fight worth fighting for the biological community? Biology Letters, 14(5):20170660, May 2018. 10.1098/rsbl.2017.0660. URL https://royalsocietypublishing.org/doi/10.1098/rsbl.2017.0660. Publisher: Royal Society.
- Personalized Postprandial Glucose Response–Targeting Diet Versus Mediterranean Diet for Glycemic Control in Prediabetes. Diabetes Care, 44(9):1980–1991, July 2021. ISSN 0149-5992. 10.2337/dc21-0162. URL https://doi.org/10.2337/dc21-0162.
- Recommendations for Standardizing Glucose Reporting and Analysis to Optimize Clinical Decision Making in Diabetes: The Ambulatory Glucose Profile. Journal of Diabetes Science and Technology, 7(2):562–578, March 2013. ISSN 1932-2968. 10.1177/193229681300700234. URL https://doi.org/10.1177/193229681300700234. Publisher: SAGE Publications Inc.
- Glucose Management Indicator (GMI): A New Term for Estimating A1C From Continuous Glucose Monitoring. Diabetes Care, 41(11):2275–2280, September 2018. ISSN 0149-5992. 10.2337/dc18-1581. URL https://doi.org/10.2337/dc18-1581.
- Identification of a Minimal Model of Glucose Disappearance for Estimating Insulin Sensitivity. IFAC Proceedings Volumes, 12(8):883–890, September 1979. ISSN 1474-6670. 10.1016/S1474-6670(17)65505-8. URL https://www.sciencedirect.com/science/article/pii/S1474667017655058.
- Human postprandial responses to food and potential for precision nutrition. Nature Medicine, 26(6):964–973, June 2020. ISSN 1546-170X. 10.1038/s41591-020-0934-0. URL https://www.nature.com/articles/s41591-020-0934-0. Number: 6 Publisher: Nature Publishing Group.
- Variational Inference: A Review for Statisticians. Journal of the American Statistical Association, 112(518):859–877, April 2017. ISSN 0162-1459. 10.1080/01621459.2017.1285773. URL https://doi.org/10.1080/01621459.2017.1285773. Publisher: Taylor & Francis _eprint: https://doi.org/10.1080/01621459.2017.1285773.
- Generating Sentences from a Continuous Space. In Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, pages 10–21, Berlin, Germany, August 2016. Association for Computational Linguistics. 10.18653/v1/K16-1002. URL https://aclanthology.org/K16-1002.
- Glycaemic index methodology. Nutrition Research Reviews, 18(1):145–171, June 2005. ISSN 1475-2700, 0954-4224. 10.1079/NRR2005100. URL https://www.cambridge.org/core/journals/nutrition-research-reviews/article/glycaemic-index-methodology/4AEE3FFEBBA327530129C2381907D4F2. Publisher: Cambridge University Press.
- Beyond HbA1c: using continuous glucose monitoring metrics to enhance interpretation of treatment effect and improve clinical decision-making. Diabetic Medicine, 36(6):679–687, 2019. ISSN 1464-5491. 10.1111/dme.13944. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/dme.13944.
- Model identification using stochastic differential equation grey-box models in diabetes. Journal of Diabetes Science and Technology, 7(2):431–440, March 2013. ISSN 1932-2968. 10.1177/193229681300700220.
- Bayesian parameter estimation in the oral minimal model of glucose dynamics from non-fasting conditions using a new function of glucose appearance. Computer Methods and Programs in Biomedicine, 200:105911, March 2021. ISSN 0169-2607. 10.1016/j.cmpb.2020.105911. URL https://www.sciencedirect.com/science/article/pii/S0169260720317442.
- Population-level management of type 1 diabetes via continuous glucose monitoring and algorithm-enabled patient prioritization: Precision health meets population health. Pediatric Diabetes, 22(7):982–991, 2021. ISSN 1399-5448. 10.1111/pedi.13256. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/pedi.13256.
- Type 2 diabetes: one disease, many pathways. American Journal of Physiology-Endocrinology and Metabolism, 319(2):E410–E426, August 2020. ISSN 0193-1849. 10.1152/ajpendo.00512.2019. URL https://journals.physiology.org/doi/full/10.1152/ajpendo.00512.2019. Publisher: American Physiological Society.
- Glucotypes reveal new patterns of glucose dysregulation. PLOS Biology, 16(7):e2005143, July 2018. ISSN 1545-7885. 10.1371/journal.pbio.2005143. URL https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.2005143. Publisher: Public Library of Science.
- Array programming with NumPy. Nature, 585(7825):357–362, September 2020. 10.1038/s41586-020-2649-2. URL https://doi.org/10.1038/s41586-020-2649-2. Publisher: Springer Science and Business Media LLC.
- A Simple Robust Method for Estimating the Glucose Rate of Appearance from Mixed Meals. Journal of Diabetes Science and Technology, 6(1):153–162, January 2012. ISSN 1932-2968. 10.1177/193229681200600119. URL https://doi.org/10.1177/193229681200600119. Publisher: SAGE Publications Inc.
- beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. November 2016. URL https://openreview.net/forum?id=Sy2fzU9gl.
- A quantitative description of membrane current and its application to conduction and excitation in nerve. The Journal of Physiology, 117(4):500–544, 1952. ISSN 1469-7793. 10.1113/jphysiol.1952.sp004764. URL https://onlinelibrary.wiley.com/doi/abs/10.1113/jphysiol.1952.sp004764.
- Roman Hovorka. Closed-loop insulin delivery: from bench to clinical practice. Nature Reviews Endocrinology, 7(7):385–395, July 2011. ISSN 1759-5037. 10.1038/nrendo.2011.32. URL https://www.nature.com/articles/nrendo.2011.32. Number: 7 Publisher: Nature Publishing Group.
- Pancreatic β𝛽\betaitalic_β-Cell Responsiveness during Meal Tolerance Test: Model Assessment in Normal Subjects and Subjects with Newly Diagnosed Noninsulin-Dependent Diabetes Mellitus1. The Journal of Clinical Endocrinology & Metabolism, 83(3):744–750, March 1998. ISSN 0021-972X. 10.1210/jcem.83.3.4646. URL https://doi.org/10.1210/jcem.83.3.4646.
- Partitioning glucose distribution/transport, disposal, and endogenous production during IVGTT. American Journal of Physiology-Endocrinology and Metabolism, 282(5):E992–E1007, May 2002. ISSN 0193-1849. 10.1152/ajpendo.00304.2001. URL https://journals.physiology.org/doi/full/10.1152/ajpendo.00304.2001. Publisher: American Physiological Society.
- J. D. Hunter. Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3):90–95, 2007. 10.1109/MCSE.2007.55. Publisher: IEEE COMPUTER SOC.
- Neural Pharmacodynamic State Space Modeling. In Proceedings of the 38th International Conference on Machine Learning, pages 4500–4510. PMLR, July 2021. URL https://proceedings.mlr.press/v139/hussain21a.html. ISSN: 2640-3498.
- Unsupervised Feature Extraction by Time-Contrastive Learning and Nonlinear ICA. In Advances in Neural Information Processing Systems, volume 29. Curran Associates, Inc., 2016. URL https://proceedings.neurips.cc/paper/2016/hash/d305281faf947ca7acade9ad5c8c818c-Abstract.html.
- Identification of intraday metabolic profiles during closed-loop glucose control in individuals with type 1 diabetes. Journal of Diabetes Science and Technology, 3(5):1047–1057, September 2009. ISSN 1932-2968. 10.1177/193229680900300508.
- Physics-informed machine learning. Nature Reviews Physics, 3(6):422–440, June 2021. ISSN 2522-5820. 10.1038/s42254-021-00314-5. URL https://www.nature.com/articles/s42254-021-00314-5. Number: 6 Publisher: Nature Publishing Group.
- CGMap: Characterizing continuous glucose monitor data in thousands of non-diabetic individuals. Cell Metabolism, 35(5):758–769.e3, May 2023. ISSN 1550-4131. 10.1016/j.cmet.2023.04.002. URL https://www.sciencedirect.com/science/article/pii/S1550413123001298.
- Adam: A Method for Stochastic Optimization. 2015. URL https://arxiv.org/abs/1412.6980.
- Auto-Encoding Variational Bayes. December 2013. URL https://openreview.net/forum?id=33X9fd2-9FyZd.
- Boris P. Kovatchev. Metrics for glycaemic control — from HbA1c to continuous glucose monitoring. Nature Reviews Endocrinology, 13(7):425–436, July 2017. ISSN 1759-5037. 10.1038/nrendo.2017.3. URL https://www.nature.com/articles/nrendo.2017.3. Bandiera_abtest: a Cg_type: Nature Research Journals Number: 7 Primary_atype: Reviews Publisher: Nature Publishing Group Subject_term: Diagnostic devices;Outcomes research;Prognostic markers;Type 1 diabetes;Type 2 diabetes Subject_term_id: diagnostic-devices;outcomes-research;prognostic-markers;type-1-diabetes-mellitus;type-2-diabetes-mellitus.
- Adherence to Ketogenic and Mediterranean Study Diets in a Crossover Trial: The Keto–Med Randomized Trial. Nutrients, 13(3):967, March 2021. ISSN 2072-6643. 10.3390/nu13030967. URL https://www.mdpi.com/2072-6643/13/3/967. Number: 3 Publisher: Multidisciplinary Digital Publishing Institute.
- Stochastic differential equations as a tool to regularize the parameter estimation problem for continuous time dynamical systems given discrete time measurements. Mathematical Biosciences, 251:54–62, May 2014. ISSN 0025-5564. 10.1016/j.mbs.2014.03.001. URL https://www.sciencedirect.com/science/article/pii/S0025556414000510.
- M. Levine and A. Stuart. A Framework for Machine Learning of Model Error in Dynamical Systems. ArXiv, 2021.
- The UVA/PADOVA Type 1 Diabetes Simulator: New Features. Journal of Diabetes Science and Technology, 8(1):26–34, January 2014. ISSN 1932-2968. 10.1177/1932296813514502. URL https://doi.org/10.1177/1932296813514502. Publisher: SAGE Publications Inc.
- A Novel Approach to Continuous Glucose Analysis Utilizing Glycemic Variation. Diabetes Technology & Therapeutics, 7(2):253–263, April 2005. ISSN 1520-9156. 10.1089/dia.2005.7.253. URL https://www.liebertpub.com/doi/10.1089/dia.2005.7.253. Publisher: Mary Ann Liebert, Inc., publishers.
- Learning Insulin-Glucose Dynamics in the Wild. In Proceedings of the 5th Machine Learning for Healthcare Conference, pages 172–197. PMLR, September 2020. URL https://proceedings.mlr.press/v126/miller20a.html. ISSN: 2640-3498.
- Breiman’s Two Cultures: You Don’t Have to Choose Sides. Observational Studies, 7(1):161–169, 2021. ISSN 2767-3324. 10.1353/obs.2021.0003. URL https://muse.jhu.edu/article/799728. Publisher: University of Pennsylvania Press.
- Day-to-day variation of continuously monitored glycaemia: A further measure of diabetic instability. Diabetologia, 8(5):342–348, November 1972. ISSN 1432-0428. 10.1007/BF01218495. URL https://doi.org/10.1007/BF01218495.
- PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019. URL https://proceedings.neurips.cc/paper_files/paper/2019/hash/bdbca288fee7f92f2bfa9f7012727740-Abstract.html.
- Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.
- Glycemic Variability Percentage: A Novel Method for Assessing Glycemic Variability from Continuous Glucose Monitor Data. Diabetes Technology & Therapeutics, 20(1):6–16, January 2018. ISSN 1520-9156. 10.1089/dia.2017.0187. URL https://www.liebertpub.com/doi/full/10.1089/dia.2017.0187. Publisher: Mary Ann Liebert, Inc., publishers.
- Teamwork, Targets, Technology, and Tight Control in Newly Diagnosed Type 1 Diabetes: the Pilot 4T Study. The Journal of Clinical Endocrinology & Metabolism, 107(4):998–1008, April 2022. ISSN 0021-972X. 10.1210/clinem/dgab859. URL https://doi.org/10.1210/clinem/dgab859.
- Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease Progression. In Advances in Neural Information Processing Systems, volume 34, pages 11364–11383. Curran Associates, Inc., 2021. URL https://proceedings.neurips.cc/paper/2021/hash/5ea1649a31336092c05438df996a3e59-Abstract.html.
- Stochastic Backpropagation and Approximate Inference in Deep Generative Models. In Proceedings of the 31st International Conference on Machine Learning, pages 1278–1286. PMLR, June 2014. URL https://proceedings.mlr.press/v32/rezende14.html. ISSN: 1938-7228.
- Modelling endogenous insulin concentration in type 2 diabetes during closed-loop insulin delivery. BioMedical Engineering OnLine, 14(1):19, March 2015. ISSN 1475-925X. 10.1186/s12938-015-0009-5. URL https://doi.org/10.1186/s12938-015-0009-5.
- H. Sakoe and S. Chiba. Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 26(1):43–49, February 1978. ISSN 0096-3518. 10.1109/TASSP.1978.1163055. Conference Name: IEEE Transactions on Acoustics, Speech, and Signal Processing.
- Mean Amplitude of Glycemic Excursions, a Measure of Diabetic Instability. Diabetes, 19(9):644–655, September 1970. ISSN 0012-1797. 10.2337/diab.19.9.644. URL https://doi.org/10.2337/diab.19.9.644.
- Learning Structured Output Representation using Deep Conditional Generative Models. In Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc., 2015. URL https://papers.nips.cc/paper_files/paper/2015/hash/8d55a249e6baa5c06772297520da2051-Abstract.html.
- Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 15(56):1929–1958, 2014. ISSN 1533-7928. URL http://jmlr.org/papers/v15/srivastava14a.html.
- Potential Identification of Type 2 Diabetes with Elevated Insulin Clearance. NEJM Evidence, 1(4):EVIDoa2100052, March 2022. 10.1056/EVIDoa2100052. URL https://evidence.nejm.org/doi/full/10.1056/EVIDoa2100052. Publisher: Massachusetts Medical Society.
- Physics-Integrated Variational Autoencoders for Robust and Interpretable Generative Modeling. May 2021. URL https://openreview.net/forum?id=0p0gt1Pn2Gv.
- Tslearn, a machine learning toolkit for time series data. Journal of Machine Learning Research, 21(118):1–6, 2020. URL http://jmlr.org/papers/v21/20-091.html.
- Doubly Stochastic Variational Bayes for non-Conjugate Inference. In Proceedings of the 31st International Conference on Machine Learning, pages 1971–1979. PMLR, June 2014. URL https://proceedings.mlr.press/v32/titsias14.html. ISSN: 1938-7228.
- Python reference manual. Centrum voor Wiskunde en Informatica Amsterdam, 1995.
- J. M. Varah. A Spline Least Squares Method for Numerical Parameter Estimation in Differential Equations. SIAM Journal on Scientific and Statistical Computing, 3(1):28–46, March 1982. ISSN 0196-5204. 10.1137/0903003. URL https://epubs.siam.org/doi/10.1137/0903003. Publisher: Society for Industrial and Applied Mathematics.
- SciPy 1.0: Fundamental algorithms for scientific computing in python. Nature Methods, 17:261–272, 2020. 10.1038/s41592-019-0686-2.
- Learning absorption rates in glucose-insulin dynamics from meal covariates. In NeurIPS 2022 workshop on learning from time series for health, 2022. URL https://openreview.net/forum?id=cbn7xvCCq6e.
- Robust Hybrid Learning With Expert Augmentation. Transactions on Machine Learning Research, October 2022. ISSN 2835-8856. URL https://openreview.net/forum?id=oe4dl4MCGY.
- CGM Metrics Identify Dysglycemic States in Participants From the TrialNet Pathway to Prevention Study. Diabetes Care, 46(3):526–534, February 2023. ISSN 0149-5992. 10.2337/dc22-1297. URL https://doi.org/10.2337/dc22-1297.
- J. M. Wójcicki. “J”-Index. A New Proposition of the Assessment of Current Glucose Control in Diabetic Patients. Hormone and Metabolic Research, 27(1):41–42, January 1995. ISSN 0018-5043, 1439-4286. 10.1055/s-2007-979906. URL http://www.thieme-connect.de/DOI/DOI?10.1055/s-2007-979906. Publisher: © Georg Thieme Verlag Stuttgart · New York.
- Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting. In International Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=kmG8vRXTFv.
- TS2Vec: Towards Universal Representation of Time Series. arXiv:2106.10466 [cs], September 2021. URL http://arxiv.org/abs/2106.10466. arXiv: 2106.10466.
- Personalized Nutrition by Prediction of Glycemic Responses. Cell, 163(5):1079–1094, November 2015. ISSN 0092-8674, 1097-4172. 10.1016/j.cell.2015.11.001. URL https://www.cell.com/cell/abstract/S0092-8674(15)01481-6. Publisher: Elsevier.