Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
133 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Unified Review of Deep Learning for Automated Medical Coding (2201.02797v5)

Published 8 Jan 2022 in cs.CL and cs.IR

Abstract: Automated medical coding, an essential task for healthcare operation and delivery, makes unstructured data manageable by predicting medical codes from clinical documents. Recent advances in deep learning and natural language processing have been widely applied to this task. However, deep learning-based medical coding lacks a unified view of the design of neural network architectures. This review proposes a unified framework to provide a general understanding of the building blocks of medical coding models and summarizes recent advanced models under the proposed framework. Our unified framework decomposes medical coding into four main components, i.e., encoder modules for text feature extraction, mechanisms for building deep encoder architectures, decoder modules for transforming hidden representations into medical codes, and the usage of auxiliary information. Finally, we introduce the benchmarks and real-world usage and discuss key research challenges and future directions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (214)
  1. Amina Adadi and Mohammed Berrada. 2018. Peeking inside the black-box: a survey on explainable artificial intelligence (XAI). IEEE Access 6 (2018), 52138–52160.
  2. BERT for Long Documents: A Case Study of Automated ICD Coding. In Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis (LOUHI), Alberto Lavelli, Eben Holderness, Antonio Jimeno Yepes, Anne-Lyse Minard, James Pustejovsky, and Fabio Rinaldi (Eds.). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Hybrid), 100–107. https://doi.org/10.18653/v1/2022.louhi-1.12
  3. Problems and barriers during the process of clinical coding: a focus group study of coders’ perceptions. Journal of medical systems 44, 3 (2020), 1–8.
  4. Publicly Available Clinical BERT Embeddings. In Proceedings of the 2nd Clinical Natural Language Processing Workshop. ACL, 72–78.
  5. IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence Approach.. In CLEF (Working Notes). 1.
  6. Interpretable deep learning to map diagnostic texts to ICD-10 codes. International journal of medical informatics 129 (2019), 49–59.
  7. Machine learning approaches on diagnostic term encoding with the ICD for clinical documentation. IEEE Journal of Biomedical and Health Informatics 22, 4 (2017), 1323–1329.
  8. A Basic Description Logic. Cambridge University Press, Cambridge, 10–49. https://doi.org/10.1017/9781139025355.002
  9. Tian Bai and Slobodan Vucetic. 2019. Improving Medical Code Prediction from Clinical Text via Incorporating Online Knowledge Sources. In The World Wide Web Conference. ACM, 72–82.
  10. Medical code prediction via capsule networks and ICD knowledge. BMC Medical Informatics and Decision Making 21, 2 (2021), 1–12.
  11. Multi-label Classification of Patient Notes: Case Study on ICD Code Assignment. In AAAI Workshop. AAAI, 1–8.
  12. Longformer: The long-document transformer. arXiv preprint arXiv:2004.05150 (2020).
  13. Is Attention Explanation? An Introduction to the Debate. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 3889–3900. https://doi.org/10.18653/v1/2022.acl-long.269
  14. TransICD: Transformer Based Code-Wise Attention Model for Explainable ICD Coding. In International Conference on Artificial Intelligence in Medicine. Springer, 469–478.
  15. Extreme multi-label ICD classification: Sensitivity to hospital service and time. IEEE Access 8 (2020), 183534–183545.
  16. Olivier Bodenreider. 2004. The unified medical language system (UMLS): integrating biomedical terminology. Nucleic acids research 32, suppl_1 (2004), D267–D270.
  17. Svetla Boytcheva. 2011. Automatic matching of ICD-10 codes to diagnoses in discharge letters. In Proceedings of the Second Workshop on Biomedical Natural Language Processing. ACL, 11–18.
  18. A systematic review of outpatient billing practices. SAGE Open Medicine 10 (2022), 20503121221099021.
  19. Systematic review of discharge coding accuracy. Journal of Public Health 34, 1 (07 2011), 138–148. https://doi.org/10.1093/pubmed/fdr054
  20. Systematic review of discharge coding accuracy. Journal of public health 34, 1 (2012), 138–148.
  21. Sentic PROMs: Application of sentic computing to the development of a novel unified framework for measuring health-care quality. Expert Systems with Applications 39, 12 (2012), 10533–10543.
  22. Sentic computing for patient centered applications. In IEEE 10th International Conference on Signal Processing Proceedings. IEEE, 1279–1282.
  23. Sharon Campbell and Katrina Giadresco. 2020. Computer-assisted clinical coding: A narrative review of the literature on its benefits, limitations, implementation and impact on clinical coding professionals. Health Information Management Journal 49, 1 (2020), 5–18.
  24. A systematic review of discharge coding accuracy. Journal of Public Health 23, 3 (2001), 205–211.
  25. Autoregressive Entity Retrieval. In International Conference on Learning Representations. 1–20. https://openreview.net/forum?id=5k8F6UU39V
  26. HyperCore: Hyperbolic and Co-graph Representation for Automatic ICD Coding. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. ACL, 3105–3114.
  27. Clinical-coder: Assigning interpretable ICD-10 codes to Chinese clinical notes. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. ACL, 294–301.
  28. Rich Caruana. 1997. Multitask learning. Machine Learning 28 (1997), 41–75.
  29. Towards automated clinical coding. International Journal of Medical Informatics 120 (2018), 50–61.
  30. Directions for Explainable Knowledge-Enabled Systems. In Knowledge Graphs for eXplainable AI – Foundations, Applications and Challenges, Ilaria Tiddi, Freddy Lecue, and Pascal Hitzler (Eds.). Vol. 47. IOS Press, Amsterdam, 245.
  31. Towards interpretable clinical diagnosis with Bayesian network ensembles stacked on entity-aware CNNs. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. ACL, 3143–3153.
  32. Contextual semantic embeddings for ontology subsumption prediction. World Wide Web (2023), 1–23.
  33. OWL2Vec*: Embedding of OWL ontologies. Machine Learning 110, 7 (2021), 1813–1845.
  34. Training a Deep Contextualized Language Model for International Classification of Diseases, 10th Revision Classification via Federated Learning: Model Development and Validation Study. JMIR Medical Informatics 10, 11 (2022), e41342.
  35. Automatic ICD-10 coding and training system: deep neural network based on supervised learning. JMIR Medical Informatics 9, 8 (2021), e23230.
  36. Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity. PloS One 12, 3 (2017), e0173410.
  37. Doctor AI: Predicting clinical events via recurrent neural networks. In Machine Learning for Healthcare Conference. PMLR, 301–318.
  38. E. Coiera. 2015a. Chapter 24: Natural language and formal terminology. In Guide to Health Informatics. CRC Press.
  39. E. Coiera. 2015b. Guide to Health Informatics. CRC Press, Taylor & Francis Group, Boca Raton, Chapter Chapter 23 Healthcare terminologies and classification systems, 381–399. https://doi.org/10.1201/b13617
  40. Isabel Coutinho and Bruno Martins. 2022. Transformer-based models for ICD-10 coding of death certificates with Portuguese text. Journal of Biomedical Informatics 136 (2022), 104232.
  41. Automatic Code Assignment to Medical Text. In Proceedings of BioNLP: Biological, Translational, and Clinical language processing. ACL, 129–136.
  42. Machine intelligence in healthcare—perspectives on trustworthiness, explainability, usability, and transparency. NPJ digital medicine 3, 1 (2020), 47.
  43. Revisiting Transformer-based Models for Long Document Classification. In Findings of the Association for Computational Linguistics: EMNLP 2022. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 7212–7230. https://aclanthology.org/2022.findings-emnlp.534
  44. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2978–2988.
  45. A hierarchical approach to the automatic categorization of medical documents. In Proceedings of the International Conference on Information and Knowledge Management. 132–139.
  46. Automated ICD coding for primary diagnosis via clinically interpretable machine learning. International Journal of Medical Informatics 153 (2021), 104543.
  47. Molla S Donaldson et al. 1999. Measuring the quality of health care. (1999).
  48. Ontology Enrichment from Texts: A Biomedical Dataset for Concept Discovery and Placement. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM ’23). Association for Computing Machinery, New York, NY, USA, 5316–5320. https://doi.org/10.1145/3583780.3615126
  49. Automated Clinical Coding: What, Why, and Where We Are? npj Digital Medicine 5 (2022), 1–8. Issue 159.
  50. Explainable Automated Coding of Clinical Notes using Hierarchical Label-wise Attention Networks and Label Embedding Initialisation. Journal of Biomedical Informatics 116 (2021), 103728.
  51. Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation. Journal of Biomedical Informatics 116 (2021), 103728.
  52. Ontology-Based and Weakly Supervised Rare Disease Phenotyping from Clinical Notes. BMC Medical Informatics and Decision Making 86 (2023). Issue 23.
  53. MHLAT: Multi-Hop Label-Wise Attention Model for Automatic ICD Coding. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5.
  54. Duodecim. 2023. Current Care Guidelines. https://www.kaypahoito.fi/
  55. CoPHE: A Count-Preserving Hierarchical Evaluation Metric in Large-Scale Multi-Label Text Classification. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. ACL, 907–912.
  56. Horses to Zebras: Ontology-Guided Data Augmentation and Synthesis for ICD-9 Coding. In Proceedings of the 21st Workshop on Biomedical Language Processing. Association for Computational Linguistics, Dublin, Ireland, 389–401.
  57. Can GPT-3.5 Generate and Code Discharge Summaries? arXiv:2401.13512 [cs.CL]
  58. Ontological attention ensembles for capturing semantic concepts in ICD code prediction from clinical text. In Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019). ACL, 168–177.
  59. Richárd Farkas and György Szarvas. 2008. Automatic Construction of Rule-based ICD-9-CM Coding Systems. In BMC Bioinformatics, Vol. 9(Suppl 3). Springer, 1–9.
  60. Active learning for medical code assignment. In ACM Conference on Health, Inference, and Learning (CHIL) Workshop.
  61. Description-based Label Attention Classifier for Explainable ICD-9 Classification. In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). ACL, 62–66.
  62. Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning. PMLR, 1126–1135.
  63. Limitations of Transformers on Clinical Text Classification. IEEE Journal of Biomedical and Health Informatics 25, 9 (2021), 3596–3607. https://doi.org/10.1109/JBHI.2021.3062322
  64. Making Pre-trained Language Models Better Few-shot Learners. (Aug. 2021), 3816–3830. https://doi.org/10.18653/v1/2021.acl-long.295
  65. Multi-features-Based Automatic Clinical Coding for Chinese ICD-9-CM-3. In Proceedings of the 30th International Conference on Artificial Neural Networks and Machine Learning. Springer, 473–486.
  66. DKEC: Domain Knowledge Enhanced Multi-Label Classification for Electronic Health Records. arXiv preprint arXiv:2310.07059 (2023).
  67. Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings. In Procceedings of EACL.
  68. Irit Hadar and Pnina Soffer. 2006. Variations in conceptual modeling: classification and ontological analysis. Journal of the Association for Information Systems 7, 8 (2006), 1.
  69. Assigning diagnosis codes using medication history. Artificial Intelligence in Medicine 128 (2022), 102307.
  70. Sepp Hochreiter. 1998. The vanishing gradient problem during learning recurrent neural nets and problem solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 6, 02 (1998), 107–116.
  71. Andreas Holzinger. 2016. Interactive machine learning for health informatics: when do we need the human-in-the-loop? Brain Informatics 3, 2 (2016), 119–131.
  72. Modelling long medical documents and code associations for explainable automatic ICD coding. Expert Systems with Applications (2024), 123519.
  73. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 7132–7141.
  74. Transformer quality in linear time. In International Conference on Machine Learning. PMLR, 9099–9117.
  75. ClinicalBERT: Modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:1904.05342 (2019).
  76. Sähköisen potilaskertomuksen rakenteistaminen: Menetelmät, arviointikäytännöt ja vaikutukset. (2014).
  77. Dilated Convolutional Attention Network for Medical Code Assignment from Clinical Text. In Proceedings of the 3rd Clinical Natural Language Processing Workshop. ACL, 73–78.
  78. Does the Magic of BERT Apply to Medical Code Assignment? A Quantitative Study. Computers in Biology and Medicine 139 (2021), 104998.
  79. Suicidal Ideation and Mental Disorder Detection with Attentive Relation Networks. Neural Computing and Applications 34 (2022), 10309–10319. Issue 13.
  80. Shaoxiong Ji and Pekka Marttinen. 2023. Patient Outcome and Zero-shot Diagnosis Prediction with Hypernetwork-guided Multitask Learning. In Proceedings of EACL.
  81. A Survey on Knowledge Graphs: Representation, Acquisition and Applications. IEEE Transactions on Neural Networks and Learning Systems 33 (2022), 494–514. Issue 2.
  82. Medical Code Assignment with Gated Convolution and Note-Code Interaction. In Findings of ACL-IJCNLP. ACL, 1034–1043.
  83. When in Doubt: Improving Classification Performance with Alternating Normalization. In Findings of the Association for Computational Linguistics: EMNLP 2021. ACL, 1716–1723.
  84. A hybrid method for ICD-10 auto-coding of Chinese diagnoses. In MEDINFO 2017: Precision Healthcare through Informatics. IOS Press, 427–431.
  85. MIMIC-III, a Freely Accessible Critical Care Database. Scientific Data 3 (2016), 160035.
  86. Pollard Tom Horng Steven Celi Leo Anthony Johnson, Alistair and Roger Mark. 2023. MIMIC-IV-Note: Deidentified free-text clinical notes. PhysioNet. https://doi.org/10.13026/1n74-ne17
  87. Automatic diagnosis coding of radiology reports: a comparison of deep learning and conventional classification methods. In Proceedings of BioNLP. ACL, 328–332.
  88. A Systematic Literature Review of Automated ICD Coding and Classification Systems using Discharge Summaries. arXiv preprint arXiv:2107.10652 (2021).
  89. AI-based ICD coding and classification approaches using discharge summaries: A systematic literature review. Expert Systems with Applications 213 (2023), 118997.
  90. A tale of two epidemics: Contextual Word2Vec for classifying twitter streams during outbreaks. Information Processing & Management 56, 1 (2019), 247–257.
  91. Sarika R Khope and Susan Elias. 2023. Strategies of Predictive Schemes and Clinical Diagnosis for Prognosis Using MIMIC-III: A Systematic Review. In Healthcare, Vol. 11. Multidisciplinary Digital Publishing Institute, 710.
  92. Can Current Explainability Help Provide References in Clinical Notes to Support Humans Annotate Medical Codes?. In Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis.
  93. Byung-Hak Kim and Varun Ganapathi. 2021. Read, Attend, and Code: Pushing the Limits of Medical Codes Prediction from Clinical Notes by Machines. In Machine Learning for Healthcare Conference. PMLR, 196–208.
  94. An Automatic ICD Coding Network Using Partition-Based Label Attention. arXiv preprint arXiv:2211.08429 (2022).
  95. AnEMIC: A Framework for Benchmarking ICD Coding Models. In Conference on Empirical Methods in Natural Language Processing (EMNLP), System Demonstrations. ACL.
  96. Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Alessandro Moschitti, Bo Pang, and Walter Daelemans (Eds.). Association for Computational Linguistics, Doha, Qatar, 1746–1751. https://doi.org/10.3115/v1/D14-1181
  97. King’s College Hospital. 2021. CogStack wins an Artificial Intelligence in Health and Care. https://www.kch.nhs.uk/news/public/news/view/34965.
  98. Thomas N Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In International Conference on Learning Representations.
  99. Hannu Kivijärvi and Karoliina Pärnänen. 2023. Instrumental usability and effective user experience: Interwoven drivers and outcomes of Human-Computer interaction. International Journal of Human-Computer Interaction 39, 1 (2023), 34–51.
  100. Jorma Komulainen. 2012. Suomalainen tautien kirjaamisen ohjekirja. (2012).
  101. Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit. Artif. Intell. Med. 117 (July 2021), 102083. https://doi.org/10.1016/j.artmed.2021.102083
  102. EL Embeddings: Geometric construction of models for the description logic EL++. In International Joint Conferences on Artificial Intelligence. 6103–6109.
  103. Teven Le Scao and Alexander Rush. 2021. How many data points is a prompt worth?. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tur, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, and Yichao Zhou (Eds.). Association for Computational Linguistics, Online, 2627–2636. https://doi.org/10.18653/v1/2021.naacl-main.208
  104. Fei Li and Hong Yu. 2020. ICD coding from clinical text using multi-filter residual convolutional neural network. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. AAAI, 8180–8187.
  105. Neural natural language processing for unstructured data in electronic health records: a review. Computer Science Review 46 (2022), 100511.
  106. JLAN: medical code prediction via joint learning attention networks and denoising mechanism. BMC Bioinformatics 22, 1 (2021), 1–21.
  107. Automatic International Classification of Diseases Coding via Note-Code Interaction Network with Denoising Mechanism. Journal of Computational Biology 30, 8 (2023), 912–925.
  108. NIDN: Medical Code Assignment via Note-Code Interaction Denoising Network. In Proceedings of 18th International Symposium on Bioinformatics Research and Applications (ISBRA). 62–74.
  109. Towards Automatic ICD Coding via Knowledge Enhanced Multi-Task Learning. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 1238–1248.
  110. A structured self-attentive sentence embedding. In International Conference on Learning Representations.
  111. The unified medical language system. Yearbook of medical informatics 2, 01 (1993), 41–51.
  112. Large Scale Diagnostic Code Classification for Medical Patient Records. In Proceedings of IJCNLP. ACL.
  113. Parameter Selection: Why We Should Pay More Attention to It. In ACL-IJCNLP. ACL, 825–830.
  114. Automated ICD Coding using Extreme Multi-label Long Text Transformer-based Models. Journal of Biomedical Informatics 133 (2022), 104161.
  115. Hierarchical label-wise attention transformer model for explainable ICD coding. Journal of Biomedical Informatics 133 (2022), 104161.
  116. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. Comput. Surveys 55, 9 (2023), 1–35.
  117. Effective Convolutional Attention Network for Multi-label Clinical Document Classification. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. ACL, 5941–5953.
  118. TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding. In Proceedings of the 29th International Conference on Computational Linguistics. 3054–3063.
  119. Towards Semi-Structured Automatic ICD Coding via Tree-based Contrastive Learning. Advances in Neural Information Processing Systems 36 (2024).
  120. Multi-label Few/Zero-shot Learning with Knowledge Aggregated from Multiple Label Graphs. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). ACL, 2935–2943.
  121. Pengli Lu and Jingjin Xue. 2023. Combining transformer-based model and GCN to predict ICD codes from clinical records. Knowledge-Based Systems 282 (2023), 111113.
  122. CoRelation: Boosting Automatic ICD Coding Through Contextualized Code Relation Learning. In Proceedings of LREC-COLING.
  123. Fusion: Towards Automated ICD Coding via Feature Compression. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. ACL, 2096–2101.
  124. BioGPT: generative pre-trained transformer for biomedical text generation and mining. Briefings in Bioinformatics 23, 6 (2022).
  125. Julia Medori and Cédrick Fairon. 2010. Machine Learning and Features Selection for Semi-automatic ICD-9-CM Encoding. In Proceedings of LOUHI Workshop on Text and Data Mining of Health Documents. ACL, 84–89.
  126. Genevieve B Melton and George Hripcsak. 2005. Automated detection of adverse events using natural language processing of discharge summaries. Journal of the American Medical Informatics Association 12, 4 (2005), 448–457.
  127. ICDBigBird: A Contextual Embedding Model for ICD Code Classification. In Proceedings of Biomedical Natural Language Processing. 330–336.
  128. Distributed Representations of Words and Phrases and their Compositionality. In Advances in neural information processing systems. 3111–3119.
  129. Overview of automatic clinical coding: annotations, guidelines, and solutions for non-english clinical cases at codiesp track of CLEF eHealth 2020. In Working Notes of Conference and Labs of the Evaluation (CLEF) Forum. CEUR Workshop Proceedings.
  130. Mark Morsch. 2010. Computer-assisted coding: the secret weapon. CAC does not eliminate the need for medical-coding professionals to be involved in the coding process, but it can make them more productive and accurate. Health Management Technology 31, 2 (2010), 24–26.
  131. Explainable Prediction of Medical Codes from Clinical Text. In Proceedings of NAACL-HLT. 1101–1111.
  132. G Jaya Nair. 2013. Ensuring quality in the coding process: A key differentiator for the accurate interpretation of safety data. Perspectives in clinical research 4, 3 (2013), 181.
  133. Modelling Temporal Document Sequences for Clinical ICD Coding. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, Andreas Vlachos and Isabelle Augenstein (Eds.). Association for Computational Linguistics, Dubrovnik, Croatia, 1640–1649. https://doi.org/10.18653/v1/2023.eacl-main.120
  134. A Two-Stage Decoder for Efficient ICD Coding. In Findings of the Association for Computational Linguistics: ACL 2023, Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistics, Toronto, Canada, 4658–4665. https://doi.org/10.18653/v1/2023.findings-acl.285
  135. Retrieve and rerank for automated ICD coding via Contrastive Learning. Journal of Biomedical Informatics 143 (2023), 104396.
  136. Supporting the Billing Process in Outpatient Medical Care: Automated Medical Coding Through Machine Learning. In Enropean Conference on Information Systems. 1–18.
  137. Luis Oberste and Armin Heinzl. 2022. User-centric explainability in healthcare: a knowledge-level perspective of informed machine learning. IEEE Transactions on Artificial Intelligence (2022).
  138. International Health Terminology Standards Development Organisation. 2024. Clinical Finding Defining Attributes. SNOMED CT Editorial Guide https://confluence.ihtsdotools.org/display/DOCEG/Clinical+Finding+Defining+Attributes. Accessed: March 2024.
  139. A survey of the usages of deep learning for natural language processing. IEEE Transactions on Neural Networks and Learning Systems 32, 2 (2020), 604–624.
  140. Continual lifelong learning with neural networks: A review. Neural Networks 113 (2019), 54–71.
  141. The accuracy of ICD codes for cerebrovascular diseases in medical insurance claims. Journal of Preventive Medicine and Public Health 33, 1 (2000), 76–82.
  142. Towards BERT-based Automatic ICD Coding: Limitations and Opportunities. In Proceedings of the 20th Workshop on Biomedical Language Processing. 54–63.
  143. GloVe: Global vectors for word representation. In EMNLP. 1532–1543.
  144. Diagnosis code assignment: models and evaluation metrics. JAMIA 21, 2 (2014), 231–237.
  145. A shared task involving multi-label classification of clinical free text. In Biological, translational, and clinical language processing. 97–104.
  146. Development and external validation of automated ICD-10 coding from discharge summaries using deep learning approaches. Informatics in Medicine Unlocked (2023), 101227.
  147. Condensed Memory Networks for Clinical Diagnostic Inferencing. In Proceedings of AAAI.
  148. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research 21, 140 (2020), 1–67. http://jmlr.org/papers/v21/20-074.html
  149. Tieto-ja viestintäteknologian käyttö terveydenhuollossa vuonna 2020: Tilanne ja kehityksen suunta. (2021).
  150. Assigning ICD-O-3 codes to pathology reports using neural multi-task training with hierarchical regularization. In Proceedings of ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. 1–10.
  151. Anthony Rios and Ramakanth Kavuluru. 2018. Few-Shot and Zero-Shot Multi-Label Learning for Structured Label Spaces. In Proceedings of EMNLP. 3132–3142.
  152. DiLBERT: Cheap embeddings for disease related medical NLP. IEEE Access 9 (2021), 159714–159723.
  153. Medical code prediction with multi-view convolution and description-regularized label-dependent attention. arXiv preprint arXiv:1811.01468 (2018).
  154. Multiparameter Intelligent Monitoring in Intensive Care II (MIMIC-II): A Public-access Intensive Care Unit Database. Critical Care Medicine 39, 5 (2011), 952.
  155. Experimental Evaluation and Development of a Silver-Standard for the MIMIC-III Clinical Coding Dataset. In Proceedings of SIGBioMed Workshop on Biomedical Language Processing. 76–85.
  156. MedCATTrainer: A Biomedical Free Text Annotation Interface with Active Learning and Research Use Case Specific Customisation. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations. 139–144.
  157. Towards Automated ICD Coding Using Deep Learning. arXiv preprint arXiv:1711.04075 (2017).
  158. Gail I Smith and June Bronnert. 2010. Transitioning to CAC: the skills and tools required to work with computer-assisted coding. Journal of AHIMA 81, 7 (2010), 60–61.
  159. Generalized Zero-Shot Text Classification for ICD Coding. In IJCAI. 4018–4024.
  160. A Systematic Literature Review of Automated Clinical Coding and Classification systems. JAMIA 17, 6 (2010), 646–651.
  161. Multitask Recalibrated Aggregation Network for Medical Code Prediction. In Proceedings of ECML-PKDD. 367–383.
  162. Multitask Balanced and Recalibrated Network for Medical Code Prediction. ACM Transactions on Intelligent Systems and Technology 14, 1 (2023), 1–20.
  163. Machine learning to automate the assignment of diagnosis codes to free-text radiology reports: a method description. In Proceedings of the ICML/UAI/COLT workshop on machine learning for health-care applications.
  164. A review on deep neural networks for ICD coding. IEEE Transactions on Knowledge and Data Engineering (2022).
  165. Automatic medical code assignment via deep learning approach for intelligent healthcare. IEEE JBHI 24, 9 (2020), 2506–2515.
  166. Explainable prediction of medical codes with knowledge graphs. Frontiers in bioengineering and biotechnology 8 (2020), 867.
  167. Natural language processing advancements by deep learning: A survey. arXiv preprint arXiv:2003.01200 (2020).
  168. Leveraging hierarchical category knowledge for data-imbalanced multi-label diagnostic text understanding. In Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019). 39–43.
  169. Modeling Diagnostic Label Correlation for Automatic ICD Coding. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 4043–4052.
  170. Attention is all you need. In Advances in neural information processing systems. 5998–6008.
  171. Graph Attention Networks. In International Conference on Learning Representations.
  172. Automating the overburdened clinical coding system: challenges and next steps. npj Digital Medicine 6, 1 (2023), 16. https://doi.org/10.1038/s41746-023-00768-0
  173. A label attention model for ICD coding from clinical text. In Proceedings of IJCAI.
  174. W3C Recommendation. 2012. OWL EL, OWL 2 Web Ontology Language Profiles (Second Edition). https://www.w3.org/TR/owl2-profiles/#OWL_2_EL. Accessed: 2024-03-13.
  175. Joint Embedding of Words and Labels for Text Classification. In Proceedings of ACL. 2321–2331.
  176. A study of entity-linking methods for normalizing Chinese diagnosis and procedure terms to ICD codes. Journal of Biomedical Informatics 105 (2020), 103418.
  177. Meta-LMTC: Meta-Learning for Large-Scale Multi-Label Text Classification. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 8633–8646.
  178. Diagnosis code assignment using sparsity-based disease correlation embedding. IEEE TKDE 28, 12 (2016), 3191–3202.
  179. Few-Shot Electronic Health Record Coding through Graph Contrastive Learning. arXiv preprint arXiv:2106.15467 (2021).
  180. Coding Electronic Health Records with Adversarial Reinforcement Path Generation. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval. 801–810.
  181. HieNet: Bidirectional Hierarchy Framework for Automated ICD Coding. In Database Systems for Advanced Applications: 27th International Conference, DASFAA 2022, Virtual Event, April 11–14, 2022, Proceedings, Part II. 523–539.
  182. Using deep learning for automatic ICD-10 classification from free-text data. European Journal of Biomedical Informatics 16, 1 (2020).
  183. A Novel Framework Based on Medical Concept Driven Attention for Explainable Medical Code Prediction via External Knowledge. In Findings of the Association for Computational Linguistics: ACL 2022. 1407–1416.
  184. Generalizing from a few examples: A survey on few-shot learning. Comput. Surveys 53, 3 (2020), 1–34.
  185. Clinical Concept Extraction for Document-Level Coding. In Proceedings of the 18th BioNLP Workshop and Shared Task. 261–272.
  186. Shaping a data-driven era in dementia care pathway through computational neurology approaches. BMC Medicine 18, 1 (16 Dec 2020), 398. https://doi.org/10.1186/s12916-020-01841-1
  187. Model Distillation for Faithful Explanations of Medical Code Predictions. In Proceedings of the 21st Workshop on Biomedical Language Processing. 412–425.
  188. Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 1942–1955.
  189. SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research. Journal of the American Medical Informatics Association 25, 5 (01 2018), 530–537. https://doi.org/10.1093/jamia/ocx160
  190. Graph neural networks for natural language processing: A survey. Foundations and Trends® in Machine Learning 16, 2 (2023), 119–328.
  191. JAN: Joint Attention Networks for Automatic ICD Coding. IEEE Journal of Biomedical and Health Informatics 26, 10 (2022), 5235–5246.
  192. Knowledge-based dynamic prompt learning for multi-label disease diagnosis. Knowledge-Based Systems 286 (2024), 111395.
  193. EHR coding with multi-scale feature attention and structured knowledge graph propagation. In Proceedings of ACM CIKM. 649–658.
  194. Faithful embeddings for EL++ knowledge bases. In International Semantic Web Conference. Springer, 22–38.
  195. Multimodal machine learning for automated ICD coding. In Machine learning for healthcare conference. PMLR, 197–215.
  196. SGM: Sequence Generation Model for Multi-label Classification. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, 3915–3926.
  197. A large language model for electronic health records. npj Digital Medicine 5, 1 (2022), 194.
  198. Clinical assistant diagnosis for electronic medical record based on convolutional neural network. Scientific reports 8, 1 (2018), 6329.
  199. Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt. In AAAI.
  200. Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding. In Findings of EMNLP.
  201. Improving Predictions of Tail-end Labels using Concatenated BioMed-Transformers for Long Medical Documents. arXiv preprint arXiv:2112.01718 (2021).
  202. Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN. Journal of Biomedical Informatics 91 (2019), 103114.
  203. The Graph-based Mutual Attentive Network for Automatic Diagnosis. In IJCAI. 3393–3399.
  204. Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding. In ACL.
  205. Big bird: Transformers for longer sequences. In Advances in neural information processing systems, Vol. 33. 17283–17297.
  206. Fabio Massimo Zanzotto. 2019. Human-in-the-loop artificial intelligence. Journal of Artificial Intelligence Research 64 (2019), 243–252.
  207. Automatic ICD-9 Coding via Deep Transfer Learning. Neurocomputing 324 (2019), 43–50.
  208. Kaizhong Zhang and Dennis Shasha. 1989. Simple fast algorithms for the editing distance between trees and related problems. SIAM journal on computing 18, 6 (1989), 1245–1262.
  209. Ning Zhang and Maciej Jankowski. 2022. Hierarchical BERT for Medical Document Understanding. arXiv preprint arXiv:2204.09600 (2022).
  210. Yu Zhang and Qiang Yang. 2021. A survey on multi-task learning. IEEE TKDE 34 (2021), 5586 – 5609. Issue 12.
  211. BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining. In Proceedings of the 3rd Clinical Natural Language Processing Workshop. 24–34.
  212. Automated ICD coding for coronary heart diseases by a deep learning method. Heliyon (2023), e14037.
  213. Construction of a semi-automatic ICD-10 coding system. BMC medical informatics and decision making 20 (2020), 1–12.
  214. Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 5948–5957.
Citations (22)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets