
A Novel ICD Coding Method Based on Associated and Hierarchical Code Description Distillation (2404.11132v2)

Published 17 Apr 2024 in cs.CL

Abstract: ICD (International Classification of Diseases) coding involves assigning ICD codes to patient visits based on their medical notes. It is a challenging multi-label text classification problem because the medical documents used as input are noisy. Recent work on automated ICD coding has improved performance by integrating additional data and knowledge bases with the encoding of medical notes and codes. However, most of these methods ignore the code hierarchy, leading to improper code assignments. To address this problem, we propose a novel framework based on associated and hierarchical code description distillation (AHDD) for better code representation learning and avoidance of improper code assignment. The framework leverages the code descriptions and the hierarchical structure inherent to the ICD codes. The code descriptions are also used to make the attention layer and the output layer description-aware. Experimental results on the benchmark dataset show the superiority of the proposed framework over several state-of-the-art baselines.
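
To make the problem setting concrete, the following is a minimal Python sketch of the two ideas the abstract relies on: ICD coding as multi-label classification (each visit maps to a multi-hot vector over the label space) and the code hierarchy (codes that share a parent category are closely related). It is not the AHDD model. The tiny label space, the helper names (`parent_category`, `siblings`, `multi_hot`), and the assumption that an ICD-9 diagnosis code's parent category is the part before the decimal point are illustrative choices, not details taken from the paper.

```python
from typing import Dict, Iterable, List

# Hypothetical toy label space: three ICD-9 diagnosis codes with descriptions.
# A real benchmark label space contains thousands of codes.
CODE_DESCRIPTIONS: Dict[str, str] = {
    "401.9": "Unspecified essential hypertension",
    "428.0": "Congestive heart failure, unspecified",
    "428.1": "Left heart failure",
}

# Fixed label order, so every visit maps to a vector of the same shape.
LABELS: List[str] = sorted(CODE_DESCRIPTIONS)


def parent_category(code: str) -> str:
    """Return the category part of a code (assumed: text before the dot)."""
    return code.split(".")[0]


def siblings(code: str) -> List[str]:
    """Other codes in the label space that share the same parent category."""
    parent = parent_category(code)
    return [c for c in LABELS if c != code and parent_category(c) == parent]


def multi_hot(assigned: Iterable[str]) -> List[int]:
    """Encode the set of codes assigned to one visit as a 0/1 vector."""
    assigned_set = set(assigned)
    return [1 if c in assigned_set else 0 for c in LABELS]


if __name__ == "__main__":
    visit_codes = ["428.0", "401.9"]
    print(multi_hot(visit_codes))   # target vector for one visit
    print(siblings("428.0"))        # easily confused codes under category 428
```

Sibling codes such as 428.0 and 428.1 have similar descriptions, which is one reason a model that ignores the hierarchy may assign an improper code within the right category.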

Authors (2)
  1. Bin Zhang (227 papers)
  2. Junli Wang (18 papers)