Scalability of multi-language syntax highlighting models beyond six languages
Determine the performance of the convolutional neural network-based multi-language and few-shot syntax highlighting models introduced in this study (ML32/ML64/ML128 and FS32/FS64/FS128, with or without Token Normalization) when trained and evaluated on datasets comprising more than six programming languages, to assess scalability and potential limitations in handling diverse and larger multilingual datasets.
Sponsor
References
However, the performance of these models in scenarios involving more than six languages has not been investigated.
— Multi Language Models for On-the-Fly Syntax Highlighting
(2510.04166 - Palma et al., 5 Oct 2025) in Section 3.5 (Threats to Validity)