Fine-grained linguistic hierarchies in S3M layerwise representations
Determine whether fine-grained distinctions between phone-, syllable-, word-, and sentence-level linguistic structures are reflected in the layerwise representations of self-supervised speech models.
References
However, because most linguistic investigations into S3M representations have so far focussed on individual structural levels, with analysis data and methods varying between studies, it is currently unknown whether fine-grained distinctions between levels of linguistic organization (e.g. phone-, syllable-, word- and sentence-level structures) are reflected in S3M layerwise hierarchies.
— Tracking the emergence of linguistic structure in self-supervised models learning from speech
(2604.02043 - Kloots et al., 2 Apr 2026) in Section 2.1 (Related work: Layerwise hierarchies and linguistic structure in S3Ms)