Prompting Strategy for CLAP-based Audio Quality Assessment
Determine an effective prompting strategy and computational setup for audio quality assessment with CLAP (Contrastive Language-Audio Pretraining), an audio-language model that embeds audio and text in a joint space. The goal is to derive a quality estimate from an audio input and a set of quality-related text prompts.
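To make the shape of the problem concrete, here is a minimal sketch of one candidate setup, not a settled answer: contrast a "good quality" prompt against a "bad quality" prompt and take the softmax probability of the former as the score. Everything in it is an assumption made for illustration: the two-prompt design, the `scale` temperature, and the names `cosine` and `quality_score`. The embeddings are plain lists of floats standing in for the output of CLAP's audio and text encoders, so no real CLAP API is called.

```python
import math
from typing import Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def quality_score(
    audio_emb: Sequence[float],
    good_emb: Sequence[float],
    bad_emb: Sequence[float],
    scale: float = 1.0,
) -> float:
    """Softmax probability assigned to the 'good quality' prompt.

    audio_emb: embedding of the audio clip (placeholder for an audio encoder output).
    good_emb / bad_emb: embeddings of two opposing quality prompts
        (placeholders for text encoder outputs; the wording is a design choice).
    scale: hypothetical temperature applied to the similarities.
    """
    s_good = scale * cosine(audio_emb, good_emb)
    s_bad = scale * cosine(audio_emb, bad_emb)
    # A two-way softmax reduces to a logistic function of the difference.
    return 1.0 / (1.0 + math.exp(s_bad - s_good))


# Toy 2-D embeddings: the audio is aligned with the "good" prompt.
audio = [1.0, 0.0]
good_prompt = [1.0, 0.0]
bad_prompt = [0.0, 1.0]
score = quality_score(audio, good_prompt, bad_prompt)
```

A score above 0.5 means the audio embedding lies closer to the "good" prompt than to the "bad" one. Which prompt wording to use, how many prompts to contrast, and how to scale the similarities are the choices the problem statement leaves open.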
References
Determining the prompting strategy and setup to use CLAP for audio quality assessment is still an open question.
— PAM: Prompting Audio-Language Models for Audio Quality Assessment
(2402.00282 - Deshmukh et al., 1 Feb 2024) in Section 2.1 (Audio Quality Assessment)