Explicit modeling of presentation context in objective audio quality metrics
Develop an explicit modeling framework to incorporate presentation context—such as mixed-condition trials where identical signals and distortions are presented alongside different stereo processing modes (e.g., SHmix and QNmix combining Mid/Side and Left/Right degradations)—into objective audio quality metrics, so that predicted quality scores reflect top-down contextual influences observed in MUSHRA listening tests.
References
Future developments in audio quality metrics need to address this limitation. Although it is not clear how presentation context can be explicitly modeled into the metrics (a top-down process), a potential solution may include a data-driven approach that incorporates ground truth sets with different presentation contexts to map the different distortion metrics (i.e., bottom-up processes) into a quality score estimate.