Identify features driving increases or decreases in perceived kawaii after voice frequency manipulation
Identify the specific acoustic or perceptual features of digital voices—across both text-to-speech systems and prerecorded game character voices—that are responsible for increases (kawaii++) or decreases (kawaii--) in perceived kawaiiness when fundamental (F0) and first-to-third formant (F1–F3) frequencies are manipulated. Determine which voice attributes, beyond F0 and F1–F3, explain the divergent outcomes observed and resolve potential confounds introduced by other voice or sound features.
References
We were not able to identify what specific features of the voices contributed to kawaii++ or kawaii--. Future work will need to explore these voices in more detail or consider other voice or sound features that may have confounded the results.