Mitigating inferential disclosure in wearable VLM interactions
Develop privacy-preserving methods for vision–language model interactions on wearable cameras that prevent inferential disclosure of sensitive attributes—such as location identification from architectural details, signage, and geographical features—especially when query targets contain identifying features.
References
Prior work found that models correctly inferred location from architectural details, signage, and geographical features in 65.6\% of test cases. Addressing such inferential disclosure, particularly when query targets themselves contain identifying features, remains an open challenge for privacy-preserving VLM interaction on wearables.
— VueBuds: Visual Intelligence with Wireless Earbuds
(2603.29095 - Kim et al., 31 Mar 2026) in Discussion and Future Work, Social dynamics and privacy