Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Does CLIP Know My Face? (2209.07341v4)

Published 15 Sep 2022 in cs.LG, cs.CR, and cs.CV

Abstract: With the rise of deep learning in various applications, privacy concerns around the protection of training data have become a critical area of research. Whereas prior studies have focused on privacy risks in single-modal models, we introduce a novel method to assess privacy for multi-modal models, specifically vision-LLMs like CLIP. The proposed Identity Inference Attack (IDIA) reveals whether an individual was included in the training data by querying the model with images of the same person. Letting the model choose from a wide variety of possible text labels, the model reveals whether it recognizes the person and, therefore, was used for training. Our large-scale experiments on CLIP demonstrate that individuals used for training can be identified with very high accuracy. We confirm that the model has learned to associate names with depicted individuals, implying the existence of sensitive information that can be extracted by adversaries. Our results highlight the need for stronger privacy protection in large-scale models and suggest that IDIAs can be used to prove the unauthorized use of data for training and to enforce privacy laws.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Dominik Hintersdorf (17 papers)
  2. Lukas Struppek (21 papers)
  3. Manuel Brack (25 papers)
  4. Felix Friedrich (40 papers)
  5. Patrick Schramowski (48 papers)
  6. Kristian Kersting (205 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.