
Context-Aware Membership Inference Attacks against Pre-trained Large Language Models (2409.13745v1)

Published 11 Sep 2024 in cs.CL, cs.AI, cs.CR, cs.LG, and stat.ML

Abstract: Prior Membership Inference Attacks (MIAs) on pre-trained LLMs, adapted from attacks on classification models, fail because they ignore the generative process of LLMs across token sequences. In this paper, we present a novel attack that adapts MIA statistical tests to the perplexity dynamics of subsequences within a data point. Our method significantly outperforms prior loss-based approaches, revealing context-dependent memorization patterns in pre-trained LLMs.
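To make the core idea concrete, the sketch below scores a candidate text by the perplexity of its sliding-window token subsequences under the target model, rather than by a single sequence-level loss. This is not the paper's statistical test; it is a minimal illustration assuming a Hugging Face causal LM, and the model name ("gpt2"), window size, and min-perplexity statistic are all illustrative choices.

```python
# Minimal sketch of a subsequence-level, perplexity-based membership score.
# NOT the paper's exact attack: it only illustrates scoring a text by the
# perplexity dynamics of its sliding-window subsequences under the target model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumption: any causal LM loadable from Hugging Face
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

def token_log_likelihoods(text: str) -> torch.Tensor:
    """Per-token log-likelihoods under the target model (next-token prediction)."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits
    # Shift so that position t predicts token t+1.
    log_probs = torch.log_softmax(logits[:, :-1, :], dim=-1)
    targets = ids[:, 1:]
    return log_probs.gather(-1, targets.unsqueeze(-1)).squeeze(-1).squeeze(0)

def subsequence_perplexities(text: str, window: int = 32) -> torch.Tensor:
    """Perplexity of each sliding-window subsequence of the token sequence."""
    ll = token_log_likelihoods(text)
    if ll.numel() < window:
        # Text shorter than one window: fall back to whole-sequence perplexity.
        return torch.exp(-ll.mean()).unsqueeze(0)
    windows = ll.unfold(0, window, 1)       # shape: (num_windows, window)
    return torch.exp(-windows.mean(dim=1))  # per-window perplexity

def membership_score(text: str, window: int = 32) -> float:
    """Illustrative statistic over subsequence perplexities; lower suggests membership."""
    return subsequence_perplexities(text, window).min().item()

# Usage: compare the score against a threshold calibrated on known non-member texts.
score = membership_score("The quick brown fox jumps over the lazy dog.")
print(f"min subsequence perplexity: {score:.2f}")
```

In an actual attack, the per-window perplexities would feed a calibrated statistical test rather than a raw minimum, but the sketch shows why subsequence-level scores can expose context-dependent memorization that a single sequence-level loss averages away.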
