Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A framework for the extraction of Deep Neural Networks by leveraging public data (1905.09165v1)

Published 22 May 2019 in cs.LG, cs.AI, cs.CR, and stat.ML

Abstract: Machine learning models trained on confidential datasets are increasingly being deployed for profit. Machine Learning as a Service (MLaaS) has made such models easily accessible to end-users. Prior work has developed model extraction attacks, in which an adversary extracts an approximation of MLaaS models by making black-box queries to it. However, none of these works is able to satisfy all the three essential criteria for practical model extraction: (1) the ability to work on deep learning models, (2) the non-requirement of domain knowledge and (3) the ability to work with a limited query budget. We design a model extraction framework that makes use of active learning and large public datasets to satisfy them. We demonstrate that it is possible to use this framework to steal deep classifiers trained on a variety of datasets from image and text domains. By querying a model via black-box access for its top prediction, our framework improves performance on an average over a uniform noise baseline by 4.70x for image tasks and 2.11x for text tasks respectively, while using only 30% (30,000 samples) of the public dataset at its disposal.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Soham Pal (17 papers)
  2. Yash Gupta (11 papers)
  3. Aditya Shukla (10 papers)
  4. Aditya Kanade (29 papers)
  5. Shirish Shevade (18 papers)
  6. Vinod Ganapathy (3 papers)
Citations (55)

Summary

We haven't generated a summary for this paper yet.