Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Bayesian Optimization over High-Dimensional Combinatorial Spaces via Dictionary-based Embeddings (2303.01774v1)

Published 3 Mar 2023 in cs.LG and stat.ML

Abstract: We consider the problem of optimizing expensive black-box functions over high-dimensional combinatorial spaces which arises in many science, engineering, and ML applications. We use Bayesian Optimization (BO) and propose a novel surrogate modeling approach for efficiently handling a large number of binary and categorical parameters. The key idea is to select a number of discrete structures from the input space (the dictionary) and use them to define an ordinal embedding for high-dimensional combinatorial structures. This allows us to use existing Gaussian process models for continuous spaces. We develop a principled approach based on binary wavelets to construct dictionaries for binary spaces, and propose a randomized construction method that generalizes to categorical spaces. We provide theoretical justification to support the effectiveness of the dictionary-based embeddings. Our experiments on diverse real-world benchmarks demonstrate the effectiveness of our proposed surrogate modeling approach over state-of-the-art BO methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Aryan Deshwal (20 papers)
  2. Sebastian Ament (19 papers)
  3. Maximilian Balandat (27 papers)
  4. Eytan Bakshy (38 papers)
  5. Janardhan Rao Doppa (62 papers)
  6. David Eriksson (22 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.