Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CPMLHO:Hyperparameter Tuning via Cutting Plane and Mixed-Level Optimization (2212.06150v1)

Published 11 Dec 2022 in cs.LG

Abstract: The hyperparameter optimization of neural network can be expressed as a bilevel optimization problem. The bilevel optimization is used to automatically update the hyperparameter, and the gradient of the hyperparameter is the approximate gradient based on the best response function. Finding the best response function is very time consuming. In this paper we propose CPMLHO, a new hyperparameter optimization method using cutting plane method and mixed-level objective function.The cutting plane is added to the inner layer to constrain the space of the response function. To obtain more accurate hypergradient,the mixed-level can flexibly adjust the loss function by using the loss of the training set and the verification set. Compared to existing methods, the experimental results show that our method can automatically update the hyperparameters in the training process, and can find more superior hyperparameters with higher accuracy and faster convergence.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Shuo Yang (244 papers)
  2. Yang Jiao (127 papers)
  3. Shaoyu Dou (8 papers)
  4. Mana Zheng (1 paper)
  5. Chen Zhu (103 papers)

Summary

We haven't generated a summary for this paper yet.