Dropout with Tabu Strategy for Regularizing Deep Neural Networks (1808.09907v1)

Published 29 Aug 2018 in cs.LG and stat.ML

Abstract: Dropout has proven to be an effective technique for regularization and for preventing the co-adaptation of neurons in deep neural networks (DNNs). It randomly drops units with a probability $p$ during the training stage of a DNN. Dropout also provides a way of efficiently and approximately combining exponentially many different neural network architectures. In this work, we add a diversification strategy to dropout that aims to generate more distinct neural network architectures across iterations. The units dropped in the last forward propagation (FP) are marked. Units selected for dropping in the current FP are then kept if they were marked in the last FP. We only mark units from the last forward propagation. We call this new technique Tabu Dropout. Tabu Dropout has no extra parameters compared with standard dropout and is computationally cheap. Experiments conducted on the MNIST and Fashion-MNIST datasets show that Tabu Dropout improves the performance of standard dropout.
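The abstract only sketches the tabu rule in words, so a minimal NumPy sketch of the masking logic is given below. The function name `tabu_dropout_mask`, the boolean tabu list, and the inverted-dropout scaling in the usage snippet are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def tabu_dropout_mask(shape, p, prev_dropped, rng):
    """Sample a dropout mask that avoids re-dropping units marked in the
    previous forward propagation (the tabu list).

    prev_dropped -- boolean array of units dropped in the last FP.
    Returns (keep_mask, dropped); `dropped` is the tabu list for the next FP.
    """
    # Standard dropout proposal: mark each unit for dropping with probability p.
    proposed_drop = rng.random(shape) < p
    # Tabu rule: a unit proposed for dropping is kept anyway if it was already
    # dropped in the previous FP, pushing successive iterations toward
    # different sub-architectures.
    dropped = proposed_drop & ~prev_dropped
    keep_mask = ~dropped
    return keep_mask, dropped

# Example usage inside a training loop (inverted-dropout scaling assumed):
rng = np.random.default_rng(0)
p = 0.5
prev_dropped = np.zeros((4, 8), dtype=bool)   # empty tabu list before the first FP
h = rng.standard_normal((4, 8))               # hidden-layer activations
keep, prev_dropped = tabu_dropout_mask(h.shape, p, prev_dropped, rng)
h = h * keep / (1.0 - p)
```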

Authors (5)
  1. Zongjie Ma (3 papers)
  2. Abdul Sattar (23 papers)
  3. Jun Zhou (370 papers)
  4. Qingliang Chen (10 papers)
  5. Kaile Su (8 papers)
Citations (6)
