Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 71 tok/s

Gemini 2.5 Pro 38 tok/s Pro

GPT-5 Medium 36 tok/s Pro

GPT-5 High 39 tok/s Pro

GPT-4o 110 tok/s Pro

Kimi K2 191 tok/s Pro

GPT OSS 120B 462 tok/s Pro

Claude Sonnet 4.5 36 tok/s Pro

2000 character limit reached

Entropy-Optimized Training Scheme

Updated 29 September 2025

Entropy-Optimized Training Scheme is a method that integrates information-theoretic entropy into training objectives or numerical updates for improved stability and physical consistency.
It employs elementwise entropy constraints and a dynamic limiting operator to selectively damp oscillations while preserving high-order accuracy in smooth regions.
An adaptive CFL condition is used to ensure local entropy conditions are met, balancing stability near discontinuities with optimal performance in smooth flows.

An entropy-optimized training scheme refers to any methodological paradigm that integrates entropy—or closely related information-theoretic quantities—directly into the core training objective, model update, or structural constraint. This class of approaches leverages entropy constraints or optimization to promote stability, accuracy, generalizability, compression, and robustness across a range of machine learning, signal processing, and computational science tasks. The following sections detail one representative instantiation of entropy-optimized training from "Entropy-Bounded Discontinuous Galerkin Scheme for Euler Equations" (Lv et al., 2014), which introduces entropy constraints for stabilizing high-order discontinuous Galerkin (DG) methods in computational fluid dynamics, and distills broader principles and implementation considerations of entropy-optimal schemes.

1. Elementwise Entropy Constraint Principle

The essence of the scheme is to enforce a discrete entropy bound at the element level of a DG discretization. The entropy at any quadrature point $x$ in element $\Omega_e$ is enforced to satisfy

$s(U_e^{(\Delta t)}(x)) \geq s_e^0(t)$

where $s(U)$ denotes the physical entropy; for compressible Euler equations, this can be

$s = \ln(p) - \gamma\ln(\rho) + \text{constant}$

The entropy lower bound $s_e^0(t)$ is defined locally as the minimal value taken among all in-element and inflow-boundary quadrature points:

$s_e^0(t) \equiv \min\{ s(U(y)): y \in \Omega_e \cup \partial\Omega_e^- \}$

Enforcement is practical and pointwise, diverging from the global constraints used in positivity-preserving alternatives.

2. Regularization and Limiting via Entropy-Minimum Principle

To stabilize particularly near shocks and discontinuities, a regularizing limiting operator $\mathcal{L}$ is introduced, adjusting the DG solution $U_e$ by blending it with its cell average $\bar{U}_e$ :

$U_e^\mathcal{L} = U_e + \varepsilon (\bar{U}_e - U_e)$

The parameter $\varepsilon$ is determined algebraically at each cell to ensure

$p((1-\varepsilon) U_e + \varepsilon \bar{U}_e) \geq \exp(s_e^0)[(1-\varepsilon)\rho^\gamma(U_e) + \varepsilon\rho^\gamma(\bar{U}_e)]$

Thus, $\varepsilon$ is nonzero only when necessary (i.e., when pointwise entropy is in danger of violating the bound), allowing for selective and targeted damping of oscillations. In regions where the DG solution already satisfies the entropy minimum, $\varepsilon \sim \mathcal{O}(h^p)$ is negligible and regularization remains inactive.

3. High-Order Accuracy Preservation and Recovery

A crucial property is that the entropy-bounded limiting does not degrade the convergence properties of the original high-order DG scheme in smooth regions. The modification induced by the limiter,

$U_e^\mathcal{L} - U_e = \mathcal{O}(h^p)$

remains subordinate to the nominal truncation error. The method therefore preserves the $(p+1)$ -th order accuracy for smooth flows. At discontinuities, the expected reversion to local first-order behavior occurs, but this is recognized as a necessary trade-off for stability. This dual-behavior ensures optimal accuracy elsewhere and robust regularization only where physically required.

4. Discrete Entropy-Minimum Principle and Analytical Guarantees

Application of the entropy-minimum principle is formalized by showing, for cell-averaged solutions (e.g., in the three-point finite volume case),

$s(\tilde{U}_e^{(\Delta t)}) \geq \min \left\{s(\tilde{U}_{e-1}), s(\tilde{U}_e), s(\tilde{U}_{e+1})\right\}$

via convex combinations and analysis using properties of the entropy function. The result is an analytically provable guarantee that, under an appropriate time-step (see Section 5), the update will not decrease cell-averaged entropy below physically admissible values. This guarantees nonlinear stability rooted in physical principles, in contrast to many post-hoc or heuristically chosen limiters.

5. CFL Constraint Specialized for Entropy Consistency

A critical CFL condition is derived to ensure that all auxiliary states remain within the entropy-consistent set. In one dimension,

$\frac{\Delta t \, \lambda}{h} \leq \frac{1}{2} \min\{\theta_l, \theta_r\}$

where $\lambda$ is a bound on the maximum wave speed, and $\theta_l,\theta_r$ are coefficients from cell-averaging via quadrature. In higher dimensions, the restriction becomes

$\frac{\Delta t \, \lambda}{L_e} \leq \frac{1}{2} \cdot \mathrm{CFL}^{EB}$

with $L_e$ a characteristic element length, and $\mathrm{CFL}^{EB}$ determined via analytical or tabulated geometric analysis for the reference element. This constraint is both necessary for the entropy-bounded update and critical for stable explicit time integration.

6. Numerical Behavior and Empirical Properties

A series of canonical tests—including smooth periodic advection, moving shocks, flow over a cylinder, double Mach reflection, and three-dimensional sphere flows—demonstrate the efficacy:

Non-physical oscillations near discontinuities are eliminated without introducing excessive dissipation.
Only a small subset of elements (adjacent to shocks) activate the entropy-bounding regularization.
The local entropy-bounding parameter $\varepsilon$ can serve as a highly sensitive indicator for adaptive mesh refinement.
High-order accuracy is empirically maintained in smooth regions; the entropy limiter does not interfere with nonlinear convergence properties when not strictly needed.

7. Implementation Strategy and Computational Considerations

The algorithmic procedure is practical and modular:

Preprocessing calculates geometry- and quadrature-dependent coefficients and tabulates/CFL numbers for efficient runtime selection.
At each explicit Runge–Kutta stage:
- Update the DG weak form to advance cell averages.
- Calculate local entropy minima at each quadrature and inflow boundary point.
- Apply the limiting operator $\mathcal{L}$ with local determination of $\varepsilon$ , using
$\varepsilon = \frac{\tau}{\tau - [p(\bar{U}_e) - \exp(s_e^0) \rho^\gamma(\bar{U}_e)]}$

where $\tau$ represents the minimum “excess” pressure.
Re-assign the time step if any element demands a tighter CFL value.
This pipeline is highly amenable to vectorized and distributed computing, as all operations are local to elements until flux computation.

Component	Role	Notes
Entropy constraint	Elementwise, at quadrature points	$s(U(x)) \geq s_e^0$ for all $x \in \Omega_e$
Limiting operator $\mathcal{L}$	Regularizes solution to cell average	Selective: inactive wherever constraint already satisfied
CFL criterion	Enforces entropy consistency	Derived for 1D and arbitrary elements
Applicability	Arbitrary high-order, curved, multidimensional meshes	Simple to implement and extend

Summary and Broader Implications

The entropy-bounded DG scheme is an explicit realization of entropy-optimized training in computational PDE: it enforces physical admissibility via local entropy constraints, stabilizing high-order solvers in the presence of discontinuity while preserving formal accuracy in smooth zones. The use of elementwise entropy bounds, analytically justified limiting, and an optimal CFL restriction provides a blueprint for entropy optimization in broader numerical and machine learning settings. The approach generalizes across mesh types and dimensionalities and involves implementation steps—local constraint evaluation, algebraic limiters, CFL adjustment—that are computationally tractable and modular. The more general lesson: entropy-optimization can serve as both a stabilizing principle and a foundation for robust, physically consistent learning and computation in high-dimensional settings.

PDF Markdown Chat (Pro)

References (1)

Entropy-Bounded Discontinuous Galerkin Scheme for Euler Equations (2014)

Follow Topic

Get notified by email when new papers are published related to Entropy-Optimized Training Scheme.