Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 71 tok/s
Gemini 2.5 Pro 38 tok/s Pro
GPT-5 Medium 36 tok/s Pro
GPT-5 High 39 tok/s Pro
GPT-4o 110 tok/s Pro
Kimi K2 191 tok/s Pro
GPT OSS 120B 462 tok/s Pro
Claude Sonnet 4.5 36 tok/s Pro
2000 character limit reached

Entropy-Optimized Training Scheme

Updated 29 September 2025
  • Entropy-Optimized Training Scheme is a method that integrates information-theoretic entropy into training objectives or numerical updates for improved stability and physical consistency.
  • It employs elementwise entropy constraints and a dynamic limiting operator to selectively damp oscillations while preserving high-order accuracy in smooth regions.
  • An adaptive CFL condition is used to ensure local entropy conditions are met, balancing stability near discontinuities with optimal performance in smooth flows.

An entropy-optimized training scheme refers to any methodological paradigm that integrates entropy—or closely related information-theoretic quantities—directly into the core training objective, model update, or structural constraint. This class of approaches leverages entropy constraints or optimization to promote stability, accuracy, generalizability, compression, and robustness across a range of machine learning, signal processing, and computational science tasks. The following sections detail one representative instantiation of entropy-optimized training from "Entropy-Bounded Discontinuous Galerkin Scheme for Euler Equations" (Lv et al., 2014), which introduces entropy constraints for stabilizing high-order discontinuous Galerkin (DG) methods in computational fluid dynamics, and distills broader principles and implementation considerations of entropy-optimal schemes.

1. Elementwise Entropy Constraint Principle

The essence of the scheme is to enforce a discrete entropy bound at the element level of a DG discretization. The entropy at any quadrature point xx in element Ωe\Omega_e is enforced to satisfy

s(Ue(Δt)(x))se0(t)s(U_e^{(\Delta t)}(x)) \geq s_e^0(t)

where s(U)s(U) denotes the physical entropy; for compressible Euler equations, this can be

s=ln(p)γln(ρ)+constants = \ln(p) - \gamma\ln(\rho) + \text{constant}

The entropy lower bound se0(t)s_e^0(t) is defined locally as the minimal value taken among all in-element and inflow-boundary quadrature points:

se0(t)min{s(U(y)):yΩeΩe}s_e^0(t) \equiv \min\{ s(U(y)): y \in \Omega_e \cup \partial\Omega_e^- \}

Enforcement is practical and pointwise, diverging from the global constraints used in positivity-preserving alternatives.

2. Regularization and Limiting via Entropy-Minimum Principle

To stabilize particularly near shocks and discontinuities, a regularizing limiting operator L\mathcal{L} is introduced, adjusting the DG solution UeU_e by blending it with its cell average Uˉe\bar{U}_e:

UeL=Ue+ε(UˉeUe)U_e^\mathcal{L} = U_e + \varepsilon (\bar{U}_e - U_e)

The parameter ε\varepsilon is determined algebraically at each cell to ensure

p((1ε)Ue+εUˉe)exp(se0)[(1ε)ργ(Ue)+εργ(Uˉe)]p((1-\varepsilon) U_e + \varepsilon \bar{U}_e) \geq \exp(s_e^0)[(1-\varepsilon)\rho^\gamma(U_e) + \varepsilon\rho^\gamma(\bar{U}_e)]

Thus, ε\varepsilon is nonzero only when necessary (i.e., when pointwise entropy is in danger of violating the bound), allowing for selective and targeted damping of oscillations. In regions where the DG solution already satisfies the entropy minimum, εO(hp)\varepsilon \sim \mathcal{O}(h^p) is negligible and regularization remains inactive.

3. High-Order Accuracy Preservation and Recovery

A crucial property is that the entropy-bounded limiting does not degrade the convergence properties of the original high-order DG scheme in smooth regions. The modification induced by the limiter,

UeLUe=O(hp)U_e^\mathcal{L} - U_e = \mathcal{O}(h^p)

remains subordinate to the nominal truncation error. The method therefore preserves the (p+1)(p+1)-th order accuracy for smooth flows. At discontinuities, the expected reversion to local first-order behavior occurs, but this is recognized as a necessary trade-off for stability. This dual-behavior ensures optimal accuracy elsewhere and robust regularization only where physically required.

4. Discrete Entropy-Minimum Principle and Analytical Guarantees

Application of the entropy-minimum principle is formalized by showing, for cell-averaged solutions (e.g., in the three-point finite volume case),

s(U~e(Δt))min{s(U~e1),s(U~e),s(U~e+1)}s(\tilde{U}_e^{(\Delta t)}) \geq \min \left\{s(\tilde{U}_{e-1}), s(\tilde{U}_e), s(\tilde{U}_{e+1})\right\}

via convex combinations and analysis using properties of the entropy function. The result is an analytically provable guarantee that, under an appropriate time-step (see Section 5), the update will not decrease cell-averaged entropy below physically admissible values. This guarantees nonlinear stability rooted in physical principles, in contrast to many post-hoc or heuristically chosen limiters.

5. CFL Constraint Specialized for Entropy Consistency

A critical CFL condition is derived to ensure that all auxiliary states remain within the entropy-consistent set. In one dimension,

Δtλh12min{θl,θr}\frac{\Delta t \, \lambda}{h} \leq \frac{1}{2} \min\{\theta_l, \theta_r\}

where λ\lambda is a bound on the maximum wave speed, and θl,θr\theta_l,\theta_r are coefficients from cell-averaging via quadrature. In higher dimensions, the restriction becomes

ΔtλLe12CFLEB\frac{\Delta t \, \lambda}{L_e} \leq \frac{1}{2} \cdot \mathrm{CFL}^{EB}

with LeL_e a characteristic element length, and CFLEB\mathrm{CFL}^{EB} determined via analytical or tabulated geometric analysis for the reference element. This constraint is both necessary for the entropy-bounded update and critical for stable explicit time integration.

6. Numerical Behavior and Empirical Properties

A series of canonical tests—including smooth periodic advection, moving shocks, flow over a cylinder, double Mach reflection, and three-dimensional sphere flows—demonstrate the efficacy:

  • Non-physical oscillations near discontinuities are eliminated without introducing excessive dissipation.
  • Only a small subset of elements (adjacent to shocks) activate the entropy-bounding regularization.
  • The local entropy-bounding parameter ε\varepsilon can serve as a highly sensitive indicator for adaptive mesh refinement.
  • High-order accuracy is empirically maintained in smooth regions; the entropy limiter does not interfere with nonlinear convergence properties when not strictly needed.

7. Implementation Strategy and Computational Considerations

The algorithmic procedure is practical and modular:

  • Preprocessing calculates geometry- and quadrature-dependent coefficients and tabulates/CFL numbers for efficient runtime selection.
  • At each explicit Runge–Kutta stage:

    • Update the DG weak form to advance cell averages.
    • Calculate local entropy minima at each quadrature and inflow boundary point.
    • Apply the limiting operator L\mathcal{L} with local determination of ε\varepsilon, using

    ε=ττ[p(Uˉe)exp(se0)ργ(Uˉe)]\varepsilon = \frac{\tau}{\tau - [p(\bar{U}_e) - \exp(s_e^0) \rho^\gamma(\bar{U}_e)]}

    where τ\tau represents the minimum “excess” pressure.

  • Re-assign the time step if any element demands a tighter CFL value.
  • This pipeline is highly amenable to vectorized and distributed computing, as all operations are local to elements until flux computation.
Component Role Notes
Entropy constraint Elementwise, at quadrature points s(U(x))se0s(U(x)) \geq s_e^0 for all xΩex \in \Omega_e
Limiting operator L\mathcal{L} Regularizes solution to cell average Selective: inactive wherever constraint already satisfied
CFL criterion Enforces entropy consistency Derived for 1D and arbitrary elements
Applicability Arbitrary high-order, curved, multidimensional meshes Simple to implement and extend

Summary and Broader Implications

The entropy-bounded DG scheme is an explicit realization of entropy-optimized training in computational PDE: it enforces physical admissibility via local entropy constraints, stabilizing high-order solvers in the presence of discontinuity while preserving formal accuracy in smooth zones. The use of elementwise entropy bounds, analytically justified limiting, and an optimal CFL restriction provides a blueprint for entropy optimization in broader numerical and machine learning settings. The approach generalizes across mesh types and dimensionalities and involves implementation steps—local constraint evaluation, algebraic limiters, CFL adjustment—that are computationally tractable and modular. The more general lesson: entropy-optimization can serve as both a stabilizing principle and a foundation for robust, physically consistent learning and computation in high-dimensional settings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Entropy-Optimized Training Scheme.