The thermodynamic trainability quantity dies the way a quantum loss landscape goes barren — same shape, different machine — and saying exactly how far that analogy reaches is the whole point of this entry.
This is a synthesis entry, not a new result. It assembles two artifacts — synthesis/trainability-theorem.md (the corollary block) and synthesis/parent-bridges.md (the cross-parent map) — into one statement: the Mixing–Expressivity Tradeoff (MET) is a product of independently-fatal factors, and that product is the EBM cousin of the quantum barren plateau. Here we keep the claim-status discipline visible.
The question
Given the factorization — gradient SNR-squared as effective sample size times signal-to-slow-fluctuation ratio — when does ? And: is "the gradient landscape went flat" the same failure that quantum machine learning calls a barren plateau, or only a rhyme?
The setup / method
We work from one reverse-process EBM layer of a Denoising Thermodynamic Model, sampled by a reversible Gibbs kernel . The operational object is — true-gradient power over estimator MSE. The corollary reads the factorization as a Ragone-shaped expression: (the plateau) iff any factor collapses.
The result — three independently-fatal factors
The MET, stated as a theorem-shape, is the product of:
- exponentially — spectral-gap collapse, . This is the MET named as a mechanism: a monolithic EBM expressive enough to fit the data becomes exponentially slow to sample (the DTM paper's core motivation, §I–II + App. B). Quantum analogue: exponential blow-up (reachability/expressivity), where is the QML dynamical Lie algebra — not the gradient .
- — the gradient signal is swamped by the equilibrium fluctuations of along the slow modes; the signal is not where the sampler can resolve it. Quantum analogue: the vanishing g-purity product .
- Budget starvation — too few total Gibbs steps. Small burn-in leaves an bias (); small window leaves too small. Via the DTM energy model (App. E), this total-steps axis is literally the energy-per-sample axis. No quantum analogue.
Each factor is independently sufficient to kill ; all three are needed for trainability. The corollary's tag is conjectured.
After exp1+exp2 (experiments/exp1-exact-diag/, experiments/exp2-thrml-smoke/) the first two factors are read in their observable-projected form: factor 1 with over the gradient-relevant mode set , factor 2 with . The three-factor shape is unchanged; the quantities are the corrected, projected ones.
The bridge — what transfers, what does not
The cross-parent table maps factor-by-factor:
| Quantum (QML) | Classical EBM/DTM (here) | |---|---| | g-purity product | — signal-to-slow-fluctuation | | — reachability | spectral gap | | barren plateau | thermodynamic plateau |
So is the EBM analogue of Ragone's loss-variance .
What DOES transfer is exactly two things: the phenomenon (a gradient SNR that survives or collapses exponentially with system size) and the theorem shape (a product of independently-fatal factors).
Scope & caveats — the sharpest contrast
It does NOT literally transfer. There is no Lie algebra in an EBM: , , and the DLA decomposition are quantum-circuit objects with no EBM counterpart. Do not write EBM quantities as if they were g-purities.
The mechanism is different. The quantum barren plateau is signal extinction: the true gradient variance itself vanishes, . The thermodynamic plateau is an estimation failure: collapses because the estimator MSE swamps a still-nonzero — the recoverability of dies, not . Same phenomenology (SNR collapse), different machine. The Q-program's honesty hinges on keeping that distinction visible.
This entry asserts no tag flip. The corollary is conjectured; the bridge is explicitly a shape analogy, not a literal transfer, per parent-bridges.md. Nothing here is proven-here or validated.
What this feeds: the observable-projected predictor — the corrected, differentiable-in- target that turns this shape into something computable without training to convergence.
Sources
- Jelinčič et al. 2025, Denoising Thermodynamic Models, arXiv:2510.23972 (Eq. 14, §IV, Fig. 5b, App. B/E/G).
- Ragone et al. 2024, the loss-variance factorization (DLA dimension · state purity · observable purity) the corollary is shaped after.