We built the substrate the audit said did not exist, and checked that a network can hand an energy model couplings that are differentiable all the way through — without ever touching the operational hurdle that still blocks the program.
The question
The substrate audit returned a clean zero: of eight published thermodynamic-ML substrates, none lets a network produce differentiable EBM couplings. So the scan's §3 retrofit row recommended building one — a hypernetwork that emits RBM coupling matrices from a teacher-identifying code . This run asks three preregistered questions, no more: can the G1b gate be CONSTRUCTED (couplings NN-produced, the active target differentiable through ); is the R3 "crossable-multimodality" sweet spot non-empty; and is R4 (the verify-or-HALT diagnostic battery) verifiable. It explicitly does not touch G2 — the operational factorization tier stays [conjectured].
The setup
Pre-commitment was frozen (bb342b0, 2026-06-09) before any implementation — see experiments/exp8-hypernet-retrofit/ for the freeze, runner, and per-cell results.json. The run was 447 s wall on a laptop CPU (cap 3 h; midpoint trigger projected 419 s → PROCEED), JAX 0.9.1 in x64. The grid is hidden units split-axis positions 4 teacher codes — 192 cells, with a verdict basis of 72 ( verdict checkpoints).
The load-bearing design choice was the derivative-agreement test for P1(c). Rather than autodiff through an eigendecomposition, the registered Change-4 route builds a JAX mirror of the selection-free via a deflated resolvent — smooth wherever , no eigh. We then compare autodiff against central finite differences along 5 frozen directions.
The result
P1 — G1b CONSTRUCTED (construction/formula-level, never "empirically validated"):
- (a) finite at all 48 entries; .
- (b) nontrivial -dependence: at every final checkpoint (threshold ).
- (c) at , anchor = first crossing (step 66): the value-agreement gate (gate ); autodiff-vs-FD best- relative errors , all → PASS.
The no-eigh route was load-bearing, observed not assumed: the four quotient kernels at the anchor had min adjacent eigenvalue gaps to — numerically degenerate spectra that would have made eigh-autodiff ill-conditioned exactly here. The pre-freeze skeptic's MAJOR was necessary, not precautionary.
P2 — R3 non-empty (empirically measured): 11/72 verdict-basis cells have . Both stability riders are clean — identical across the sweep (zero verdict flips), and the multiway-ordering rider is vacuously stable (zero saddles in all 192 cells; the machinery armed but never fired). Every qualifying event sits from its nearest filter threshold. Qualifying-cell spans to ; from 0.51 to 7.08. Descriptive prevalence over all 192 cells: 19 cells with , per-, all at .
P3 — R4 verifiable (verify-or-HALT, no HALT fired): max A2 detailed-balance residual (); the global Cheeger bound holds on every merge-tree cut (min margin ); zero Sym fallbacks — the quotient was legitimate in all cells.
P4 — split-axis response (descriptive): the capacity-allocation tension is real. Measured everywhere (the rank- manifold tangent dimension), so the observable span contracts monotonically along . At () the teachers are unmatchable — stalls at a genuine stationary point (per- KL up to 2.63), everywhere at .
Scope and caveats
This decides G1b/R3/R4 only, scoped to this family/architecture/teacher set (, one seed table, 4 teachers at ). No tag moves. The code is a teacher-identifying frozen code — a construction witness, not strong encoder evidence. P1(c) certifies the selection-free form; the hard--selection non-smoothness (Risk 2) stays a theorem-side boundary, unresolved here. at everywhere. And critically: nothing here touches A6/A7 at scale — G2 remains the binding empirical hurdle, the row's G2/A6/A7 sub-question stays NOT-DETERMINED-WITHOUT-RUN, and [validated] requires the full A1–A8 + plateau + F4 regime.
What this feeds: G1b is now CONSTRUCTED and R3 is shown non-empty, so the differentiable substrate exists and contains crossable multimodal structure — clearing the way to attack G2 (operational factorization at scale), the one tier this run deliberately left [conjectured].