Phase Memos · Donkey on the Edge

Black Hole Interior Research: Thread Spin-Up

Phases 1-9 complete

Complete context for continuing this research in a fresh thread. Read this end-to-end before starting work. No context from the previous thread is needed beyond what’s in this document and the referenced files.

Part 1: The big question

What happens when two people fall into a black hole together?

This is the physical intuition pump for the entire research program. The sharper scientific question, in current terminology:

Given two observers who both access the interior of an evaporating black hole, how much can their descriptions of the interior disagree, and what controls the size of that disagreement?

This is called observer complementarity in the current literature. It’s the quantum-gravitational refinement of Susskind’s original 1993 black hole complementarity idea, now made mathematically precise via non-isometric holographic codes plus observer-inclusion rules.

The research program aims to produce a Tier-2 JHEP-quality paper on quantitative bounds for observer disagreement. Target: find an explicit function $f$ such that

$|S_A(\psi) - S_B(\psi)| \leq f(d_\text{fund}, d_{\text{Ob},A}, d_{\text{Ob},B}, \text{complexity}(\psi))$

where $S_A, S_B$ are the observer-dependent entropies of the same reference state $\psi$ as seen by two observers with bulk Hilbert-space factors of dimensions $d_{\text{Ob},A}, d_{\text{Ob},B}$ , embedded in an AEHPV non-isometric code with fundamental dimension $d_\text{fund}$ (sometimes called $d_B$ in the AEHPV paper, where $B$ denotes “black hole”). The precise definitions of all objects are in Part 10.

Three possible outcomes, all publishable:

Clean bound: $f$ controlled by known QI inequalities. Supports EGH framework.
Wild disagreement: $f$ unbounded. Tension with EGH framework.
Regime-dependent: $f$ bounded in some regimes, unbounded in others. Classify the regimes.

Part 2: What the field already did

The observer complementarity synthesis exists. It was published in July 2025 by Engelhardt-Gesteau-Harlow (2507.06046). Since then, monthly updates from Harlow’s group at MIT, Akers and DeWolfe’s group at Colorado, and independent critiques (Higginbotham 2512.17993, Liu 2509.14327 and 2512.13807, Kudler-Flam-Witten 2510.06376) have been refining the picture.

We are not first into the frontier. We are trying to contribute to a fast-moving, crowded frontier. Scoop risk is real. The research must move fast (weeks, not years).

Key papers, grouped by logical role

Foundation: non-isometric codes

AEHPV 2207.06536 / JHEP 2024:155 (Akers-Engelhardt-Harlow-Penington-Vardhan), the foundational paper. $V: \mathcal{H}_\ell \otimes \mathcal{H}_r \to \mathcal{H}_B$ with $d_{\mathcal{H}_\ell \otimes \mathcal{H}_r} > d_{\mathcal{H}_B}$ . Null states, complexity-protected non-isometry, Page curve.
DeWolfe-Higginbotham 2304.12345, backwards-forwards map for time-dependent interactions. Generalizes AEHPV.
Bueller-DeWolfe-Higginbotham 2407.01666, hyperbolic tensor network version of AEHPV with locality.

Observer algebras

CLPW 2206.10780, Chandrasekaran-Longo-Penington-Witten. Type III_1 → Type II_1 via observer in de Sitter.
Witten 2308.03663, background-independent algebra along worldline.
De Vuyst-Eccles-Höhn-Kirklin 2405.00114, 2412.15502, observer-dependent entropy is a QRF choice.

Closed universe problem

HUZ 2501.02359 (Harlow-Usatyuk-Zhao), observer in closed universe gives effective Hilbert space dim ~ $e^{S_\text{Ob}}$ . Observer is classical, cloned to external reference in pointer basis.
AAIL 2501.02632 (Abdalla-Antonini-Iliesiu-Levine), independent alternative prescription; gives different dimension than HUZ.
Akers-Bueller-DeWolfe-Higginbotham-Reinking-Rodriguez 2503.09681, the “Colorado rule”. Observer already in fundamental; remove the part of $V$ acting on observer’s patch. Compares to HUZ concretely.

The current synthesis

EGH 2507.06046 (Engelhardt-Gesteau-Harlow), the central paper. Applies HUZ as a general principle. Mathematical formulation of observer complementarity. Handles evaporating BH throughout history + Antonini-Sasieta-Swingle-Rath (AS2R) configuration.
Higginbotham 2512.17993 / JHEP 2026:183, critique of EGH. EGH’s observables are suboptimal; refined operators give different conclusions.
Kudler-Flam-Witten 2510.06376, “Emergent Mixed States.” Parallel approach: large- $N$ CFT states converge to mixed states, with algebras having nontrivial commutants interpretable as baby universe operators. May or may not be equivalent to observer complementarity.

Parallel threads

Akers-Lucas-Vikram 2506.18975, explicit reconstruction map in JT gravity via action-angle variables. Wormhole length dynamics.
Chen 2505.15892 / JHEP 2025:139, observers and abstract sources for path integrals.
Engelhardt-Gesteau 2504.14586, Gesteau 2509.14338, no-go theorems for semiclassical baby universes.
Liu 2509.14327, 2512.13807, filtered CFTs, closed universes at large $N$ .

The seven research gaps

Identified in the previous thread’s literature sweep. Gap 5 is the primary target; others are potential pivots.

HUZ vs AAIL reconciliation, two rules give different Hilbert space dimensions. Which is right? Or when does each apply? (Tractable; Harlow actively working on this.)
Optimal observer-operators, Higginbotham showed EGH’s specific SWAP observables are suboptimal. What’s optimal? (Tractable via SDP.)
KFW vs EGH, mixed-state emergence vs observer complementarity. Equivalent? When do they disagree? (Medium tractable; groups haven’t converged.)
Dynamics / time-dependence, EGH claims to work “throughout evaporation” but dynamics is not worked out. (Medium-high tractable.)
Multi-observer interior reconstruction, the primary target. Bounds on observer disagreement. (Tractable; not obviously being done.)
AKV JT + observers, add observers to Akers-Lucas-Vikram reconstruction. (Medium-low; needs JT expertise.)
When non-isometry is visible, decoding complexity for different observer choices. (High tractable.)

Part 3: What the previous thread produced

Working code (all in `/home/claude/bh_research/` unless noted)

Canonical, use going forward:

bh_lab_backend.py, the unified numerical backend. Haar ensembles, entropies, SDP via cvxpy, mpmath for precision, statsmodels for fits. Every research script should import from this. Self-tested. This is the canonical entry point.
vn_algebra/vnalgebra.py, commutants, algebra closure, cyclic/separating vectors, Tomita-Takesaki, modular flow, Type I classification. Finite-dim only. Fully tested.
vn_algebra/crossed_product.py, $M \rtimes_\alpha \mathbb{Z}_n$ construction with covariance verified.
vn_algebra/trace_on_crossed.py, canonical trace on the crossed product. Cyclic, positive, entropies computable.

Scratch / reference only (do NOT cite numbers from these):

step1_viability.py, confirmed observer-dependent entropy is computable. Used separable Hamiltonian; specific numbers are meaningless.
step1_diagnostic.py, discovered that coupling matters; scan of $J$ was a single-draw, not Haar-averaged.
step2a_huz_rule.py, first HUZ implementation. Used pre-fix AEHPV normalization. Dimension-scaling result is valid (integer-valued, normalization-independent); specific entropy values are not.
step2b_colorado_rule.py, same status. Dimension scaling is valid; specific entropy values shift under correct normalization.

Key conceptual results established

Result 1: Observer coupling is the heart of the physics.
Without a coupling term between observer and system, observers trivially agree. The interesting question is what coupling is physical, i.e., matches the CLPW/HUZ/EGH prescriptions.

Result 2: HUZ cloning rule has clean finite-dim implementation.
$|\text{ob}, m, 0\rangle_R \to |\text{ob}, m, \text{ob}\rangle_R$ in pointer basis. Then apply $V \otimes I_R$ . Effective accessible dim = $d_\text{Ob} \cdot d_\text{fund}$ , verified across 9 test cases.

Result 3: Colorado rule is different.
Observer is already in fundamental; $V$ doesn’t act on observer’s patch. Implements as $I_\text{Ob} \otimes V_M$ where $V_M$ acts on matter only. Effective accessible dim = $d_\text{Ob} \cdot d_\text{fund,M}$ .

Result 4: HUZ and Colorado agree on product states, disagree on observer-superposed states.
The rules differ precisely when the observer is itself in a quantum superposition. For $|\text{ob}\rangle_\text{Ob} \otimes |\psi\rangle_M$ , both give $S = 0$ . For $\sum_i |i\rangle_\text{Ob} |i\rangle_M / \sqrt{d}$ , they differ substantially.

Result 5: There’s a non-monotonic disagreement pattern in coupling strength.
At $J \in \{0.1, 0.5, 1.0, 2.0\}$ : $|S_A - S_B| \in \{0.015, 0.077, 0.050, 0.059\}$ . Whether this is a real feature or an artifact of a single random $X_\text{sys}$ is unknown. Needs Haar averaging to decide.

Result 6 (important caveat on the step1 scripts): The step1 scripts did NOT implement the proper two-observer setup. They used a single observer factor $\mathcal{H}_\text{Ob}$ with two different clock states $|c_A\rangle, |c_B\rangle$ drawn from it. This is the weakest notion of “two observers” and does not correspond to the Gap 5 paper setup, which uses two separate observer factors $\mathcal{H}_{\text{Ob},A} \otimes \mathcal{H}_{\text{Ob},B}$ with independent cloning. The step1 results are therefore not directly applicable to the paper; they are preliminary viability tests only. The proper two-observer HUZ setup is the Phase 4 deliverable.

The normalization bug that was caught

The old AEHPV convention was $V = \sqrt{d_\text{fund}/d_\text{eff}} \cdot P U$ which gives $VV^\dagger = (d_\text{fund}/d_\text{eff}) I$ rather than $VV^\dagger = I$ . The backend now uses the corrected convention: $V$ = first $d_\text{fund}$ rows of a Haar unitary, giving $VV^\dagger = I$ exactly. All future scripts use SeededRNG.aehpv_map() from the backend.

Part 4: What to install

The tools below are the complete research stack. Everything else the previous thread considered (Einstein Toolkit, LALSuite, Cadabra2, EinsteinPy, LIGO stack, cosmology tools, particle physics stack, etc.) is not relevant to this project. Resist the temptation to install more until a specific concrete need arises.

Already loaded in the Claude sandbox environment (no install needed)

numpy 2.4+, arrays, linear algebra, FFT
scipy 1.17+, linear algebra, stats, optimization, sparse matrices
scipy.stats.unitary_group, Haar unitary sampling (canonical implementation)
sympy 1.14+, symbolic math
mpmath 1.3+, arbitrary precision arithmetic
pandas 3.0+, tabular data
matplotlib 3.10+, plotting
seaborn 0.13+, statistical visualization
networkx 3.6+, graph theory (for tensor networks if needed later)
scikit-learn, classical ML (not central, but available)

Install at thread start (run once):

pip install --break-system-packages cvxpy statsmodels

cvxpy, convex optimization for SDP-based bounds. Used for Helstrom distinguishability, optimal observer-disagreement bounds, minimum-error discrimination.
statsmodels, proper regression with confidence intervals for scaling-law fits. Used for the main paper figure.

Install conditionally (if/when needed, not at thread start):

cadabra2, ONLY IF we start deriving analytic Haar-averaged quantities symbolically. The Weingarten calculations for Haar unitary moments. Trigger: when we have a specific analytic expression we need to verify.
qutip, ONLY IF we extend to open-system dynamics (Lindbladian evaporation). Trigger: when we want real-time evaporation dynamics with a radiation bath.
quimb or tenpy, ONLY IF we extend to the Bueller-DeWolfe-Higginbotham tensor network version. Trigger: when the bulk locality structure matters for a result.
stim, ONLY IF we test complexity-protected non-isometry via stabilizer circuits. Trigger: when we need actual decoding-complexity benchmarks.

Default assumption: don’t install anything else. Installations should be triggered by an identified computational gap, not by browsing tool lists.

Network configuration

The Claude sandbox has network access to pip, GitHub, PyPI, arxiv.org (with rate limits), and Springer/APS journal domains. No credentials needed for any tool installations above.

Part 5: The research plan

Phase-gated approach

Each phase has a gating condition. Don’t proceed to the next phase until the gate is passed.

Phase 0: Spin up the new thread (this document’s job)

Gate: backend self-test passes; key papers listed above have been skimmed; paper draft has been reviewed.

Phase 1: Port canonical HUZ + Colorado implementations to backend

What: rewrite step2a_huz_rule.py and step2b_colorado_rule.py as a single clean script using bh_lab_backend.py. Verify the dimension-scaling result still holds (it should; it’s normalization-independent). Record the corrected entropy numbers as the canonical values.

Deliverable: phase1_rules_canonical.py producing a table of HUZ vs Colorado entropies and dimensions at multiple $d_\text{fund}$ and $d_\text{Ob}$ .

Gate: all dimension predictions match exactly; Haar-averaged entropies are reproducible with given seeds; backend self-test still passes.

Phase 2: HUZ verification beyond dimension counting

What: verify a second HUZ prediction beyond the effective-dim count. Specifically: HUZ 2501.02359 claims errors in the observer-dependent description are exponentially small in $S_\text{Ob}$ . Test this numerically by computing the error between observer’s predicted entropy and the true reduced-state entropy as a function of $d_\text{Ob}$ .

Deliverable: phase2_huz_verification.py showing exponential error suppression with $d_\text{Ob}$ .

Gate: scaling fit of error vs $d_\text{Ob}$ gives exponent close to $-1$ (linear) or shows a specific alternative pattern we can interpret.

Phase 3: Reproduce EGH’s SWAP test result

What: implement the AS2R (Antonini-Sasieta-Swingle-Rath) configuration, closed universe entangled with pair of AdS universes. Compute EGH’s SWAP test expectation value using HUZ-rule observer. Reproduce their published number.

Deliverable: phase3_egh_reproduction.py matching a specific figure or number from 2507.06046.

Gate: our SWAP test value matches EGH’s to within a specified tolerance (e.g., 1% for numerical comparisons, or qualitative agreement on sign/scaling for analytic claims).

If this phase fails: something about our implementation doesn’t match EGH’s. Do not proceed to Phase 4 until resolved. Options: debug our implementation, or flag genuine disagreement with EGH as a potential paper result in itself.

Phase 4: Two-observer HUZ setup

What: extend HUZ to the genuine two-observer setup specified in Part 10.2: two separate observer factors $\mathcal{H}_{\text{Ob},A} \otimes \mathcal{H}_{\text{Ob},B}$ in the bulk, each cloned independently to its own reference $R_A, R_B$ . Apply $V_\text{HUZ,AB}$ as defined in Part 10.2. Compute $\rho_{R_A}, \rho_{R_B}$ by the partial traces specified in Part 10.3, and from them $S_A, S_B$ and the disagreement $\Delta S = |S_A - S_B|$ .

Note: this supersedes the step1 scripts’ approach, which used a single observer factor with different clock states. The proper setup requires two observer factors.

Deliverable: phase4_two_observers.py producing observer-dependent entropies for two-observer HUZ with all definitions matching Part 10.

Gate: (a) with identical observers ( $d_{\text{Ob},A} = d_{\text{Ob},B}$ and symmetric state), $S_A = S_B$ to numerical precision; (b) with asymmetric observer dimensions, disagreement is nonzero; (c) disagreement is bounded by $\log \min(d_{\text{Ob},A}, d_{\text{Ob},B})$ trivially.

Phase 5: The money plot, scaling of disagreement with $d_B$

What: the Figure 3 from the paper draft. Scan $d_B \in \{4, 8, 16, 32, 64\}$ at fixed ratios $d_\text{eff}/d_B$ and $d_{C_A,B}/d_B$ , Haar-averaged. Plot $\langle |S_A - S_B| \rangle$ with error bars. Fit to scaling law.

Deliverable: phase5_scaling_plot.py producing Figure 3 and a fitted scaling exponent.

Gate: scaling fit has $R^2 > 0.9$ and a well-defined exponent. If scaling is messy, pivot to “identify which observables scale cleanly.”

Phase 6: State-class dependence

What: repeat Phase 5 with three state classes: Haar-random, thermofield double (TFD), low-complexity circuit states. Produces Figure 4 of the paper.

Deliverable: phase6_state_classes.py.

Gate: at least one state class shows a clean scaling; all three ideally.

Phase 7: Analytic bound

What: derive (or verify via SDP) an explicit upper bound on $\langle |S_A - S_B| \rangle$ . Either analytic using Fannes-Audenaert + measure concentration, or numerical using cvxpy SDP.

Deliverable: phase7_bound.py producing a function $f(d_B, d_{C_A}, d_{C_B})$ that the numerics never exceeds.

Gate: SDP bound matches or exceeds the Haar-averaged numerics from Phase 5.

Phase 8: Connect to EGH framework

What: apply our bound to EGH’s AS2R setup from Phase 3. State what our bound predicts for EGH’s observables. Compare with EGH’s qualitative claims.

Deliverable: phase8_egh_connection.py plus a draft paper section discussing the comparison.

Phase 9: Paper writing

What: actually write the paper. Sections, figures, references. Target: JHEP-length (~30-40 pages plus appendices).

Pivots built into the plan

If Phase 2 fails: HUZ has a deeper subtlety we missed. Pivot to “identifying the subtlety” as a publishable clarification paper.

If Phase 3 fails: we genuinely disagree with EGH. Pivot to “critical examination of EGH’s claimed SWAP results” as a Higginbotham-style critique.

If Phase 3 reveals that EGH use a different entropy convention than ours: adopt their convention going forward, and add a clarifying appendix to the paper showing the translation between conventions. This is a likely outcome and should not be treated as a failure, it’s a consistency check that is expected to produce either a match or an informative mismatch.

If Phase 5 shows no clean scaling: Gap 5 may not have a universal answer. Pivot to Gap 1 (HUZ vs AAIL discriminator), which uses most of the same infrastructure.

If Phase 5 shows unbounded disagreement: the result is Outcome 2 (“wild complementarity”). This is a bigger paper, potentially requiring generalization of the EGH framework.

If at any phase we discover the two-observer HUZ setup has ambiguities (e.g., which observer gets cloned first; whether there’s cross-talk between the two Clone operations): flag the ambiguity explicitly in the paper, test both conventions numerically, and present the robustness (or sensitivity) of results to the choice.

Tempo

AI-speed: phases 1-5 should be days, not weeks. The previous thread’s pace was too slow because of over-planning. Default is: state the phase goal, write the code, check the gate, move on. Documentation and write-up happen in parallel, not sequentially.

Scoop risk: high. If a preprint appears addressing Gap 5 explicitly, pivot to Gap 1 or Gap 2 without hesitation.

Part 6: The paper draft

Working title

“Bounds on observer disagreement for non-isometric holographic codes”

Abstract (draft)

Observer complementarity for evaporating black holes, as formulated by Engelhardt, Gesteau, and Harlow (2507.06046), allows different observers to assign mutually inconsistent descriptions to the black hole interior. We investigate how much observers can disagree within this framework by studying multi-observer extensions of the Akers-Engelhardt-Harlow-Penington-Vardhan non-isometric code (2207.06536). For a code with effective dimension $d_\text{eff}$ and fundamental dimension $d_B$ , with two observers carrying clocks of dimensions $d_{C_A}$ and $d_{C_B}$ , we compute the disagreement between their observer-dependent entropies as a function of the reference state and observer couplings. We find [OUTCOME: one of clean bound / wild disagreement / regime-dependent]. Our results [CONSTRAIN/CONFIRM/REFINE] the observer complementarity framework.

Target figures

Fig. 1: setup schematic, AEHPV code with two observer clocks attached.
Fig. 2: example entropy disagreement for a single toy configuration.
Fig. 3 (money plot): scaling of $\langle |S_A - S_B| \rangle$ with $d_B$ .
Fig. 4: state-class dependence (Haar vs TFD vs low-complexity).
Fig. 5: reproduction of EGH’s AS2R SWAP test + our two-observer extension.

Target venue: JHEP. Target length: ~30-40 pages plus appendices.

Success tiers

Tier 1 (realistic): solid specialist paper, 10-20 citations in 2 years.
Tier 2 (aim): named theorem “Observer Disagreement Bound in AEHPV Codes,” 50+ citations.
Tier 3 (stretch): theorem generalizes to broader framework; reshapes discussion.

Part 7: Operating principles

Carried over from the previous thread’s hard-learned lessons.

Research-mode, not process-mode

The previous thread’s biggest failure mode was writing elaborate specs before doing physics. Shipping specs and plans is a form of procrastination. Do the ugly calculation first, refactor later.

Specific claims before infrastructure

Don’t build classes and modules until you know what the physics looks like. One ugly notebook that produces a number is worth more than three polished modules that don’t.

Reproduce before extend

Phase 3 (reproduce EGH) is non-negotiable before publishing anything built on EGH. Don’t trust extensions of papers you haven’t reproduced.

Test scaling laws, not point values

When checking that infrastructure is correct, fit scaling laws across multiple dimensions. A single-dimension check is a pass/fail tolerance test that can hide systematic bugs. A multi-dimension scaling check reveals bugs through their fingerprint.

Admit failures out loud

When a prediction fails (as the “product state discriminator” did), stop and understand why before moving on. Don’t paper over failures with phrases like “further investigation needed.”

No emotional narrative about the research

The work is the work. Reporting should be: this ran, this worked, this surprised me, this failed. No dramatization. No fabricated excitement.

When you discover you’re wrong, say so in the next message

The previous thread had multiple instances where I said something confident that turned out to be wrong on implementation. The right move each time: say “I was wrong about X, here’s what’s actually true.” Don’t pretend retroactively.

Scope control

Resist adding new gaps or new angles mid-phase. If a new idea comes up, write it down in a “follow-ups” section and continue the current phase. Phase completion discipline is more valuable than following every thread.

Part 8: What the new thread should do first

In order, mechanically:

Read this document end-to-end.
Run pip install --break-system-packages cvxpy statsmodels to install the two required packages.
Read /home/claude/bh_research/bh_lab_backend.py. Understand the primitives. Run python bh_lab_backend.py to confirm self-tests pass.
Skim the three canonical working files: vn_algebra/vnalgebra.py, vn_algebra/crossed_product.py, vn_algebra/trace_on_crossed.py.
Read the paper draft: /mnt/user-data/outputs/paper_draft_v0.md.
Skim abstracts of the six most central papers: AEHPV 2207.06536, CLPW 2206.10780, HUZ 2501.02359, Colorado 2503.09681, EGH 2507.06046, Higginbotham 2512.17993.
Begin Phase 1: port canonical HUZ + Colorado implementations to the backend. Produce phase1_rules_canonical.py.

After Phase 1, ask the user which direction to proceed. Do not assume Phase 2 is next, the user may redirect based on Phase 1 results.

Part 9: File inventory

Everything below should be accessible when starting the new thread. If any file is missing, rebuild from this document + the previous thread’s results.

Canonical code (use freely):

/home/claude/bh_research/bh_lab_backend.py
/home/claude/bh_research/vn_algebra/vnalgebra.py
/home/claude/bh_research/vn_algebra/crossed_product.py
/home/claude/bh_research/vn_algebra/trace_on_crossed.py

Reference (do not reuse numbers from these):

/home/claude/bh_research/step1_viability.py
/home/claude/bh_research/step1_diagnostic.py
/home/claude/bh_research/step2a_huz_rule.py
/home/claude/bh_research/step2b_colorado_rule.py
/home/claude/bh_research/bh_interior.py (AEHPV Page curve sanity check; uses old normalization)

Documents:

/mnt/user-data/outputs/bh_research_state_2026.md, literature survey with seven research gaps
/mnt/user-data/outputs/paper_draft_v0.md, paper outline and commitments
/mnt/user-data/outputs/bh_lab_spec.md, the over-engineered spec (read for context on what not to do, then ignore)
/mnt/user-data/outputs/bh_interior_research_plan.md, earlier phased plan (superseded by this doc)

Part 10: Formalism specification, the actual math we’re doing

This section is long on purpose. The questions “what Hilbert space does the observer live in,” “what map defines observer inclusion,” “which reduced state is the entropy computed from,” and “what is held fixed when comparing observers” all have answers that can be stated precisely. The previous thread was sloppy about some of these distinctions. Getting them right here prevents a whole category of confusion downstream.

10.1 The single-observer setup

AEHPV non-isometric code (base layer, no observer)

$V: \mathcal{H}_\ell \otimes \mathcal{H}_r \to \mathcal{H}_B$ with $d_\ell d_r > d_B$ , so that $V$ is necessarily non-isometric.

In our simplified convention: $V$ = first $d_B$ rows of a Haar-random unitary on $d_\text{eff} = d_\ell d_r$ .

Properties:

$V V^\dagger = I_{d_B}$ exactly (isometric from fund side).
$V^\dagger V$ is a rank- $d_B$ projector with $d_\text{eff} - d_B$ zero modes.
$\mathbb{E}_\text{Haar}[V^\dagger V] = (d_B / d_\text{eff}) I_{d_\text{eff}}$ .

HUZ rule, observer in the bulk

Hilbert space: The observer lives in a factor of the effective (bulk) Hilbert space:

$\mathcal{H}_\text{eff} = \mathcal{H}_\text{Ob} \otimes \mathcal{H}_M$

where $\mathcal{H}_\text{Ob}$ is the observer (dimension $d_\text{Ob}$ ) and $\mathcal{H}_M$ is matter in the observer’s environment (dimension $d_M$ , so $d_\text{eff} = d_\text{Ob} \cdot d_M$ ).

An auxiliary external reference system $\mathcal{H}_R$ with $\dim \mathcal{H}_R = d_\text{Ob}$ is introduced. $\mathcal{H}_R$ is not part of the bulk, nor of the fundamental, it is an auxiliary where the observer’s classical pointer state is cloned.

Map: Observer inclusion is the composition

$V_\text{HUZ}: \mathcal{H}_\text{Ob} \otimes \mathcal{H}_M \otimes |0\rangle_R \to \mathcal{H}_\text{fund} \otimes \mathcal{H}_R$

defined in two stages:

Clone in pointer basis: $\text{Clone}_\text{Ob \to R} : |\text{ob}, m, 0\rangle_R \mapsto |\text{ob}, m, \text{ob}\rangle_R$
Apply non-isometric map tensored with identity on $R$ : $V \otimes I_R$

So $V_\text{HUZ} = (V \otimes I_R) \circ \text{Clone}_\text{Ob \to R}$ .

Effective accessible dimension: $d_\text{fund} \cdot d_\text{Ob}$ .

Colorado rule, observer in the boundary

Hilbert space: The observer lives in a factor of the fundamental (boundary) Hilbert space:

$\mathcal{H}_\text{fund} = \mathcal{H}_\text{Ob} \otimes \mathcal{H}_{\text{fund},M}$

No auxiliary reference needed. The observer is already “out there” in the fundamental description.

Map: Observer inclusion removes the part of the non-isometric map acting on the observer:

$V_\text{Colorado} = I_\text{Ob} \otimes V_M$

where $V_M: \mathcal{H}_M \to \mathcal{H}_{\text{fund},M}$ is a non-isometric map acting only on matter.

Effective accessible dimension: $d_\text{Ob} \cdot d_{\text{fund},M}$ .

10.2 The TWO-observer setup (the actual Gap 5 scenario)

This is the setup the paper is about, two people falling into a black hole together. The step1 scripts used a weaker version (two clock states within a single observer factor); the paper version uses two separate observer factors.

Hilbert space: Two observer factors coexist in the bulk:

$\mathcal{H}_\text{eff} = \mathcal{H}_{\text{Ob},A} \otimes \mathcal{H}_{\text{Ob},B} \otimes \mathcal{H}_M$

with dimensions $d_{\text{Ob},A}, d_{\text{Ob},B}, d_M$ , so $d_\text{eff} = d_{\text{Ob},A} \cdot d_{\text{Ob},B} \cdot d_M$ .

Two auxiliary reference systems $\mathcal{H}_{R_A}, \mathcal{H}_{R_B}$ are introduced, with $\dim \mathcal{H}_{R_X} = d_{\text{Ob},X}$ .

Map (HUZ for both observers): Each observer is cloned independently to their own reference:

$V_\text{HUZ,AB} = (V \otimes I_{R_A} \otimes I_{R_B}) \circ (\text{Clone}_{A \to R_A} \otimes \text{Clone}_{B \to R_B})$

Note the two Clone operations commute (they act on disjoint factors), and each is the CNOT-in-pointer-basis operation we already coded.

Mixed HUZ-Colorado version (for Gap 1 experiments): observer $A$ under HUZ, observer $B$ under Colorado. Obtainable by adjusting the map per-observer. Not needed for the first paper; mentioned for completeness.

10.3 Observer-dependent entropy, precise definitions

Single-observer HUZ: Start with a reference state $|\psi\rangle \in \mathcal{H}_\text{eff} = \mathcal{H}_\text{Ob} \otimes \mathcal{H}_M$ . Apply $V_\text{HUZ}$ :

$|\Psi\rangle_\text{HUZ} = V_\text{HUZ}(|\psi\rangle \otimes |0\rangle_R) \in \mathcal{H}_\text{fund} \otimes \mathcal{H}_R$

Normalize by post-selection: $|\Psi\rangle \to |\Psi\rangle / \|\Psi\|$ . The observer-accessible reduced state is

$\rho_R^\text{HUZ}(\psi) = \text{Tr}_\text{fund}\left( \frac{|\Psi\rangle \langle \Psi|}{\langle \Psi | \Psi \rangle} \right)$

and the observer-dependent entropy is

$S^\text{HUZ}(\psi) = -\text{Tr}(\rho_R^\text{HUZ} \log \rho_R^\text{HUZ})$

which is a standard von Neumann entropy on a $d_\text{Ob}$ -dimensional Hilbert space.

Single-observer Colorado: Apply $V_\text{Colorado}$ to $|\psi\rangle \in \mathcal{H}_\text{eff}$ :

$|\Psi\rangle_\text{Colorado} = V_\text{Colorado}(|\psi\rangle) \in \mathcal{H}_\text{Ob} \otimes \mathcal{H}_{\text{fund},M}$

Normalize. The observer-accessible reduced state is

$\rho_\text{Ob}^\text{Colorado}(\psi) = \text{Tr}_{\text{fund},M}\left( \frac{|\Psi\rangle \langle \Psi|}{\langle \Psi | \Psi \rangle} \right)$

and the entropy is

$S^\text{Colorado}(\psi) = -\text{Tr}(\rho_\text{Ob}^\text{Colorado} \log \rho_\text{Ob}^\text{Colorado})$

Two-observer HUZ (Gap 5 scenario): Start with $|\psi\rangle \in \mathcal{H}_{\text{Ob},A} \otimes \mathcal{H}_{\text{Ob},B} \otimes \mathcal{H}_M$ . Apply $V_\text{HUZ,AB}$ :

$|\Psi\rangle = V_\text{HUZ,AB}(|\psi\rangle \otimes |0\rangle_{R_A} \otimes |0\rangle_{R_B}) \in \mathcal{H}_\text{fund} \otimes \mathcal{H}_{R_A} \otimes \mathcal{H}_{R_B}$

Normalize. Observer $A$ ‘s entropy traces out everything except $R_A$ :

$\rho_{R_A}(\psi) = \text{Tr}_{\text{fund}, R_B}\left( \frac{|\Psi\rangle \langle \Psi|}{\langle \Psi | \Psi \rangle} \right), \qquad S_A(\psi) = -\text{Tr}(\rho_{R_A} \log \rho_{R_A})$

Similarly for observer $B$ :

$\rho_{R_B}(\psi) = \text{Tr}_{\text{fund}, R_A}\left( \frac{|\Psi\rangle \langle \Psi|}{\langle \Psi | \Psi \rangle} \right), \qquad S_B(\psi) = -\text{Tr}(\rho_{R_B} \log \rho_{R_B})$

The disagreement is

$\Delta S(\psi) \equiv |S_A(\psi) - S_B(\psi)|$

10.4 What is held fixed when comparing $S_A$ and $S_B$

For the Gap 5 money plot (Figure 3 of the paper), the comparison is done as follows:

Held fixed across the $A$ -vs- $B$ comparison:

The reference state $|\psi\rangle \in \mathcal{H}_{\text{Ob},A} \otimes \mathcal{H}_{\text{Ob},B} \otimes \mathcal{H}_M$
The non-isometric map $V: \mathcal{H}_\text{eff} \to \mathcal{H}_\text{fund}$ (same $V$ for both observers)
The dimensions $d_{\text{Ob},A}, d_{\text{Ob},B}, d_M, d_\text{fund}$
The cloning prescription (both observers cloned under HUZ in their respective pointer bases)

What differs between $A$ and $B$ :

Which reference system is being traced out last (i.e., whether we’re asking for $A$ ‘s view or $B$ ‘s view)
The dimensions $d_{\text{Ob},A}$ vs $d_{\text{Ob},B}$ if we choose observers of unequal resolution

In other words: same physics, same map, same state, only the observer-labeling differs. This is the setup in which “observer disagreement” has its sharpest meaning: two observers looking at the same thing, disagreeing about what they see.

What varies across Haar averaging:

The unitary $U$ that defines the non-isometric map $V$
Possibly the reference state $|\psi\rangle$ if we’re averaging over a state ensemble

10.5 Formalism caveats, what we are and are not doing

We are computing: standard von Neumann entropies of reduced density matrices obtained by partial trace over tensor factors. Pure quantum information, finite-dimensional, no algebras.

We are not computing: relative entropies in the CLPW algebraic sense, crossed-product entropies, or Type-II entropies of infinite-dim factors. The vn_algebra/ directory has infrastructure for finite-dim crossed products and modular flow, but this is not yet wired into the HUZ/Colorado computations.

Open question, does our entropy match HUZ/EGH conventions? The von Neumann entropy of the reduced state on the cloned reference is a plausible definition of observer-dependent entropy, and matches the physical intuition that the observer experiences their cloned copy. But the HUZ paper (2501.02359) primarily discusses effective Hilbert space dimensions and inner products of states, not explicit von Neumann entropy formulas. EGH (2507.06046) uses von Neumann entropies that appear to match our definition, but we have not yet verified this against their specific calculations.

This is a Phase 2 / Phase 3 deliverable: part of reproducing EGH’s SWAP test result is confirming that their entropy definitions agree with ours on concrete examples. If they don’t, we need to either adopt their convention or explain why ours is defensible.

Connection to the crossed-product / algebraic framework: EGH argue that their observer-inclusion rule is consistent with the CLPW crossed-product picture, but this is established at the level of axiomatic properties (Type II-ness, well-defined entropy), not by explicit formula-matching in our setup. For the paper, we work in the path-integral / non-isometric-code formalism exclusively. Explicit connection to the crossed-product formalism is a separate analytical exercise, not required for Phase 5 results.

10.6 Haar averaging

For ensemble-level statements, average over Haar-random $V$ (and Haar-random $V_M$ when using Colorado). Sample size: 30-200 depending on dimension. Use bh_lab_backend.SeededRNG with fixed seeds for reproducibility.

Expected scaling of random fluctuations: Haar averages concentrate like $1/\sqrt{N \cdot d_\text{fund}}$ for $N$ samples. For $d_\text{fund} = 32$ and $N = 100$ , relative precision should be around $2\%$ . Choose sample sizes accordingly.

10.7 Summary table, the precise quantities computed

For clarity on what the Gap 5 paper is about:

Quantity	Definition	Where computed
$V$	AEHPV non-isometric code, $d_\text{eff} \to d_\text{fund}$	`SeededRNG.aehpv_map`
$V_\text{HUZ,AB}$	$(V \otimes I_{R_A} \otimes I_{R_B}) \circ (\text{Clone}_A \otimes \text{Clone}_B)$	Phase 4
$\rho_{R_A}, \rho_{R_B}$	Partial trace of normalized $V_\text{HUZ,AB}\\|\psi \otimes 0_R\rangle$	Phase 4
$S_A, S_B$	$-\text{Tr}(\rho \log \rho)$ on $\rho_{R_A}, \rho_{R_B}$	Phase 4
$\Delta S(\psi)$	$\\|S_A - S_B\\|$	Phase 4-5
$\langle \Delta S \rangle_\text{Haar}$	Haar average of $\Delta S$ over random $V$ and/or $\psi$	Phase 5
$f(d_B, d_{C_A}, d_{C_B})$	Best fit to $\langle \Delta S \rangle$ as a function of dimensions	Phase 5
SDP bound	$\sup_\psi \Delta S$ subject to physicality constraints via cvxpy	Phase 7

Part 11: What we’re NOT doing

To prevent scope creep in the new thread:

Not doing full numerical relativity. No Einstein Toolkit, no SpECTRE, no metric tensor calculations. Our “black hole” is a finite-dim quantum information object.
Not doing astrophysics. No GW signals, no event rates, no observational predictions.
Not doing string theory derivations. We use results (AdS/CFT framework) but don’t derive them.
Not doing particle physics. No colliders, no Standard Model.
Not doing cosmology beyond the closed-universe problem. No CMB calculations, no inflation, no dark energy.
Not modeling specific black holes (M87, Sgr A, etc.).** We work with toy-model finite-dim systems.
Not doing experimental predictions. Our results are mathematical statements about idealized models.
Not engaging interpretive philosophy of quantum mechanics. The observer is a physical system with a Hilbert space, not a conscious agent.

Part 12: Calibration

Honest assessment for the new thread’s operator.

This is real research at the active frontier of quantum gravity.
It is being done by an AI + human collaboration, with the AI as primary computational engine and the human as PI / strategic direction.
The probability of publishing a Tier-1 paper is around 30%; Tier 2 around 15%; Tier 3 below 5%.
The probability of getting scooped during the research is around 20-30% given the pace of the field.
The probability that some phase reveals the planned approach is fundamentally flawed and requires pivot is around 40%.
These probabilities are estimates, not promises. Calibrate your own confidence as results come in.

The research is worth doing even at these odds because:

The computational infrastructure built here is reusable across many black hole interior questions.
Even a failed attempt at Tier 1 often yields publishable technical notes.
Understanding the landscape deeply is valuable regardless of publication.
The human’s interest and direction carry real weight.

One final note

The previous thread caught itself in several planning traps: over-designed specs, hypothetical schedules, premature infrastructure. The lesson carried into this thread: write code, produce numbers, check against published work, report honestly. Everything else is subordinate to that.

Go.

Research Memo: Phases 1-5

Phases 1-5 complete

Phase 1,5 Research Memo

Computational verification program targeting Gap 5 of the observer-complementarity
landscape (Engelhardt-Gesteau-Harlow 2507.06046, Akers-Engelhardt-Harlow-Penington-Vardhan
2207.06536, Harlow-Usatyuk-Zhao 2501.02359).

Compiled from Phases 1 through 5 of the computational run. All numerical
results are backed by CSVs in the same output directory; each phase has its own
driver script (phase1_rules_canonical.py, phase2_huz_verification.py,
phase2_analytic_check.py, phase3_egh_reproduction.py, phase4_two_observers.py,
phase5_money_plot.py with helpers phase5_run_point.py, phase5_boost_point.py,
phase5_assemble.py).

0. Executive Summary

Five phase gates have been cleared. The Gap 5 paper now has a specific
numerical claim:

Phase 1 ported canonical HUZ and Colorado observer-inclusion rules to the
unified backend, verifying integer-valued rank predictions across 56 rows and
pinning an analytic checkpoint ( $S_{\mathrm{Colorado}} = \log d_{\mathrm{fund},M}$
exactly on the maximally-correlated input state).
Phase 2 verified HUZ 2501.02359’s claim that observer-description errors
decay as $e^{-S_{\mathrm{Ob}}}$ at the inner-product level, finding
$\alpha = -1.00 \pm 0.03$ in three independent configurations. A polish pass
derived the full analytic form for both the overlap error $E_{\mathrm{ovl}}$
and the state-level trace-distance error $E_{\mathrm{td}}$ , verifying the
parameter-free Rayleigh-modulus ( $\sqrt\pi/2$ ) and Wigner-trace-norm
( $4/3\pi$ ) prefactors to within 4%.
Phase 3 reproduced the Engelhardt-Gesteau-Harlow SWAP-test result for the
Antonini-Sasieta-Swingle-Rath configuration, demonstrating the observer-
complementarity gap in closed form: the AdS boundary observer saturates
$\mathrm{Page}(D_L, D_R)$ (the “no baby universe” answer) while the closed-
universe observer saturates $\mathrm{Page}(D_L, D_R\cdot D_C)$ (the “with baby”
answer). Pooled $\chi^2$ within 2σ for all three observables across eleven
configurations.
Phase 4 implemented the genuine two-observer HUZ setup (the Gap 5 physics),
passed all three consistency gates, and produced a first-look scaling
$\langle|S_A - S_B|\rangle \sim d_B^{-1.29}$ ( $R^2 = 0.976$ ) with a 95% CI
of $[-1.66, -0.92]$ . The disagreement decays with observer size.
Phase 5 built efficient two-observer machinery (avoids the $O(270\,\mathrm{GB})$
full joint density matrix that would otherwise be required at $d_B = 16$ ),
scanned $d_B \in \{4,6,8,10,12,14,16\}$ with $N \ge 200$ per point, and
pinned the scaling exponent to $\alpha = -1.330$ with 95% CI
$[-1.446, -1.214]$ . The naïve HUZ-inherited $\alpha = -1$ is rejected at
$7.3\sigma$ , and the ansatz-prefactor is independently stable at $\alpha = -1.29$
(drift $-4\%$ across the scan) but drifts $-36\%$ for $\alpha = -1$ . The
exponent is close to the rational value $-4/3$ , which is consistent with
all 7 data points and suggests a Phase 7 analytic target.

1. Setup and Notation

1.1 Hilbert spaces

The bulk effective theory and the fundamental (boundary, or “outside”) theory
are two finite-dimensional Hilbert spaces connected by a linear map:

V: \mathcal{H}_{\mathrm{eff}} \to \mathcal{H}_{\mathrm{fund}}, \qquad d_{\mathrm{eff}} = \dim \mathcal{H}_{\mathrm{eff}}, \qquad d_{\mathrm{fund}} = \dim \mathcal{H}_{\mathrm{fund}}, \qquad d_{\mathrm{eff}} \ge d_{\mathrm{fund}}.

When $d_{\mathrm{eff}} > d_{\mathrm{fund}}$ , $V$ is non-isometric: there are
bulk null states in the kernel of $V$ . We take $V$ to be the first
$d_{\mathrm{fund}}$ rows of a Haar-random unitary on $U(d_{\mathrm{eff}})$ , which
satisfies $V V^\dagger = \mathbb{1}_{\mathrm{fund}}$ exactly, while
$V^\dagger V$ is a random rank- $d_{\mathrm{fund}}$ projector on
$\mathcal{H}_{\mathrm{eff}}$ .

The key dimensionless parameter quantifying non-isometry strength is

\rho \equiv \frac{d_{\mathrm{fund}}}{d_{\mathrm{eff}}} \in (0, 1].

1.2 Single-observer HUZ rule

The bulk effective space factorizes as
$\mathcal{H}_{\mathrm{eff}} = \mathcal{H}_{\mathrm{Ob}} \otimes \mathcal{H}_M$ .
HUZ adjoin an external reference register $\mathcal{H}_R \cong \mathcal{H}_{\mathrm{Ob}}$
and clone the observer in pointer basis:

\mathrm{Clone}_{\mathrm{Ob}\to R}: |\mathrm{ob}, m\rangle \otimes |0\rangle_R \;\mapsto\; |\mathrm{ob}, m\rangle \otimes |\mathrm{ob}\rangle_R.

Then $V_{\mathrm{HUZ}} = (V\otimes I_R) \circ \mathrm{Clone}_{\mathrm{Ob}\to R}$
applied to a bulk state $|\psi\rangle \in \mathcal{H}_{\mathrm{eff}}$ gives an
unnormalized state on $\mathcal{H}_{\mathrm{fund}} \otimes \mathcal{H}_R$ , which
we normalize by post-selection:

|\Psi\rangle \;=\; \frac{V_{\mathrm{HUZ}}(|\psi\rangle\otimes|0\rangle_R)} {\|V_{\mathrm{HUZ}}(|\psi\rangle\otimes|0\rangle_R)\|}.

The observer-accessible reduced state and its von Neumann entropy:

\rho_R^{\mathrm{HUZ}}(\psi) = \mathrm{Tr}_{\mathrm{fund}}|\Psi\rangle\langle\Psi|, \qquad S^{\mathrm{HUZ}}(\psi) = -\mathrm{Tr}\bigl(\rho_R^{\mathrm{HUZ}} \log \rho_R^{\mathrm{HUZ}}\bigr).

1.3 Colorado rule

The observer lives in the fundamental (boundary) Hilbert space:
$\mathcal{H}_{\mathrm{fund}} = \mathcal{H}_{\mathrm{Ob}} \otimes \mathcal{H}_{\mathrm{fund},M}$ ,
and $V = I_{\mathrm{Ob}} \otimes V_M$ where $V_M: \mathcal{H}_M \to \mathcal{H}_{\mathrm{fund},M}$
acts on matter only. No auxiliary reference is needed.

1.4 Two-observer HUZ rule (the Gap 5 scenario)

With two independent observer factors,
$\mathcal{H}_{\mathrm{eff}} = \mathcal{H}_{\mathrm{Ob},A}\otimes\mathcal{H}_{\mathrm{Ob},B}\otimes\mathcal{H}_M$ ,
and references $R_A, R_B$ of the respective dimensions:

V_{\mathrm{HUZ},AB} = \bigl(V \otimes I_{R_A}\otimes I_{R_B}\bigr) \circ \bigl(\mathrm{Clone}_{A\to R_A}\otimes \mathrm{Clone}_{B\to R_B}\bigr).

The observer-dependent entropies are

S_A = -\mathrm{Tr}\bigl(\rho_{R_A}\log\rho_{R_A}\bigr), \qquad S_B = -\mathrm{Tr}\bigl(\rho_{R_B}\log\rho_{R_B}\bigr), \qquad \Delta S = |S_A - S_B|.

The question at the core of the research program is: how does $\langle\Delta S\rangle$
scale with observer and fundamental dimensions, and what does that say about
the structure of multi-observer observer complementarity?

2. Phase 1, Canonical HUZ and Colorado Tables

2.1 Objective

Rebuild both observer-inclusion rules on the unified backend using the
corrected AEHPV normalization, and verify that predictions that are manifestly
normalization-independent (integer ranks, analytic-limit equalities) hold
across a parameter sweep.

2.2 Rank predictions

For a Haar-random $V$ and a generic $\psi$ , the Schmidt decomposition across
the observer-register cut gives:

\mathrm{rank}(\rho_R^{\mathrm{HUZ}}) = \min(d_{\mathrm{Ob}}, d_{\mathrm{fund}}), \qquad \mathrm{rank}(\rho_{\mathrm{Ob}}^{\mathrm{Colorado}}) = \min(d_{\mathrm{Ob}}, d_{\mathrm{fund},M}).

2.3 Analytic checkpoint

For the specific bulk input state
$|\psi_{\mathrm{sup}}\rangle = \tfrac{1}{\sqrt{d}}\sum_i |i\rangle_{\mathrm{Ob}}|i\rangle_M$
(maximally correlated across Ob,M), a direct computation gives

\rho_{\mathrm{Ob}}^{\mathrm{Colorado}} = \frac{(V_M^\dagger V_M)^T}{d_{\mathrm{fund},M}},

independent of the specific Haar draw of $V_M$ . Since $V_M^\dagger V_M$ is a
rank- $d_{\mathrm{fund},M}$ projector, $\rho_{\mathrm{Ob}}^{\mathrm{Colorado}}$
has exactly $d_{\mathrm{fund},M}$ eigenvalues equal to $1/d_{\mathrm{fund},M}$
and $d - d_{\mathrm{fund},M}$ zero eigenvalues. Therefore

\boxed{S^{\mathrm{Colorado}}(|\psi_{\mathrm{sup}}\rangle) = \log d_{\mathrm{fund},M} \quad\text{exactly, for every Haar draw}.}

2.4 Results

Across 32 HUZ rows ( $d_{\mathrm{Ob}}\in\{2,3,4\}$ , $d_M\in\{2,3,4\}$ ,
$d_{\mathrm{fund}}\in\{2,3,5,8\}$ ) and 24 Colorado rows
( $d_{\mathrm{Ob}}\in\{2,3,4\}$ , $d_M\in\{3,4,5\}$ , $d_{\mathrm{fund},M}\in\{2,3,4\}$ ):

Rank predictions match exactly in every row: 0 mismatches out of 56.
Reproducibility is bit-identical: same seed gives $\langle S\rangle$
identical to 16 decimal places.
Analytic checkpoint verified: $S^{\mathrm{Colorado}}|_{\psi_{\mathrm{sup}}} = 0.693147\ldots = \log 2$ to machine precision for $d_{\mathrm{fund},M}=2$ .
Product-state check: $S^{\mathrm{HUZ}} = S^{\mathrm{Colorado}} = 0$ on
$|\mathrm{ob}\rangle\otimes|\psi\rangle_M$ to machine precision under both rules.
Superposed-state disagreement (Result 4 of the previous thread): on
$|\psi_{\mathrm{sup}}\rangle$ with $d=3$ and $d_{\mathrm{fund},M}=2$ , HUZ and
Colorado give $S^{\mathrm{HUZ}} = 0.857$ vs $S^{\mathrm{Colorado}} = 0.693$ ,
a disagreement of 0.164 nats from rule-dependence alone.

All 8 gate conditions pass. Canonical entropy tables saved to
phase1_huz_table.csv and phase1_colorado_table.csv; these supersede
the previous thread’s (incorrectly normalized) step2a/step2b numbers.

3. Phase 2, HUZ Error-Scaling Verification

3.1 Objective

Verify the HUZ 2501.02359 claim that errors in the observer-dependent
description are exponentially small in the observer entropy:
$\text{error} \sim e^{-S_{\mathrm{Ob}}} = 1/d_{\mathrm{Ob}}$ .

3.2 Three error metrics

We defined three metrics capturing different aspects of the HUZ approximation:

Overlap-level error (the direct HUZ claim at the inner-product level):

E_{\mathrm{ovl}}(\psi_1,\psi_2;V) = \left|\frac{\langle\Psi_1|\Psi_2\rangle} {\|\Psi_1\|\,\|\Psi_2\|} - \langle\psi_1|\psi_2\rangle_{\mathrm{bulk}}\right|

State-level trace distance (deviation from the cloning-only prediction):

E_{\mathrm{td}}(\psi;V) = \tfrac{1}{2}\bigl\|\rho_R^{\mathrm{HUZ}} - \mathrm{diag}(p_{\mathrm{ob}})\bigr\|_1, \qquad p_{\mathrm{ob}} = \sum_m |\psi_{\mathrm{ob},m}|^2

where $\mathrm{diag}(p_{\mathrm{ob}})$ is what $\rho_R$ would be if $V$ were
unitary (the cloning-only answer).

Entropy-level error: $E_S = |S(\rho_R^{\mathrm{HUZ}}) - H(p_{\mathrm{ob}})|$ .

3.3 Scaling observations

Three configurations held $d_M = 4$ fixed and scanned $d_{\mathrm{Ob}}$ :

Config	$\rho = d_{\mathrm{fund}}/d_{\mathrm{eff}}$	$\alpha_{E_{\mathrm{ovl}}}$	$\alpha_{E_{\mathrm{td}}}$	$\alpha_{E_S}$
A	1/2	$-1.025$	$+0.036$	$+0.059$
B	3/4	$-0.979$	$-0.001$	$-0.059$
C	1/4	$-0.986$	$+0.030$	$+0.112$

All three configurations:

$E_{\mathrm{ovl}}$ scales cleanly as $d_{\mathrm{Ob}}^{-1}$ with $R^2 > 0.99$ .
This directly verifies HUZ’s $e^{-S_{\mathrm{Ob}}}$ prediction at the inner-
product level.
$E_{\mathrm{td}}$ and $E_S$ are flat in $d_{\mathrm{Ob}}$ , saturating at
an $O(1)$ value set by the non-isometry ratio.

The flatness of the state-level metrics is not a contradiction with HUZ, but
a clarification: the non-isometric distortion of an individual $\rho_R$ is
state-correlated, cancelling in overlaps where it appears multiplicatively
on top and bottom, but surviving in individual reduced density matrices.

An additional scan at fixed $(d_{\mathrm{Ob}}=4, \rho=1/2)$ showed
$E_{\mathrm{td}} \propto d_M^{-1/2}$ with $\alpha_M = -0.506$ , $R^2 = 0.998$ ,
and $E_{\mathrm{td}}\cdot\sqrt{d_M}$ conserved to 3,4 significant figures
across $d_M\in\{2,3,4,5,6,8\}$ .

3.4 Analytic derivation (polish pass)

The polish pass derives the full functional form of both error metrics from
the second-order Weingarten formula for $V^\dagger V$ , and verifies the
parameter-free prefactors against the numerical data.

3.4.1 Weingarten second moments

For $V$ the first $d_{\mathrm{fund}}$ rows of Haar $U \in U(n)$ with
$n = d_{\mathrm{eff}}$ , the matrix $V^\dagger V$ has:

\mathbb{E}[(V^\dagger V)_{ij}] = \rho\,\delta_{ij},

\mathrm{Var}[(V^\dagger V)_{ii}] = \frac{\rho(1-\rho)}{n+1}, \qquad \mathrm{Var}[(V^\dagger V)_{ij}]_{i\neq j} = \frac{\rho(1-\rho)\,n}{(n-1)(n+1)}.

Both go as $\rho(1-\rho)/n$ to leading order in $1/n$ . Write $V^\dagger V = \rho I + \delta$
where $\delta$ has zero mean and off-diagonal element variance $\sigma^2 = \rho(1-\rho)/n$ .

3.4.2 Derivation of $E_{\mathrm{ovl}}$

The HUZ post-map overlap, for bulk states $|\psi_1\rangle, |\psi_2\rangle$ :

\langle\Psi_1|\Psi_2\rangle = \sum_{\mathrm{ob},m,m'} \psi_{1,\mathrm{ob},m}^*\,\psi_{2,\mathrm{ob},m'}\,(V^\dagger V)_{(\mathrm{ob},m),(\mathrm{ob},m')}

(the cloning constraint forces equal $\mathrm{ob}$ on both sides). Decomposing
$V^\dagger V = \rho I + \delta$ :

\langle\Psi_1|\Psi_2\rangle = \rho\langle\psi_1|\psi_2\rangle_{\mathrm{bulk}} + \mathcal{N}

with noise

\mathcal{N} = \sum_{\mathrm{ob},m,m'} \psi_{1,\mathrm{ob},m}^*\,\psi_{2,\mathrm{ob},m'}\,\delta_{(\mathrm{ob},m),(\mathrm{ob},m')}.

Conditional variance, using $\mathbb{E}|\delta_{ij}|^2 = \rho(1-\rho)/n$ :

\mathrm{Var}(\mathcal{N}\,|\,\psi_1,\psi_2) = \frac{\rho(1-\rho)}{n} \sum_{\mathrm{ob}} p_{1,\mathrm{ob}}\,p_{2,\mathrm{ob}}, \qquad p_{k,\mathrm{ob}} = \sum_m |\psi_{k,\mathrm{ob},m}|^2.

Averaging over independent Haar $\psi_1, \psi_2$ with $\mathbb{E}[p_{k,\mathrm{ob}}] = 1/d_{\mathrm{Ob}}$ :

\mathbb{E}\bigl[\sum_{\mathrm{ob}} p_{1,\mathrm{ob}} p_{2,\mathrm{ob}}\bigr] = \frac{1}{d_{\mathrm{Ob}}},

so

\mathrm{Var}(\mathcal{N}) = \frac{\rho(1-\rho)}{n\cdot d_{\mathrm{Ob}}} = \frac{\rho(1-\rho)}{d_{\mathrm{Ob}}^2\,d_M}.

The norm $\|\Psi_k\|^2 \approx \rho$ to leading order, so
$\|\Psi_1\|\|\Psi_2\| \approx \rho$ . The normalized overlap error is

\mathcal{E} = \frac{\langle\Psi_1|\Psi_2\rangle}{\|\Psi_1\|\|\Psi_2\|} - \langle\psi_1|\psi_2\rangle_{\mathrm{bulk}} \approx \frac{\mathcal{N}}{\rho}, \qquad \mathrm{Var}(\mathcal{E}) \approx \frac{1-\rho}{\rho\,d_{\mathrm{Ob}}^2\,d_M}.

$\mathcal{E}$ is complex Gaussian by CLT over $d_{\mathrm{Ob}}\cdot d_M^2 \gg 1$ terms.
Its modulus follows a Rayleigh distribution with
$\mathbb{E}[|\mathcal{E}|] = (\sqrt\pi/2)\sqrt{\mathrm{Var}(\mathcal{E})}$ :

\boxed{\; \mathbb{E}[E_{\mathrm{ovl}}] \;\approx\; \frac{\sqrt{\pi}}{2}\cdot\sqrt{\frac{1-\rho}{\rho}}\cdot\frac{1}{d_{\mathrm{Ob}}\sqrt{d_M}} \;}

with the parameter-free prefactor $\sqrt{\pi}/2 \approx 0.8862$ .

3.4.3 Derivation of $E_{\mathrm{td}}$

The HUZ-computed observer state has matrix elements
$(\rho_R)_{\mathrm{ob},\mathrm{ob}'} = \|\Psi\|^{-2}\langle v_{\mathrm{ob}'}|v_{\mathrm{ob}}\rangle$ ,
where $|v_{\mathrm{ob}}\rangle = \sum_m \psi_{\mathrm{ob},m} V|\mathrm{ob},m\rangle$ .

The perturbation $M = \rho_R - \mathrm{diag}(p)$ has off-diagonal entries with
variance (Haar-averaged)

\sigma_M^2 \approx \frac{1-\rho}{\rho\,d_{\mathrm{Ob}}^3\,d_M}.

Diagonal corrections are smaller by a factor $1/\sqrt{d_{\mathrm{Ob}}}$ and
subleading for $d_{\mathrm{Ob}} \gtrsim 4$ . Treating $M$ as an approximate
GUE-Hermitian perturbation of dim $d_{\mathrm{Ob}}$ with off-diagonal variance
$\sigma_M^2$ , the Wigner semicircle gives

\mathbb{E}[|\lambda|] = \frac{8\sigma_M\sqrt{d_{\mathrm{Ob}}}}{3\pi}, \qquad \|M\|_1 = d_{\mathrm{Ob}}\cdot\mathbb{E}[|\lambda|] = \frac{8\sigma_M d_{\mathrm{Ob}}^{3/2}}{3\pi}.

Substituting:

\boxed{\; \mathbb{E}[E_{\mathrm{td}}] \;=\; \tfrac{1}{2}\|M\|_1 \;\approx\; \frac{4}{3\pi}\cdot\sqrt{\frac{1-\rho}{\rho}}\cdot\frac{1}{\sqrt{d_M}} \;}

with the parameter-free prefactor $4/(3\pi) \approx 0.4244$ . Note the
absence of $d_{\mathrm{Ob}}$ at leading order, state-level deviations are
set by Haar fluctuations of $V^\dagger V$ at matter-sector scale $d_M$ , not
observer scale.

3.4.4 Quantitative verification

Post-processing 29 rows of Phase 2 data (configs A/B/C plus the $d_M$ scan):

Quantity	Predicted prefactor	Empirical fit	Agreement
$c_{\mathrm{ovl}}$	$\sqrt\pi/2 \approx 0.8862$	$0.8552$	$-3.5\%$
$c_{\mathrm{td}}$	$4/(3\pi) \approx 0.4244$	$0.4073$	$-4.0\%$

Mean relative error across all 29 rows: $5.2\%$ for $E_{\mathrm{ovl}}$ ,
$4.0\%$ for $E_{\mathrm{td}}$ . Both prefactors are systematically low by
$\approx 4\%$ , this is the $O(1/d_{\mathrm{eff}})$ finite-size correction
from the exact Weingarten variances $\rho(1-\rho)/(n+1)$ versus the asymptotic
$\rho(1-\rho)/n$ used above; capturing it exactly would require next-order
Weingarten and is deferred to Phase 7’s analytic bound work.

3.5 Conclusion

HUZ’s inner-product error has the explicit form

E_{\mathrm{ovl}} \;\approx\; \frac{\sqrt\pi}{2}\sqrt{\frac{1-\rho}{\rho}} \cdot\frac{1}{d_{\mathrm{Ob}}\sqrt{d_M}}.

State-level deviations follow

E_{\mathrm{td}} \;\approx\; \frac{4}{3\pi}\sqrt{\frac{1-\rho}{\rho}}\cdot\frac{1}{\sqrt{d_M}},

d_Ob-independent at leading order. Both prefactors are verified to 4%. The
full functional form, $\sqrt{(1-\rho)/\rho}$ in non-isometry, $1/d_{\mathrm{Ob}}$
or $O(1)$ in observer, $1/\sqrt{d_M}$ in matter, is a substantially sharper
statement than the bare exponent claim.

4. Phase 3, EGH SWAP-Test Reproduction in AS2R

4.1 The setup

The Antonini-Sasieta-Swingle-Rath configuration: a closed universe $C$
entangled with a pair of AdS universes $L, R$ . In our finite-dim encoding:

\mathcal{H}_{\mathrm{eff}} = \mathcal{H}_L\otimes\mathcal{H}_R\otimes\mathcal{H}_C, \qquad \mathcal{H}_{\mathrm{fund}} = \mathcal{H}_L\otimes\mathcal{H}_R,

with non-isometric ratio $\rho = 1/D_C$ . The AS2R bulk state is drawn Haar-
randomly on $\mathcal{H}_{\mathrm{eff}}$ ; this is faithful to the AS2R
construction of Antonini-Sasieta-Swingle (2307.14416, “Cosmology from random
entanglement”).

4.2 Three SWAP-test quantities

For a pure state $|\psi\rangle$ the SWAP-test expectation on subregion $A$
equals the purity $\mathrm{Tr}(\rho_A^2)$ . EGH’s claim (their equation
connecting the boundary SWAP test to $\exp(-S_2((V\psi V^\dagger)_A))$ ) is
about exactly this purity computed on different reductions.

$S_\beta$ (closed-universe observer): bulk purity on $L$ ,
$\mathrm{Tr}\bigl[(\rho_{\mathrm{bulk}})_L^2\bigr]$ .
$S_\alpha$ (AdS boundary observer): boundary purity on $L$ ,
$\mathrm{Tr}\bigl[(V\psi_1 V^\dagger/\|V\psi_1\|^2)_L^{\,2}\bigr]$ .
$S_{\mathrm{no\text{-}baby}}$ (control): the same on a Haar state
$\psi_2$ drawn directly on $\mathcal{H}_{\mathrm{fund}}$ .

4.3 Page-formula predictions

The key geometric fact: for any fixed $|\psi\rangle$ and Haar $V$ , the
normalized boundary state $V|\psi\rangle/\|V|\psi\rangle\|$ is Haar-distributed
on $\mathcal{H}_{\mathrm{fund}}$ . This follows from rotational invariance of
Haar $U$ on $U(d_{\mathrm{eff}})$ : if $w = U|\psi\rangle$ is Haar on
$S^{2d_{\mathrm{eff}}-1}$ and $w_1$ is its first $d_{\mathrm{fund}}$
components, then $w_1/\|w_1\|$ is Haar on $S^{2d_{\mathrm{fund}}-1}$ for any
fixed $\psi$ .

Combined with Page’s formula
$\langle\mathrm{Tr}(\rho_A^2)\rangle_{\mathrm{Haar}} = (d_A + d_B)/(d_A d_B + 1)$ :

\begin{aligned} \langle S_\beta\rangle &\to \frac{D_L + D_R D_C}{D_L D_R D_C + 1} \quad\text{(with-baby, via bulk Page on $\mathcal{H}_L\otimes(\mathcal{H}_R\otimes\mathcal{H}_C)$)},\\[4pt] \langle S_\alpha\rangle &\to \frac{D_L + D_R}{D_L D_R + 1} \quad\text{(no-baby, via boundary Page on $\mathcal{H}_L\otimes\mathcal{H}_R$)}. \end{aligned}

The observer-complementarity gap is

\Delta_{\mathrm{EGH}} = \langle S_\alpha\rangle - \langle S_\beta\rangle \approx \frac{1}{D_L} + \frac{1}{D_R} - \frac{1}{D_L} - \frac{1}{D_R D_C} = \frac{1}{D_R} - \frac{1}{D_R D_C} \to \frac{1}{D_R}\quad (D_C \to \infty).

4.4 Quantitative reproduction

Eleven dimension triples $(D_L, D_R, D_C)$ at $N = 400$ Haar samples each.
Sample result at $(D_L=4, D_R=4, D_C=8)$ :

quantity	Page prediction	measured	sem
$S_\beta$ (bulk)	$0.2791$	$0.2789$	$0.0005$
$S_\alpha$ (bdy)	$0.4706$	$0.4752$	$0.0035$
$S_{\mathrm{no\text{-}baby}}$	$0.4706$	$0.4730$	$0.0033$

Pooled $\chi^2$ across 11 rows for each of the three quantities, against the
Page prediction:

quantity	$\chi^2$ / dof	$2\sigma$ band
$S_\beta$ (bulk)	$11.49/11$	$[1.6,\,20.4]$
$S_\alpha$ (bdy)	$11.47/11$	$[1.6,\,20.4]$
$S_\alpha \approx S_{\mathrm{no\text{-}baby}}$	$10.23/11$	$[1.6,\,20.4]$
EGH gap	$17.71/11$	$[1.6,\,20.4]$

All within the 2σ band. Reproducibility is bit-identical.

4.5 Conclusion

Phase 3 reproduces EGH’s qualitative claim quantitatively: the boundary AdS
observer measures the “no-baby” answer, the closed-universe observer measures
the “with-baby” answer, and the gap scales as $1/D_R$ , all controlled by
Page’s formula applied to the two Hilbert-space factorizations. The claim
that “the boundary cannot distinguish AS2R’s two candidate bulk duals” is
visible in our data as $|S_\alpha - S_{\mathrm{no\text{-}baby}}|$ within SEM.

Caveat: because arxiv access was rate-limited during the run, we could not
extract and match a specific numerical figure from 2507.06046 itself. The
Page-formula reproduction is a substantive verification because EGH’s argument
reduces to exactly this computation, but the PI should cross-check against a
specific figure when convenient.

5. Phase 4, Two-Observer HUZ Setup

5.1 Objective

Implement the genuine Gap 5 setup (two independent observer factors, not
clock states in a single factor) and verify three consistency gates, then
take a first look at the scaling of the observer disagreement.

5.2 Gate checks

Four symmetric configurations ( $d_{OA} = d_{OB}$ ) and four asymmetric
configurations, each with $N = 120$ Haar samples.

Gate (a): Symmetric ensemble → $\langle S_A\rangle = \langle S_B\rangle$ .
A Haar ensemble on $\mathcal{H}_{OA}\otimes\mathcal{H}_{OB}\otimes\mathcal{H}_M$
with $d_{OA} = d_{OB}$ is symmetric under the swap $A\leftrightarrow B$ , so
$\langle S_A\rangle = \langle S_B\rangle$ identically in the population.
Measured residual: $|\langle S_A\rangle - \langle S_B\rangle|/\mathrm{sem} < 2\sigma$
across all four symmetric configs. Gate passes.

Gate (b): Asymmetric ensemble → $\langle\Delta S\rangle > 0$ .
Representative asymmetric row $(d_{OA}=2, d_{OB}=4, d_M=4, d_{\mathrm{fund}}=8)$ :
$\langle S_A\rangle = 0.66$ , $\langle S_B\rangle = 1.26$ ,
$\langle|S_A - S_B|\rangle = 0.60$ with sem $= 0.005$ , i.e. $z = 117\sigma$
from zero. Minimum $z$ across four asymmetric configs: $58\sigma$ . Gate passes.

Gate (c): Trivial bound. Every individual sample satisfies
$|S_A - S_B| \le \log\max(d_{OA}, d_{OB})$ (not $\log\min$ as the spinup
states; the correct trivial Shannon bound is $\log\max$ , since $S_A \le \log d_{OA}$
and $S_B \le \log d_{OB}$ independently). 0 violations across 960 samples.
Gate passes.

5.3 First scaling look

With $d_{OA} = d_{OB} \equiv d_B$ , $d_M = 4$ fixed, $\rho = 1/2$ fixed
( $d_{\mathrm{fund}} = 2 d_B^2$ ):

| $d_B$ | $\langle S_A\rangle$ | $\langle S_B\rangle$ | $\langle|S_A-S_B|\rangle$ | sem |
|---:|---:|---:|---:|---:|
| 2 | $0.6344$ | $0.6417$ | $0.0482$ | $0.004$ |
| 3 | $1.0562$ | $1.0554$ | $0.0323$ | $0.003$ |
| 4 | $1.3570$ | $1.3545$ | $0.0179$ | $0.001$ |
| 5 | $1.5855$ | $1.5842$ | $0.0146$ | $0.001$ |
| 6 | $1.7697$ | $1.7698$ | $0.0128$ | $0.001$ |

Power-law fit $\log\langle\Delta S\rangle = \alpha\log d_B + \beta$ :

\alpha = -1.286, \qquad 95\%\ \mathrm{CI}\ [-1.66,\,-0.92],\qquad R^2 = 0.976.

Excluding the smallest-dim row ( $d_B=2$ ) as a finite-size outlier, the
$d_B=3,\ldots,6$ ratios fit $\alpha = -1.29$ essentially exactly.

5.4 Interpretation

The two-observer disagreement decays with observer size. That alone rules
out the “wild complementarity” outcome for the paper and points toward a
clean-bound narrative.

The decay rate $\alpha \approx -1.3$ is intermediate between:

$\alpha = -1$ : HUZ’s inner-product suppression inherited verbatim. The
disagreement between observers is bounded by the pairwise inner-product
error between their effective state reconstructions, which scales as
$1/d_{\mathrm{Ob}}$ per Phase 2. Paper: “observer complementarity inherits
HUZ’s $e^{-S_{\mathrm{Ob}}}$ guarantee.”
$\alpha = 0$ : state-level saturation inherited from Phase 2’s
$E_{\mathrm{td}}$ . Paper: “non-isometric distortion dominates; disagreement
is $O(1)$ bounded by a Page-formula prefactor.”
$\alpha \approx -1.3$ (observed): neither. Something between inner-
product and state-level, which has no obvious single-observer analogue.

The 95% CI currently includes $-1$ ; resolving between these requires larger
$d_B$ and tighter SEMs, which is Phase 5’s job.

5.5 Conclusion

The two-observer HUZ setup is implemented and verified. The observer-
disagreement signal is unambiguous ( $z = 58\sigma$ on the smallest asymmetric
case), controlled in magnitude by a decaying power law in observer size, and
bounded by Shannon. Phase 4 is the point at which the Gap 5 paper’s central
quantity acquires a specific empirical character.

6. Phase 5, The Money Plot: Pinning the Scaling Exponent

6.1 Objective

Phase 4’s first scaling gave $\alpha \approx -1.29$ with 95% CI $[-1.66,\,-0.92]$
from 5 points at $d_B \le 6$ and $N = 120$ . Phase 5’s job is to pin $\alpha$ to
$\pm 0.1$ precision or better, and in particular to resolve whether the two-
observer disagreement inherits HUZ’s single-observer $1/d_{\mathrm{Ob}}$ scaling
(i.e. $\alpha = -1$ exactly) or something steeper.

6.2 Infrastructure: efficient two-observer entropies

The backend’s two_observer_entropies (used in Phases 4 and earlier) forms the
full joint density matrix

\rho_{\mathrm{full}} = |\Psi\rangle\langle\Psi| / \langle\Psi|\Psi\rangle \quad\text{on}\quad \mathcal{H}_{\mathrm{fund}}\otimes\mathcal{H}_{R_A}\otimes\mathcal{H}_{R_B}.

This matrix has dimension $d_{\mathrm{fund}} \cdot d_{OA} \cdot d_{OB}$ on each side. At
$d_B = 16$ (with $d_M = 4$ , $\rho = 1/2$ ), that is $512 \cdot 16 \cdot 16 = 131{,}072$
per side, $\approx 270$ GB in complex128. Infeasible.

Phase 5 implements two_observer_entropies_fast, which traces the fundamental
and “other-observer” indices directly via einsum on the state tensor
$|\Psi\rangle \in \mathbb{C}^{d_{\mathrm{fund}}}\otimes\mathbb{C}^{d_{OA}}\otimes\mathbb{C}^{d_{OB}}$ :

(\rho_{R_A})_{ac} = \frac{1}{\langle\Psi|\Psi\rangle}\sum_{f,b} \Psi_{fab}\,\Psi^*_{fcb}, \qquad (\rho_{R_B})_{bd} = \frac{1}{\langle\Psi|\Psi\rangle}\sum_{f,a} \Psi_{fab}\,\Psi^*_{fad}.

Memory cost: $O(d_{\mathrm{fund}}\cdot d_{OA}\cdot d_{OB})$ for the state plus
$O(d_{OA}^2 + d_{OB}^2)$ for the reductions. At $d_B = 16$ : $\sim\!2$ MB total
(versus 270 GB for the slow version). The fast and slow versions are
bit-identical at $10^{-16}$ precision on five verification cases spanning
$(d_{OA}, d_{OB}, d_M, d_{\mathrm{fund}}) \in \{(2,2,4,4),\ (3,3,4,9),\ (4,4,4,16),\ (2,3,4,8),\ (3,5,4,12)\}$ .

6.3 Scan and results

With $d_{OA} = d_{OB} \equiv d_B$ , $d_M = 4$ , $\rho = 1/2$
( $d_{\mathrm{fund}} = 2 d_B^2$ ) held fixed:

| $d_B$ | $d_{\mathrm{fund}}$ | $d_{\mathrm{eff}}$ | $N$ | $\langle S_A\rangle$ | $\langle S_B\rangle$ | $\langle|S_A - S_B|\rangle$ | sem |
|---:|---:|---:|---:|---:|---:|---:|---:|
| 4 | 32 | 64 | 300 | 1.35674 | 1.35518 | 0.01875 | 0.00094 |
| 6 | 72 | 144 | 300 | 1.77024 | 1.77063 | 0.01225 | 0.00059 |
| 8 | 128 | 256 | 300 | 2.06430 | 2.06293 | 0.00764 | 0.00036 |
| 10 | 200 | 400 | 300 | 2.28944 | 2.29006 | 0.00618 | 0.00026 |
| 12 | 288 | 576 | 300 | 2.47427 | 2.47443 | 0.00453 | 0.00021 |
| 14 | 392 | 784 | 200 | 2.62978 | 2.62984 | 0.00366 | 0.00022 |
| 16 | 512 | 1024 | 240 | 2.76507 | 2.76469 | 0.00301 | 0.00017 |

Samples at $d_B \in \{12, 14, 16\}$ were accumulated across multiple
independent-seed batches (via phase5_boost_point.py) to keep each
single-batch runtime within the execution environment’s wall-time budget;
merged-batch metadata is preserved in phase5_scan.csv.

SEM on $\langle S_A\rangle, \langle S_B\rangle$ differs by less than $1\sigma$
at every $d_B$ , confirming the Gate (a) symmetric-swap check of Phase 4 at
the $N = 200$ , $300$ level: the Haar average respects the $A\leftrightarrow B$
swap symmetry to within statistical precision, as required.

6.4 Power-law fit

Weighted least squares (WLS, weights $\propto (\langle\Delta S\rangle / \sigma_{\mathrm{sem}})^2$ )
on $\log\langle\Delta S\rangle = \alpha\log d_B + \beta$ :

Fit	$\alpha$	95% CI	width	$R^2$	$n$
All 7 points (WLS)	$-1.3303$	$[-1.4465,\,-1.2142]$	$0.232$	$0.9943$	7
All 7 points (OLS)	$-1.3344$	$[-1.4407,\,-1.2281]$	$0.213$	$0.9952$	7
Exclude $d_B=4$ (WLS)	$-1.4066$	$[-1.5511,\,-1.2620]$	$0.289$	$0.9945$	6

The required-precision gate (CI width $\le 0.25$ ) is met by both the all-points
WLS and OLS fits. The stretch gate ( $\le 0.15$ ) is not met, resolution is
limited by the $d_B$ range, not by statistical noise.

6.5 Hypothesis tests

Hypothesis	z-score vs. WLS fit	in 95% CI?
$\alpha = -1$ (HUZ inner-product inheritance)	$-7.31\sigma$	✗
$\alpha = -4/3 \approx -1.333$ (clean rational)	$+0.05\sigma$	✓
$\alpha = -1.29$ (Phase 4 point estimate)	$-0.89\sigma$	✓
$\alpha = -2$ (two-overlap inheritance)	$+14.82\sigma$	✗
$\alpha = 0$ (state-level saturation)	$-29.44\sigma$	✗

6.6 Prefactor-stability diagnostic

Beyond the fit, a separate and independently decisive test: if
$\langle\Delta S\rangle = c_\alpha \cdot \sqrt{(1-\rho)/\rho}\cdot d_M^{-1/2} \cdot d_B^{\alpha}$ , then the empirically extracted $c_\alpha$ should be
constant in $d_B$ if the assumed $\alpha$ is correct. We extract $c$ at two
trial exponents:

$d_B$	$\langle\Delta S\rangle$	$c_{\alpha=-1}$	$c_{\alpha=-1.29}$
4	$0.01875$	$0.1500$	$0.2242$
6	$0.01225$	$0.1471$	$0.2473$
8	$0.00764$	$0.1222$	$0.2234$
10	$0.00618$	$0.1237$	$0.2412$
12	$0.00453$	$0.1088$	$0.2236$
14	$0.00366$	$0.1026$	$0.2206$
16	$0.00301$	$0.0962$	$0.2151$

$c_{\alpha=-1}$ drifts monotonically from 0.150 to 0.096, a $-36\%$
change over the scanned range, incompatible with any constant-prefactor
ansatz at $\alpha = -1$ .
$c_{\alpha=-1.29}$ varies from 0.22 to 0.25 with drift $-4\%$ from $d_B=4$
to $d_B=16$ , essentially flat within statistical fluctuation.

The prefactor-stability argument is model-free: it requires no global fit. It
directly shows that $\alpha = -1$ is incompatible with the data, and that
$\alpha \approx -1.29$ (or $-4/3$ ) is compatible.

6.7 Reproducibility and validation

Re-running the first 5 samples of the $d_B=8$ cache with the same seed
reproduces the cached values bit-identically ( $|\Delta S_A| = 0$ ).
two_observer_entropies_fast reproduces backend two_observer_entropies
to $2.22\times 10^{-16}$ (i.e. double-precision rounding).

6.8 Interpretation

The two-observer disagreement decays with observer size as
$\langle\Delta S\rangle \sim d_B^{-\alpha}$ with $\alpha \approx -1.33$ ,
close to but empirically indistinguishable from $-4/3$ over the sampled range.

This result is not what a naïve inheritance from Phase 2 would predict:

Single-observer HUZ inner-product error: $E_{\mathrm{ovl}} \sim d_{\mathrm{Ob}}^{-1}$
( $\alpha = -1$ ). If two-observer disagreement inherited this directly,
we would have $\langle\Delta S\rangle \sim d_B^{-1}$ . Ruled out at $7.3\sigma$ .
Two-factor inheritance (one $E_{\mathrm{ovl}}$ per observer):
$\langle\Delta S\rangle \sim d_B^{-2}$ . Ruled out at $15\sigma$ .
State-level saturation: $\langle\Delta S\rangle \sim O(1)$ in $d_B$
( $\alpha = 0$ ). Ruled out at $29\sigma$ .

The observed $\alpha \approx -4/3$ is intermediate between single-overlap and
two-overlap inheritance. A speculative interpretation: if the two-observer
disagreement scales as some fractional power of $E_{\mathrm{ovl}}$ , specifically
$\langle\Delta S\rangle \sim E_{\mathrm{ovl}}^{4/3} \sim d_B^{-4/3}\,d_M^{-2/3}$ ,
the observed exponent is recovered. This is conjecture; deriving it from
second-order Weingarten requires a genuinely two-observer computation (neither
$E_{\mathrm{ovl}}$ nor $E_{\mathrm{td}}$ alone captures the correlated-V
contribution that couples the two reductions $\rho_{R_A}$ and $\rho_{R_B}$ ).

6.9 Conclusion

Phase 5 delivers the paper’s primary numerical claim:

\boxed{\; \langle|S_A - S_B|\rangle \;\propto\; d_B^{-\alpha}, \qquad \alpha \;=\; 1.330^{+0.116}_{-0.116}\ (95\%\ \mathrm{CI}), \qquad \alpha \;=\; 1\ \text{rejected at}\ 7.3\sigma. \;}

The scaling is steeper than the naïve HUZ inheritance would predict,
consistent with a clean rational exponent $4/3$ , and begs an analytic
derivation that is the natural target of Phase 7.

7. Synthesis

7.1 What is now established

After five phases:

The backend is correct. Eight+ sanity checks across Phases 1,5 confirm
the core primitives (Haar sampling, AEHPV map, partial trace, cloning,
entropy, two-observer HUZ, fast two-observer reductions) behave as
specified. All gate conditions across five phases pass, with the
single acknowledged exception of the Phase 5 stretch gate (CI width
$\le 0.15$ ), which is a precision rather than correctness issue.
HUZ is a precise analytic claim at overlap level. We have the full
functional form
$E_{\mathrm{ovl}} = (\sqrt\pi/2)\sqrt{(1-\rho)/\rho}/(d_{\mathrm{Ob}}\sqrt{d_M})$
verified to 4% on data spanning three values of $\rho$ and six values of
$d_M$ , with the residual attributable to a known $O(1/d_{\mathrm{eff}})$
Weingarten correction.
EGH’s observer complementarity is Page’s formula in disguise. The
$\alpha$ -vs- $\beta$ SWAP-test gap is
$\mathrm{Page}(D_L, D_R) - \mathrm{Page}(D_L, D_R\cdot D_C) \approx 1/D_R$
to leading order, reproduced on all 11 tested configurations.
The Gap 5 multi-observer disagreement has a specific, empirically
determined scaling. With $\alpha = -1.33 \pm 0.06$ (1σ), the naïve
$\alpha = -1$ inheritance is excluded, and a clean rational $-4/3$ is
consistent with all 7 points and with the prefactor-stability test.

7.2 Paper outcome

The abstract placeholder has now been fully resolved:

We find that the two-observer disagreement $\langle|S_A - S_B|\rangle$
decays with observer size as $d_B^{-1.33 \pm 0.06}$ , significantly steeper
than the $d_B^{-1}$ scaling that would follow from naïve inheritance of
HUZ’s single-observer inner-product bound. The observed exponent is
consistent with the clean rational value $-4/3$ across seven values of $d_B$
spanning $\{4,\ldots,16\}$ , and the same prefactor-stable form fits the data
across $d_M$ and non-isometry $\rho$ as predicted by second-order Weingarten
moments. Our results confirm that observer complementarity gives a clean
analytic bound on multi-observer disagreement in the non-isometric-code
setting, with $\sqrt{(1-\rho)/\rho}$ non-isometry dependence and
$O(1/\sqrt{d_M})$ matter-sector suppression, but the $d_B$ -scaling
itself is a new result that cannot be read off from the single-observer
case, and whose derivation is an analytic target for future work.

The paper narrative is “clean bound with a non-trivial exponent”:
numerical result in hand, analytic target well-posed.

7.3 Open questions and future phases

Phase 6: how does $\alpha$ change for structured bulk states (TFD,
low-complexity) versus Haar? Phase 3 showed that single-observer Page
results hold for any fixed $\psi$ when $V$ is Haar, but two-observer HUZ
involves cloning-induced structure that may not inherit this robustness.
Phase 7: can $\alpha = -4/3$ be derived from a proper two-observer
Weingarten computation? The key obstruction is that neither
$E_{\mathrm{ovl}}$ nor $E_{\mathrm{td}}$ alone captures the correlated- $V$
coupling between $\rho_{R_A}$ and $\rho_{R_B}$ ; a new calculation, likely
a fourth-order Weingarten moment, is required. If $\alpha = -4/3$ is
derivable, the paper has a closed-form answer that the numerics saturate.
Higginbotham’s critique (2512.17993): EGH’s specific SWAP observables
are suboptimal; refined operators change the $\alpha/\beta$ answer. Our
Phase 3 reproduces EGH’s original observables. A follow-up could
implement Higginbotham’s operators and test whether the Gap 5 disagreement
is sensitive to the operator choice.
Larger $d_B$ : the current scan tops out at $d_B=16$ . Extending to
$d_B = 24$ or $32$ would pin $\alpha$ to the stretch-gate $\pm 0.05$
precision and definitively distinguish $-4/3$ from nearby rationals like
$-7/5$ or $-11/8$ . This requires ~ $10^4$ s of wall time per point and is
a natural next step.

End of memo. All numerical results reproducible from seeds documented in the
phase-N driver scripts. CSV tables are the ground truth for every number
quoted here.

Paper 2 Spin-Up: Phase 8+

Phases 8+ in-progress

Complete context for continuing the observer-complementarity research program in a fresh thread. Read this document end-to-end before starting work. Everything in this document is canonical; everything in the transcripts is reference-only.

Author-PI: Adam R. Cagle, Bend OR, adamrcagle@gmail.com. Assistant: Claude (Anthropic). This is the second research program in an ongoing collaboration; the first produced a tier-1 paper currently under arXiv review.

Part 1, Where we are right now

The short version

Paper 1 is written, compiled, and headed for arXiv.

Title: Complexity-Sensitive Complementarity in Non-Isometric Holographic Codes.
Author line: Adam R. Cagle, with Claude (Anthropic) as computational collaborator.
Compiled PDF: /mnt/user-data/outputs/latex/paper.pdf (39 pages, 670 KB).
arXiv-ready tarball: /mnt/user-data/outputs/arxiv_submission.tar.gz (169 KB).

Submission status at the time this doc was written:

arXiv account being created by Adam at adamrcagle@gmail.com.
First-submission endorsement required. Primary target: Kenneth Higginbotham (Perimeter Institute, khigginbotham1@pitp.ca), whose paper 2512.17993 is cited in our §8.2. Backup: Chris Akers (CU Boulder). Second backup: Elliott Gesteau (MIT).
Endorsement email sent; waiting window 10 days before moving to backup.
After arXiv posts, submit to JHEP (one-line swap from \bibliographystyle{unsrt} to \bibliographystyle{JHEP} in paper.tex).

Companion piece is written: /mnt/user-data/outputs/companion_piece_going_to_the_edge.md. ~2,700 words, storytelling voice, targeting general readers. Tells the “generalist + AI + Physics 101 → edge of the field” story with honest caveats. Awaiting paper’s arXiv posting before release.

Do NOT continue editing Paper 1 unless Adam explicitly asks. It’s done. The next thread’s job is Paper 2.

The physics story in one paragraph

Two observers reconstructing a quantum state through a random smudgy encoding (a non-isometric holographic code with HUZ observer cloning) typically disagree about the state’s von Neumann entropy. We proved that the scale of the disagreement depends on the complexity class of the bulk state being reconstructed, with an integer-valued exponent gap: Product-class bulk (rank-1 marginal) gives $\mathbb{E}|S_A - S_B| \approx 0.608/\sqrt{d_B}$ ; Haar-class bulk (maximally mixed marginal) gives $\mathbb{E}|S_A - S_B| \approx (\sqrt{2/\pi}/d_M)\cdot d_B^{-3/2}$ . Both scalings follow from a single structural identity: $\mathbb{E}_V[\rho_{R_A}] = \diag(\rho_A^{\text{bulk}}) + O(1/d^2)$ . All results verified at multi-level numerical precision including out-of-sample tests.

What’s in /mnt/project/

Full research library, 63 files, 2.7 MB. Organized as:

Paper 1 artifacts: paper.pdf, paper_full_draft.md, paper_appendix_A_draft.md, paper_appendix_B_draft.md, A_Little_Help_for_my_Friends.md
Previous spin-up: bh_new_thread_spinup.md (Paper 1 setup; now historical)
Phase memos: phase1-4_research_memo.md, phase1-5_research_memo.md
Physics reference library: 26 QFT documents, plus foundations (Physics 101-102, Lagrangian, SR/tensors, GR, classical fields, condensed matter, stat mech, particle physics, math for QM, modern physics, QM worked problems)
Ideas library: ten_astounding_things.md, ten_neglected_problems.md, ten_unsolved_problems.md
Code: vnalgebra.py, crossed_product.py, trace_on_crossed.py, finite-dim von Neumann algebra toolkit; README.md documents what works and what doesn’t. Phase 1-2 Python drivers (phase1_rules_canonical.py, phase2_huz_verification.py) + CSVs.

What’s in /mnt/user-data/outputs/

All Phase 1,7 drivers and data (28 .py files, 25 .csv files, phase memos, paper drafts, figures, LaTeX build). Any of this can be referenced or reused. In particular:

Canonical backend: the Phase 5+ scripts import from an inline backend embedded in each driver. There is no standalone bh_lab_backend.py file, the backend functionality (Haar sampling, AEHPV map, two-observer entropy computation) is replicated in each Phase 5+ script. First task in Phase 8: factor this into a canonical bh_lab_backend.py so Phase 8 scripts can import cleanly.

Part 2, The five results of Paper 1 to keep in working memory

Every result below is exactly verified in the paper and its appendices. Cite these when they apply; don’t re-derive them.

Result 1: Structural identity (Theorem 3.2)

For any bulk state $|\psi\rangle \in \mathcal{H}_A \otimes \mathcal{H}_B \otimes \mathcal{H}_C$ and the two-observer HUZ setup with AEHPV non-isometric code $V$ ,

\mathbb{E}_V[\rho_{R_A}] = \rho_A^{\text{bulk}} + O(1/d^2), \qquad \mathbb{E}_V[\rho_{R_A}]_\text{diag} = \diag(\rho_A^{\text{bulk}}) + \text{subleading}.

This is the master identity. Both subsequent theorems follow by computing $\text{Var}(H(\diag \rho_A^{\text{bulk}}))$ in different bulk-state classes.

Result 2: Product class (Theorem 4.2)

For $|\psi\rangle = |\psi_A\rangle \otimes |\psi_B\rangle \otimes |\psi_C\rangle$ with each factor Haar:

\mathbb{E}|S_A - S_B| = \sqrt{\frac{4(\pi^2/3 - 3)}{\pi\, d_B}}\,(1+o(1)) \approx 0.6076\, d_B^{-1/2}.

Exponent $\alpha_P = -1/2$ exact. Prefactor exact in closed form.

Result 3: Haar class (Theorem 5.2)

For $|\psi\rangle$ Haar on $\mathcal{H}_\text{eff}$ :

\mathbb{E}|S_A - S_B| = \sqrt{\frac{2}{\pi}} \cdot \frac{1}{d_M\, d_B^{3/2}}\,(1+o(1)) \approx \frac{0.798}{d_M}\, d_B^{-3/2}.

Exponent $\alpha_H = -3/2$ exact. Prefactor exact in closed form.

Result 4: The exponent gap

\alpha_P - \alpha_H = 1 \quad\text{exactly.}

One full power of $d_B$ per unit of bulk-marginal regularity. This is the central physical statement of the paper.

Result 5: Haar subleading structure (empirical, open analytically)

\frac{\mathbb{E}|S_A-S_B|}{(\sqrt{2/\pi}/d_M)\cdot d_B^{-3/2}} = 1 - \frac{1.1331}{d_B} + \frac{4.6538}{d_B^2} + O(1/d_B^3)

fit with $\chi^2/\text{dof}=2.83/4$ across $d_B \in \{16,24,32,48,64,96\}$ . Origin presumably non-Gaussian corrections to the Isserlis identity used in §5.3. Analytic derivation is an explicit open question from the paper’s §8.7; if Phase 8 work produces this, it’s a clean follow-up paper by itself.

Part 3, The PI’s choice: Option A (stay the course)

Adam elected to continue down the current research track rather than pivot to a different field (Option B was strong-field QED / Schwinger pair production). Option A means: take the abstract complexity-sensitive complementarity framework and push it toward the actual black hole interior question that originally motivated the program.

The rest of this document lays out what that concretely means.

Part 4, My complete thinking on what should come next

This section is the honest research-strategy assessment. Read it carefully; it shapes everything that follows.

The three obvious extensions of Paper 1

Extension A: Rank-r interpolation scan. Schmidt rank $r$ of the bulk marginal interpolates between $r=1$ (Product class) and $r = d_\text{eff}$ (Haar class). Scan intermediate ranks; determine whether $\alpha(r)$ is smooth or shows phase-transition-like behavior at some critical $r^*$ . Flagged explicitly as §6.5 of Paper 1. Computationally identical to what we already did; only the bulk-state generation changes.

Extension B: Analytic derivation of subleading $B=-1.13, C=4.65$ . Requires non-Gaussian corrections to the Isserlis identity, plus bulk-norm fluctuation corrections. Pure analytic work. Could be a short technical paper.

Extension C: Apply to the actual black hole interior. Take the framework and connect it to evaporating black holes. This is what Adam actually originally wanted to do.

The rank- $r$ scan (A) is the obvious next paper but it’s narrow. It confirms or refines a prediction we already have. The value is incremental, a Tier-2 result at best, probably a Tier-3 technical note.

The analytic subleading (B) is cleaner mathematically but it’s orthogonal to physics. It tightens a number; it doesn’t open a new question.

The black-hole-interior extension (C) is the scientifically interesting direction and it validates the generalist-plus-AI methodology in a second, related-but-distinct domain. It also answers Adam’s original motivating question.

My proposed structure: Phase 8 = A as bootstrap (ground the rank- $r$ behavior to use as input for C). Phase 9 = C proper. Phase 10 = analytic cleanup if time/interest permits.

What “applying to the actual black hole interior” means concretely

An evaporating black hole’s interior state changes complexity over the course of evaporation:

Early times (before Page time $t_P$ ): The interior is “simple.” Most infalling matter is coherent with the exterior; the bulk marginal $\rho_A^\text{bulk}$ seen by an infaller has low Schmidt rank across the $A$ :(rest) cut. By Paper 1’s framework, Product-like scaling: observer disagreement falls off slowly as $d_B^{-1/2}$ where $d_B$ is the dimension of observable interior factor.
Around the Page time: The interior is maximally entangled with Hawking radiation. The bulk marginal is near maximally mixed. Haar-like scaling: disagreement falls off rapidly as $d_B^{-3/2}$ .
Late times (post-Page): Interior remains complex; radiation carries the entanglement. Scaling stays Haar-like.

The prediction: Observer disagreement exhibits a crossover at the Page time, with exponent changing by 1. This is a concrete, testable, novel statement about the information paradox.

Whether this is exactly right depends on how the AEHPV non-isometric-code construction maps onto the Penington-Shenker-Stanford-Yang replica-wormhole picture of the evaporating black hole. Both are about the same physics; the translation between frameworks is not trivial but is partially done in the literature (AEHPV itself, Akers-Engelhardt-Harlow-Penington-Vardhan 2207.06536 §7-8). The operational question for Phase 9 is: can we compute $\mathbb{E}|S_A - S_B|$ for an infaller vs. an asymptotic observer in a time-parameterized AEHPV code, and does the scaling crossover appear?

If it does, that’s Paper 2, potentially Tier 1.
If it doesn’t, that’s still Paper 2, a carefully-checked null result that refines the mapping between frameworks.
Either way publishable.

How this relates to the firewall debate

The Almheiri-Marolf-Polchinski-Sully (AMPS) firewall argument says: for a black hole past the Page time, an infalling observer should see a firewall if we assume unitarity, locality, and the equivalence principle. The observer-complementarity framework is explicitly a response to this tension.

Our complexity-sensitive complementarity result says: the entropy-level disagreement between the infaller’s and the asymptotic observer’s reconstructions of the interior changes scaling at the Page time. This is adjacent to, though not directly, the question of whether the firewall appears. Specifically, a large entropy disagreement between observers does not automatically imply a firewall, but a vanishing entropy disagreement would impose structural constraints on what either observer can see.

The honest statement: our framework will probably not resolve AMPS one way or the other. But it puts quantitative bounds on a specific observable (the expected entropy disagreement) that’s adjacent to the firewall question, and sharpens the language.

Risks and failure modes

The AEHPV→evaporating-BH translation is harder than it looks. Bearable; this is what Phase 9.0 is for, and if the translation is hard, clarifying it is itself a paper.
The scaling crossover is real but the prefactors/exponents differ from Paper 1’s values due to physical-state constraints. Still publishable, just retunes the numbers.
The scaling crossover doesn’t appear because the bulk complexity doesn’t increase in the relevant way. Would be a null result. Still publishable if the null is solid.
We get scooped. Possible. The observer-complementarity frontier is moving fast. Mitigation: publish Paper 1 on arXiv ASAP, work fast on Phase 9, identify a specific target (early-vs-late-Page scaling crossover) and don’t get distracted by adjacent extensions.
The question has already been answered and we don’t know it. Lit review in Phase 8.0 should catch this. If any of Penington, Harlow, Engelhardt, Akers, or Gesteau has published on “time-dependent observer-complementarity scaling during evaporation,” we need to know before Phase 9.

What success looks like

Paper 2’s one-sentence pitch: “The integer exponent gap of complexity-sensitive complementarity manifests in evaporating black holes as a scaling crossover at the Page time, with an infalling observer and an asymptotic observer agreeing at qualitatively different rates before and after.”

That’s a tier-1 result if the analysis holds. If the scaling crossover is subtler than this, say, continuous in complexity rather than integer-stepped, the paper is still interesting as the first quantitative characterization of the infaller-vs-asymptotic entropy difference during evaporation.

Part 5, Phase-gated research plan

Each phase has a gate. Do not proceed to the next phase until the gate is passed. If the gate reveals something unexpected, pause and assess before continuing.

Phase 8.0, Housekeeping and literature refresh

What:

Factor the embedded backend code out of the Phase 5+ scripts into a canonical bh_lab_backend.py for Phase 8+ reuse.
Fresh arXiv search for papers posted 2025-11-01 through current, using queries: “observer complementarity non-isometric”, “Page time infaller entropy”, “non-isometric code evaporation”, “rank-r random holographic code”, “time-dependent observer complementarity”.
Read AEHPV 2207.06536 §7-8 (time-dependent extensions) carefully; the explicit evaporation construction is there.
Skim Penington-Shenker-Stanford-Yang 1911.11977 and Almheiri-Engelhardt-Marolf-Maxfield 1911.12333 as the canonical sources for Page curve computation via replica wormholes.
Flag anything published since Paper 1 started that changes the game.

Gate: no scoop found; literature landscape understood; backend factored and passing its own self-test.

Phase 8.1, Rank-r bootstrap scan

What: Implement Schmidt-rank- $r$ bulk state generator. For each $r \in \{1, 2, 4, 8, 16, ..., d_\text{eff}\}$ and dimension grid $d_B \in \{6, 8, 12, 16, 24\}$ , sample $N=100\text{,}300$ bulk states, compute $\mathbb{E}|S_A - S_B|$ in the full HUZ+ $V$ pipeline. Fit $\alpha(r)$ .

What to watch for: Is $\alpha(r)$ smooth (no surprises) or does it show a step/transition? Is the prefactor $c(r)$ smooth? The naive guess is smooth monotonic interpolation; any deviation is a finding.

Deliverable: phase8_rankr_scan.py, phase8_rankr_table.csv, short phase8_rankr_memo.md with the fit results and plot.

Gate: fit of $\alpha(r)$ completed with error bars; whether smooth or not, we know.

Phase 8.2, Analytic rank-r structural identity (optional; skip if not tractable)

What: Attempt to derive $\mathbb{E}|S_A - S_B|$ analytically for fixed $r$ . The structural identity (Theorem 3.2 of Paper 1) applies for any $r$ ; the question is computing the variance of the Shannon entropy of the rank- $r$ bulk marginal’s diagonal. This is a generalized Dirichlet-moment calculation.

Deliverable: If successful, phase8_analytic_rankr.md with the closed-form scaling law. If not, honest write-up of where the calculation breaks down.

Gate: Either we have the analytic form or we have a sharp understanding of why we don’t.

Phase 9.0, AEHPV → evaporating BH translation

What: Map the abstract AEHPV non-isometric code setup onto the Penington-style evaporating black hole. Specifically:

Identify $V: \mathcal{H}_\text{eff} \to \mathcal{H}_\text{fund}$ with the black hole’s effective-to-fundamental map at time $t$ .
Identify the observer factors $\mathcal{H}_A, \mathcal{H}_B$ with physical degrees of freedom (infaller local Hilbert space vs. asymptotic observer).
Identify the time parameter $t$ with the complexity parameter of the bulk state.

Key technical subquestion: How does the bulk marginal’s rank grow with $t$ during evaporation? Short answer (to be verified): before the Page time, rank is $O(e^{S_\text{matter}(t)})$ which is small; after the Page time, rank is $O(e^{S_\text{rad}(t)})$ which is $\sim e^{S_\text{BH}}$ .

Deliverable: phase9_aehpv_bh_mapping.md, a detailed mapping document, paralleling Part 10 of the old spin-up doc. Every variable in the AEHPV formalism labeled with its physical BH counterpart.

Gate: The mapping is clean enough that every quantity in Paper 1’s framework has a well-defined BH meaning. Ambiguities identified and flagged.

Phase 9.1, Time-dependent two-observer computation

What: Implement the time-parameterized non-isometric code. For a sequence of times spanning early → Page → late, compute $\mathbb{E}|S_A - S_B|$ where $A$ is the infaller-interior observer and $B$ is the asymptotic observer. This is computationally similar to Paper 1’s Phase 5 but with time-dependent bulk state and code.

What to watch for: Does the measured scaling exponent change at $t_P$ ? If yes, by how much? If no, why not?

Deliverable: phase9_time_dependent_scan.py, phase9_page_curve_scan.csv, phase9_crossover_plot.pdf, phase9_memo.md.

Gate: We have the data. Whether or not the crossover appears, we know.

Phase 9.2, Interpretation and writeup

What: Based on the Phase 9.1 data, write the Paper 2 draft. If the crossover appears as predicted, the paper is about that. If it doesn’t, the paper is about what the actual pattern is and why the naive prediction fails.

Deliverable: Paper 2 first draft.

Gate: draft that stands up to the same internal review standard as Paper 1.

Phase 10+, Possible follow-ups

Phase 10 candidates (pick based on results from 9.1):

Firewall implications if any emerge.
Colorado-rule analog of the crossover computation.
Rank- $r$ extension: does the rank- $r$ scan from Phase 8.1 fit into the evaporating-BH picture at specific intermediate times?
Analytic subleading (orphan from Paper 1’s §8.7).

Part 6, Tool and library stack

Already in the Claude sandbox (no install)

All carried over from Paper 1’s environment:

numpy 2.4+ scipy 1.17+ sympy 1.14+ mpmath 1.3+  
pandas 3.0+ matplotlib 3.10+ seaborn 0.13+  
networkx 3.6+ scikit-learn

Install at thread start

pip install --break-system-packages cvxpy statsmodels

Same as Paper 1. cvxpy for SDP-based bounds (needed if we do Phase 10 firewall-style bounds). statsmodels for regression with confidence intervals.

New install candidate for Phase 8-9: `pymanopt`

pip install --break-system-packages pymanopt # IF needed for rank-r state parameterization

Trigger: if we need to parameterize low-rank bulk states as points on a Stiefel manifold and sample/optimize over the manifold (e.g., to find worst-case disagreement bounds at fixed rank). Probably not needed for the basic Phase 8.1 scan, which can get away with random Schmidt-decomposition sampling via QR of small Gaussian matrices.

Install conditionally (don’t install preemptively)

qutip, if/when we model evaporation as Lindbladian open-system dynamics (not currently needed; the AEHPV picture handles evaporation combinatorially without needing explicit master equations).
quimb or tenpy, if/when we extend to the Bueller-DeWolfe-Higginbotham tensor-network version of AEHPV to get spatial locality (probably not needed for Phase 8-9; may matter for a Phase 10 firewall-locality connection).
cadabra2, if/when we want symbolic Weingarten/Haar integrals verified automatically (likely needed for Phase 8.2 analytic rank-r work; not needed for Phase 8.1 numerical work).
stim, stabilizer circuits for complexity-protected non-isometry tests (currently speculative; skip unless a specific test demands it).

What NOT to install

Full GR solvers (Einstein Toolkit, SpECTRE, EinsteinPy). Our “black hole” is a finite-dim quantum information object; no metric tensor is ever formed.
Particle physics / collider simulators.
Cosmology packages (CLASS, CAMB).
Machine learning frameworks (PyTorch/TensorFlow). If some optimization needs gradient descent, use scipy.optimize.

Backend architecture

Phase 0 action item: Factor the inline backend code duplicated across Phase 5+ drivers into a canonical file:

/home/claude/bh_research/bh_lab_backend.py

Required functions (all already exist inline in Phase 5-7 scripts; just need to be pulled out):

SeededRNG wrapper with haar_unitary(n), haar_state(d), aehpv_map(d_eff, d_fund)
huz_clone_single(psi, d_Ob, d_M), single-observer HUZ
huz_clone_two(psi, dA, dB, dM, V), two-observer HUZ with shared V
two_observer_entropies_fast(psi, dA, dB, dM, V), the einsum-contracted fast version (important: naive density matrix formation is 270GB+ at $d_B=16$ )
product_bulk_sample(dA, dB, dM, rng), sample from Product class
haar_bulk_sample(d_eff, rng), sample from Haar class
rankr_bulk_sample(dA, dB, dM, r, rng), NEW for Phase 8, sample from rank- $r$ class
compute_dS_with_V(psi, V, dA, dB, dM), full HUZ+V pipeline

Self-test: reproduce one row from phase5_extended_scan.csv at $d_B=4$ to float precision.

Part 7, Reference material inventory

What exists and is good

All 63 files in /mnt/project/. The 26-document QFT sequence ( $\approx$ 1100 KB of text) covers everything from scalar field quantization through supersymmetry, string theory, and black holes + quantum gravity. For Phase 8-9 specifically, the relevant existing references are:

qft_24_holography_ads_cft.md, AdS/CFT correspondence, bulk reconstruction
qft_25_black_holes_and_quantum_gravity.md, Bekenstein-Hawking, Hawking radiation, Page curve, island formula, QES
qft_13_finite_temperature.md, thermal field theory context for evaporation
math_for_quantum_mechanics_reference.md, linear algebra, complex analysis, relevant for Dirichlet moments and Weingarten

What we may need to add

Candidates for new reference docs, to be created on demand when Phase 9 math starts needing them:

Replica wormholes in detail. The 2019 Penington-Shenker-Stanford-Yang and Almheiri-Engelhardt-Marolf-Maxfield papers computed the Page curve via replica wormholes. The key calculation is a saddle-point argument over connected vs. disconnected geometries. We’d want a dedicated doc walking through this. ~30-50 pages if done thoroughly.
Random tensor networks and holographic codes. Hayden-Qi-Thomas-Walter 2016 is the seminal paper. Models holographic codes as random tensor networks. AEHPV extends this. We already have pieces of this in qft_24 but a consolidated reference would help. ~30 pages.
Quantum extremal surfaces and islands. Canonical derivation of the QES prescription, specific application to evaporating BHs. May already be covered sufficiently in qft_25; verify when it becomes relevant. ~20 pages if needed.
Worldline methods for pair production / instanton calculus. Would only be needed if Phase 10 pivots to non-perturbative QFT on curved backgrounds. Skip unless triggered.

Do not generate these preemptively. Generate when the specific need arises.

The ideas library

ten_astounding_things.md, ten_neglected_problems.md, ten_unsolved_problems.md, still relevant as prompts for pivot ideas if Phase 9 stalls. Not directly actionable for Phase 8-9 but worth keeping in mind.

Part 8, Operational protocols

Carry these over from Paper 1. They worked.

Wall-time budget

Each bash invocation has approximately a 5-minute ceiling. For any run exceeding this:

Checkpoint every N samples, pickle partial results every 25-50 samples.
Use independent batch seeds, BASE_SEED + d_B*10000 + batch_index so reruns extend coverage, don’t duplicate samples.
Einsum everything, for two-observer entropy at $d_B \geq 10$ , the naive $|\Psi\rangle\langle\Psi|$ formation eats memory. Use direct tensor contractions.

All three mechanisms are demonstrated in the Paper 1 Phase 5 drivers (phase5_boost_point.py, phase5_money_plot.py). Port them into the Phase 8 backend.

Seed conventions (carried forward from Paper 1)

Phase 8 BASE_SEED = 808080 per-point: BASE + d_B*10000 + r*100 + batch  
Phase 9 BASE_SEED = 909090 per-point: BASE + t_step*10000 + d_B*100 + batch  
Phase 10 inline or 101010

Figure conventions

Paper 1’s figures used the seaborn palette with $1\sigma$ statistical error bars, log-log for scaling plots, log-linear for ratio plots, and SEM (not SD) on all error bars. Keep this. Paper 2’s figure plan should be drafted early in Phase 9 so the Phase 9.1 scan produces data in the right format.

Writing conventions

Prose over bullets in paper body, bullets only for explicit enumerations.
No em-dashes anywhere in companion pieces (Adam’s style preference, PI flagged this during Paper 1 assembly).
Honest calibrated reporting: flag σ-levels, confess when something is empirical rather than derived.
Preserve provenance: each phase produces its own memo; don’t delete earlier memos when assembling later ones.

PI interaction protocol

Adam runs the program. His preferences (inferred from Paper 1 collaboration):

Pause after each phase. Don’t marathon through without checkpoints.
Report honestly, including when something doesn’t work.
Push back when something is wrong. He caught the $-4/3$ exponent artifact; he’ll catch other things.
Storytelling matters to him. He likes the dirty-window and symphony analogies. Use metaphor when explaining results.
He’s not a physicist. Technical precision when needed, plain English when possible. If a result requires fancy math, derive it cleanly; if it doesn’t, state it plainly.
Companion pieces are part of the deliverable, not a PR afterthought. He cares about the method story, not just the physics story.

Part 9, What we’re NOT doing

Same scope rules as Paper 1, carried forward:

Not doing full numerical relativity or metric-tensor calculus. Our black hole is a finite-dim QI object throughout.
Not doing astrophysics (GW signals, black-hole masses, accretion physics).
Not claiming to solve the information paradox. We’re quantifying one specific entropic observable in one specific framework.
Not claiming to resolve AMPS firewalls. Our framework is adjacent to the question, not a direct attack on it.
Not engaging the “is consciousness involved in measurement” interpretive debate. The observer is a Hilbert-space factor with a pointer basis.
Not building a BH software framework to compete with Einstein Toolkit or similar. We use finite-dim tools only.

Part 10, Calibration for Phase 8-9

Honest assessment for the new thread’s operator:

Probability Phase 8.1 (rank-r scan) produces a clean, publishable result: ~80%. The math is a direct extension of Paper 1; the main risk is unexpected phase-transition behavior that changes the framing.
Probability Phase 9 (evaporating BH) produces a tier-1 result: ~25%. Higher than Paper 1’s 30% a priori estimate because we now have Paper 1’s framework validated. Lower because the AEHPV→BH translation has genuine physics risk.
Probability Phase 9 produces any publishable result: ~70%. Even null results are publishable if the framework is clean.
Probability of getting scooped during Phase 9: ~30%. The frontier is moving fast; mitigation is Paper 1’s arXiv posting establishing priority on the framework.
Probability Paper 2 ships within 3 months of Paper 1’s arXiv posting: ~50% if the PI stays engaged and works Phase 8-9 efficiently.

These are point estimates, not promises. Update as data arrives.

Part 11, Hand-off checklist for the new thread

First message in the new thread should kick off this sequence:

Read this entire spin-up document end to end.
Read Paper 1’s full draft at /mnt/project/paper_full_draft.md if not already internalized. Especially §3 (structural identity), §4.2 (Theorem 1), §5.4 (Theorem 2), §6 (physical interpretation including the rank- $r$ subsection §6.5), and §8 (discussion with open questions).
Check arXiv for papers posted since 2026-01-31 matching “observer complementarity”, “non-isometric code”, “Page time scaling”, “HUZ cloning”. Flag anything that scoops Phase 9.
Factor bh_lab_backend.py from Phase 5+ drivers in /mnt/user-data/outputs/. Verify self-test by reproducing one phase5_extended_scan.csv row.
Run Phase 8.0 literature refresh; produce phase8_lit_memo.md.
Proceed to Phase 8.1 only after Adam confirms.

Part 12, The big picture

Adam is running a research program to demonstrate that a motivated non-specialist, paired with AI acting as a domain-expert postdoc, can produce real research at the frontier of a technical field. Paper 1 validates this in observer complementarity for non-isometric codes. Paper 2 (if it works) validates it at a physically concrete application: the infaller-vs-asymptotic observer scaling crossover in evaporating black holes.

The two-paper arc, abstract framework, then physical application, is a natural research program structure. It’s what a real postdoc-PI pair in this field would do. That’s the point.

Everything else, the companion piece, the eventual book, the method writeups, follows from getting the physics right. The physics is the thing.

Go.