This appendix proves Theorem 1 (§3.5) for the Haar bulk class: the von
Neumann entropy of the actual observer-reduced state may be replaced by the
Shannon entropy of its diagonal, with an error subleading to the two-observer
signal by a full power of d. All constants below are dimension-independent;
we write D=d2dM for the effective dimension, ρ∈(0,1) for the
non-isometry ratio, and X∈{A,B} for the two observers.
Throughout, ρRX is the observer-X reduced state of the HUZ-included,
V-mapped state (§3.1). Two diagonals enter, and must be kept distinct:
with PX depending only on ∣ψ⟩. The full entropy-replacement error is
taken against the bulk-marginal diagonal PX (the object the scaling laws of
§§4–5 use), and splits into an off-diagonal and a diagonal-to-bulk part:
The off-diagonal perturbation is EX:=ρRX−DX. Sections C.1–C.5
bound Foff (the resolvent machinery); §C.6 adds the short diagonal-to-bulk
bound (Lemma C.6) and assembles the full statement. We prove:
Theorem C.1 (entropy replacement, Haar class). In the joint Haar measure
on bulk and V, with dA=dB=d and dM, ρ fixed, the replacement
against the bulk-marginal diagonal obeys
with both Foff and Fdiag of order O(d−4dM−2) in L2.
Since the Haar two-observer signal has variance Θ(d−3dM−2)
(Theorem 2), the replacement-error variance is suppressed by one power of
d, and Theorem 2 is unconditional.
The proof has four ingredients: an exact resolvent representation of FAB
(§C.1); a linear bound reducing to a bulk-marginal moment (§C.2–C.3); a
nonlinear bound controlled by a fourth moment of EX (§C.4); the
fourth-moment estimate itself (§C.5); and a diagonal-to-bulk bound for
Fdiag (Lemma C.6). The argument is assembled in §C.7.
C.1 An exact antisymmetric resolvent representation
Write the von Neumann entropy through the integral representation
S(σ)=−Tr(σlogσ) and the resolvent identity
logσ=∫0∞[(1+t)−1−(σ+t)−1]dt. Applied
to the antisymmetric combination S(ρRA)−S(ρRB) and to the
diagonal pair H(DA)−H(DB), and subtracting, the constant and single-resolvent
terms cancel between the two observers, leaving an exact representation of the
difference Foff. After the substitution t=s/d (Jacobian ds/d),
This is exact – all orders in the perturbation EX – and is the antisymmetric
analogue of the single-observer resolvent identity of Engelhardt–Gesteau–Harlow.
The integrand is concentrated near the eigenvalue scale t∼1/d
(equivalently s∼ρ−1); integrability at both ends follows from
∥EX∥F≤2 and ∥(⋅+t)−1∥op≤1/t.
Numerical check. The representation reproduces the direct entropy difference
with correlation 1.0000 and unit slope across d∈{4,5,6}, with the
integrand peaking at s=td≈0.5.
We split the integrand by one application of the resolvent identity
(ρRX+t)−1=(DX+t)−1−(DX+t)−1EX(ρRX+t)−1,
giving Ys=Yslin+Ysnl with
where GX:=(DX+t)−1 and GX:=(ρRX+t)−1. The two
pieces are bounded in §C.2–C.3 and §C.4 respectively.
C.2 The linear bound
The linear integrand is a weighted version of the bulk-marginal object of §5.
Writing the diagonal entries pXa=1/d+δaX and centering, define
the weighted operator HX(t)=∑a(pXa+t)−2QX(a),
where QX(a) is the centered single-block operator of §C.3; then
Tr[(DA+t)−2EA]−(A→B)=ρ−1Tr[(PV−ρI)(HA(t)−HB(t))].
Lemma C.2 (linear bound).EψTr[(HA(t)−HB(t))2]≤(1+s)6dMCd2,
C absolute, hence E[Foff2]lin=O(d−4dM−2).
Proof. The weight wXa=(pXa+t)−2 satisfies, by the mean value
theorem on the event {pXa≥c0/d} (whose complement is exponentially
rare), ∣wXa−u−2∣≤K(s)∣δaX∣ with u=(1+s)/d and
K(s)=c032d3(1+s)−3. Thus HX(t) has the form of the §C.3
object GX=∑aδaXQX(a) with weights bounded by
K(s)∣δaX∣. The base lemma’s proof (§C.3) uses only the pointwise bound
∥HA−HB∥F2≤4maxXTr(HX2) and the exact single-block trace
formula, both valid for any weight; pulling out K(s)2 and using the
single-observer base bound EψTr(GX2)≤C1/(d5dM)
gives EψTr[(HA−HB)2]≤4K(s)2C1/(d5dM)=c0616C1d(1+s)−6dM−1≤Cd2(1+s)−6dM−1.
Then, with Yslin=tρ−1Tr[(PV−ρI)(HA−HB)] and
EV∣Tr[(PV−ρI)M]∣2=κ(∥M∥F2−D−1∣TrM∣2)
(Lemma C.3 below), Minkowski in L2(ψ) and ∫0∞s(1+s)−3ds=21
give E[Foff2]lin=O(1/(d4dM2)). □
Numerical check. The constant C=EψTr[(HA−HB)2]⋅(1+s)6dM/d2 measures 1.44,1.33,1.21,1.09,1.01 at d=4,…,8
(dM=2), decreasing toward the leading value; the dM-scaling is exactly
1/dM.
C.3 The base moment lemma
The single-block object is GX=∑aδaXQX(a), where
QX(a)=QX(a)−pXaQ centers the rank-one block operator
QX(a)=∑b∣ϕab⟩⟨ϕab∣ against the total
Q=∑ab∣ϕab⟩⟨ϕab∣. With pab=∥∣ϕab⟩∥2
the block masses (a flat Dirichlet vector on d2 categories of concentration
dM), the relevant block weights are WX(a)=∑bpab2 and
Wblock=∑abpab2.
Lemma (base moment bound).EψTr[(GA−GB)2]≤d4dMC0,
with C0 a dimension-independent constant.
Proof. Pointwise ∥GA−GB∥F2≤2∥GA∥F2+2∥GB∥F2≤4maxXTr(GX2) (the cross term cancellation is not needed). The
single-observer trace has the exact closed form
with SWX=∑aδaXWX(a). The dominant term is the first; by
Cauchy–Schwarz, E∑a(δaX)2WX(a)=dE[(δ1X)2WX(1)]≤dE[(δ1X)4]E[(WX(1))2]. The fourth
central moment of the marginal δ1X (a centered Beta(ddM,d(d−1)dM)
variable) is, in closed form, E[(δ1X)4]=(3+γ2)μ22
with
so E[(δ1X)4]≤7μ22 for d≥3; and E[(WX(1))2]
is the exact Dirichlet moment (d(dM)4+d(d−1)[(dM)2]2)/(D)4.
Substituting the scalings μ2=Θ(d−3dM−1) and
E[(WX(1))2]=Θ(d−6dM−2) gives the dominant term
Θ(d−5dM−1) and the stated bound after multiplying by 4. The
remaining terms are smaller by 1/d. □
Numerical check.EψTr[(GA−GB)2]⋅d4dM=0.40,0.38,0.35,0.30,0.25 at d=4,5,6,8,10; the closed-form γ2
matches Monte Carlo to three digits.
C.4 The nonlinear bound
Lemma (nonlinear bound).E[Foff2]nonlin=O(d−4dM−2),
subleading to the signal by 1/d.
Proof. By Schatten–Hölder applied to Ysnl,
Tr[GXEXGXEXGX]≤∥GX∥op2∥GX∥op∥EX∥F2,
with ∥GX∥op≤1/t deterministically (ρRX⪰0)
and ∥GX∥op≤min(Cd,1/t) on the good event. Hence, with t=s/d,
∣Ysnl∣≤dsmin(Cd,d/s)2sd(∥EA∥F2+∥EB∥F2).
Minkowski in L2 and the kernel integral ∫0∞dsmin(Cd,d/s)2sdds=2Cd2
give E[Foff2]nonlin≤d1⋅2Cd2⋅2E∥EX∥F4.
The fourth moment E∥EX∥F4=O(d−6dM−2) is supplied by §C.5,
whence E[Foff2]nonlin=O(d−4dM−2). The bad event
contributes poly(d)⋅e−cddM, negligible. □
Numerical check. The actual nonlinear contribution is
E[F2]nonlin/signal=0.011,0.005,0.003 at d=4,5,6,
well inside the certified bound.
C.5 The fourth-moment estimate
The nonlinear bound rests on a fourth moment of the off-diagonal perturbation
EX. This follows from a single projector estimate plus a grouped-Dirichlet
moment bound.
Lemma C.3 (fourth-moment projector estimate). Let PV be a Haar-random
rank-r projector on CD, r=ρD, Π=PV−ρI. For any
trace-zero A on CD (Hermitian or not),
EVTr(ΠA)4≤D2C4∥A∥F4,
with a dimension-independent absolute constant C4.
Proof. Since Tr(A)=0, Tr(ΠA)=Tr(PVA), which
has mean zero. Split A=A1+iA2 into Hermitian trace-zero parts, so that
∣Tr(ΠA)∣4≤8[(TrΠA1)4+(TrΠA2)4]; it
suffices to bound each real piece. Writing PV=UΠrU† with U Haar
on U(D) and gk(U)=Tr(UΠrU†Ak), the map
U↦UΠrU† is 2-Lipschitz in the Frobenius metric, so gk is
mean-zero and 2∥Ak∥F-Lipschitz. By the concentration inequality for
Lipschitz functions on U(D) (Meckes, The Random Matrix Theory of the Classical
Compact Groups, 2019), gk is sub-Gaussian with variance proxy
σk2=c0(2∥Ak∥F)2/D, whence E∣gk∣4≤3σk4≤48c02∥Ak∥F4/D2. Summing gives the claim with C4=384c02. □
Remark. The same estimate also follows from the fourth Weingarten moment of a
Haar rank-r projector; the concentration proof is used only for the O(D−2)
scaling, not the sharp constant. Numerically and by the leading Gaussian pairing,
C4D2/∥A∥F4→3κ2D2→3/16 (the variable Tr(ΠA) is
asymptotically Gaussian with variance κ∥A∥F2,
κ=D(D2−1)r(D−r)); measured 0.18, flat across D=16,32,64,
with kurtosis →3.
The entries of EX are exactly first-order in Π: (EX)aa′=ρ−1Tr(ΠMa′a)/(1+η), where Ma′a are trace-zero
operators (the off-diagonal blocks Na′a and the centered diagonal
QX(a)) and η=ρ−1Tr(ΠQ) is a normalization
fluctuation.
Proposition C.4 (fourth moment of the perturbation).E∥EX∥F4≤d6dM2C, C absolute.
Proof. On the good event G={∣1+η∣≥21} (whose
complement has probability ≤e−cddM and contributes
≤∥EX∥op4Pr(Gc)≤16e−cddM, since EX is a
difference of density matrices so ∥EX∥op≤2),
∥EX∥F4≤16ρ−4(∑a,a′∣Tr(ΠMa′a)∣2)2.
Expanding and taking EV, Cauchy–Schwarz and Lemma C.3 give
EV[∣TrΠMa′a∣2∣TrΠMc′c∣2]≤C4D−2∥Ma′a∥F2∥Mc′c∥F2, so
EV∥EX∥F4≤16ρ−4C4D−2(∑a,a′∥Ma′a∥F2)2.
The bracket bound (Lemma C.5) gives Eψ(⋅)2=O(d−2), and
with D2=d4dM2 the claim follows. □
Proof. The bracket equals B=∑b[(pBb)2−WB(b)]+∑aTr((QX(a))2).
The off-diagonal part has Eψ=Θ(1/d) (exact Dirichlet second
moment) and the diagonal part is O(1/d2), so Eψ[B]=Θ(1/d).
B is a degree-2 polynomial in the Dirichlet masses; the grouped-Dirichlet
fourth moments (as in §C.3) give Varψ(B)=O(1/d4), so
Eψ[B2]=(EψB)2+Varψ(B)=O(1/d2). □
Numerical check.E[B]d=0.98→1.03 and E[B2]d2=0.96→1.06 over d=4–8 (both flat, confirming B concentrates), and
E∥EX∥F4⋅d6dM2=18.0,11.9,9.65 at d=4,5,6
(bounded, decreasing).
C.6 The diagonal-to-bulk bound
It remains to bound Fdiag=[H(DA)−H(DB)]−[H(PA)−H(PB)], the error
from replacing the actual reduced diagonal DX=diag(ρRX) by the
bulk-marginal diagonal PX. Write ηaX:=(ρRX)aa−pXa;
since DX and PX are both trace-one, ∑aηaX=0. The argument
reduces Fdiag to the same base moment that controls Foff, through
the centered block operators of §C.3.
Representation. With QX(a)=∑b∣ϕab⟩⟨ϕab∣ the
block operator and P=V†V, the unnormalized diagonal is
(ρRXunnorm)aa=Tr[PQX(a)], with Haar mean
EV=ρTrQX(a)=ρpXa. Dividing by
Z=∥Ψ∥2 (with EVZ=ρ) and using ρpXa=Tr[ρIQX(a)],
the normalization remainder raX being O(d−2)⋅(ρRX)aa by
Lemma 2. The leading-order entropy term of §C.6 (the Taylor expansion of H about
PX, using ∂H/∂pa=−logpa−1 and ∑aηaX=0,
which removes the constant and the −1) is the linear functional
LX=−da∑δaXηaX,Fdiag=(LA−LB)+(RA−RB),
with RX=O(∑a(ηaX)2/pXa) the quadratic Taylor remainder.
Reduction to the base moment. Substituting the representation,
Replacing each QX(a) by its centered form QX(a)=QX(a)−pXaQ
changes the bracket by (∑aδaXpXa)Q=∥δX∥2Q
(since ∑aδaXpXa=∑a(pXa)2−1/d=∥δX∥2), and
Tr[(P−ρI)Q]=Z−ρ is the norm fluctuation; this difference is
absorbed into the remainder. Hence, with GX=∑aδaXQX(a)
the operator of §C.3,
LA−LB=−ρdTr[(P−ρI)(GA−GB)]+(remainder).
For a uniformly random rank-ρD projector and any fixed Hermitian M, the
projector variance gives VarV(Tr[PM])≤ρ(1−ρ)Tr(M2)/(D−1). Applied with M=GA−GB (fixed by ψ),
Remainders. The normalization remainder (∝Z−ρ, of relative size
O(d−2) by Lemma 2) and the quadratic Taylor remainder RA−RB (a sum of
(ηaX)2/pXa terms, controlled by the same Frobenius/fourth-moment estimates
of §§C.4–C.5 applied to the diagonal entries) each contribute
E[(RA−RB)2]=O(d−4dM−2) or smaller. Collecting:
Lemma C.6 (diagonal-to-bulk replacement).In the joint measure,
E[Fdiag2]=O(d−4dM−2),
the same order as Foff, hence one power of d below the signal variance
Θ(d−3dM−2).□
The reduction is exact: Fdiag and Foff are bounded by the same
quantity EψTr[(GA−GB)2] through the same rank-projector
variance estimate, so no separate constant is introduced. (The independent
Monte-Carlo value E[Fdiag2]⋅d4dM2=0.49,0.36,0.34 at
d=4,5,6 is consistent, but the bound above does not rely on it.)
C.7 Assembly
The off-diagonal error Foff is controlled by combining the linear bound
(Lemma C.2: E[Foff2]lin=O(d−4dM−2)) with the
nonlinear bound (§C.4, via Proposition C.4: E[Foff2]nonlin=O(d−4dM−2)) and Minkowski’s inequality, giving E[Foff2]=O(d−4dM−2). Adding the diagonal-to-bulk bound (Lemma C.6:
E[Fdiag2]=O(d−4dM−2)) through FAB=Foff+Fdiag and Minkowski once more,
E[FAB2]=O(d−4dM−2)=o(d−3dM−2),
which is one power of d below the signal variance Θ(d−3dM−2).
By Cauchy–Schwarz, E∣FAB∣≤E[FAB2]=O(d−2dM−1)=o(d−3/2dM−1). This proves Theorem C.1, hence
Theorem 1 for the Haar class, and makes Theorem 2 unconditional. ■
End-to-end numerical check. Directly, E[FAB2]/signal=0.146,0.094,0.067 and E∣FAB∣/(d−3/2dM−1)=0.280,0.233,0.196 at d=4,5,6 – both decreasing, consistent with the proved O(1/d) and
O(1/d) suppression.
Remark (product class). The representation of §C.1 and the nonlinear bound of
§C.4 hold verbatim for any bulk class. The Haar-specific inputs are the
bulk-marginal moments of §C.3 and §C.5, which use the near-maximally-mixed
Dirichlet structure of the Haar marginal. For the product class the marginal is
rank-one, and the small-mass régime of its diagonal is not controlled by the
present argument; establishing the analogue of Lemma C.5 there would make
Theorem 3 unconditional as well. This is the sense in which Theorem 1
remains conjectural for the product class.