|
← 方法论系列 ← Methodology Series
SAE 方法论(VI)
SAE Methodology (VI)

相变窗口与实验设计

Phase-Transition Windows and Experimental Design

Han Qin (秦汉) · April 2026 ·10.5281/zenodo.19464506

Han Qin 2026-04-07 SAE Methodology Paper VI

一、问题的提出

一个干预有效但试验看不到效果,原因可能不在干预本身,而在试验设计与现象结构的不匹配。

具体地说:如果干预的响应函数具有阈值结构,而处理组内部的实际暴露(realized exposure)高度异质,研究又不测量或不利用这个连续暴露状态,那么二元赋值加二元分析会系统性地稀释越过阈值的受试者的信号。

这个问题不新。药理学的exposure-response分析(FDA Guidance, ICH E4),MCP-Mod的连续剂量-反应估计,threshold regression和change-point analysis,adaptive enrichment design,教育学和康复科学中的implementation fidelity文献——都在不同角度处理"阈值存在时怎么办"。

但这些框架有一个共同的盲点:它们关心阈值在哪里,关心阈值前后的效应差,但不关心一个更基本的几何问题——阈值两侧的距离是否对称

如果从干预开始到阈值的距离(萌芽距离)远大于从阈值到效应完全建立的距离(确立距离),即不对称比r >> 1,则信号稀释不是偶然发生的,而是结构性的。增加样本量不能恢复被稀释的效应量本身,只能更精确地估计一个接近零的数。

这个不对称比r是本文的核心对象。它的存在和r >> 1的先验预测,来自ZFCρ(一种建立在ZFC集合论上的整数复杂度理论;基础框架见Qin, DOI: 10.5281/zenodo.18914682)动态规划递推中的相变窗口。

本文的贡献从ZFCρ的数学结构独立推出,与上述已有文献覆盖了部分重叠的领土,但知识路径不同。本文承认这些重叠并明确定位增量贡献,但不声称继承关系。

二、定义

定义1:三阶段响应结构

设干预的响应函数g(z)依赖于受试者在状态空间中的穿透深度z,具有三段结构:

g(z) = 0,当z < F(萌芽区:微观层面已有扰动,但宏观净效果为零或为负)

g(z) = δ · (z - F) / (E - F),当F ≤ z < E(翻转到确立:净效果从零爬升到最大值)

g(z) = δ,当z ≥ E(确立后:全效应)

其中F是翻转点,E是确立点,δ是真实最大效应量。

定义2:不对称比r

r = F / (E - F)

r度量萌芽距离与确立距离的比值。r = 1时两侧对称;r >> 1时萌芽距离远大于确立距离。

定义3:越翻比π_cross

π_cross = P(Z_i ≥ F | treated)

处理组中实际越过翻转点的受试者比例。当Z_i在处理组内服从指数分布Exp(μ)时,π_cross = exp(-F/μ)。r越大,F越大(固定总窗口宽度),π_cross越小。

三、数学来源:ZFCρ相变窗口

3.1 背景

在ZFCρ框架中,整数复杂度函数ρ_E(n)度量用加法(successor, n+1)和乘法(因子分解与重组)两类路径表示整数n的最小代价。当整数按Ω(素因子个数,含重数)分层时,动态规划递推在Ω空间中展现出从无序相(successor路径主导)到有序相(乘法路径主导)的相变。这个相变是一个渐进窗口而非sharp boundary。

3.2 数据

以下数据来自三个独立来源:h(Ω)来自N=10^{10}规模的预筛数值计算,P(J>0)等步间指标来自N=10^7规模的自相关分析程序anticorr.c,两者均随ZFCρ Paper 44(DOI: 10.5281/zenodo.19247859)发布;第六指标z/√j来自ZFCρ Paper 57的谱分析(N=10^{10})。所有代码和数据可通过各自的Zenodo页面获取。

Ω mean h P(J>0) E[A] w_shielded Var(A) η
2 +0.697 28.7% -0.701 76.0% 0.314 0.103
3 +0.361 57.1% -0.326 70.5% 0.558 0.131
4 +0.004 77.1% +0.087 65.1% 0.720 0.144
5 -0.329 88.4% +0.503 60.3% 0.833 0.155
6 -0.615 93.1% +0.896 57.4% 0.923 0.168
8 -1.056 96.8% +1.572 55.4% 1.093 0.204

指标含义:

mean h(局部凸度): h > 0时successor路径在局部仍有竞争力,h < 0时乘法路径局部绝对占优。h = 0标志有序相完全确立。

P(J>0)(乘法路径胜出概率): Ω层内乘法路径净节省J > 0的整数比例。P(J>0) = 50%是从少数变多数的crossover。

E[A](净步长期望): 乘法路径相对于successor的净代价节省。E[A] = 0是净效果从负变正的翻转点。

w_shielded(Le Chatelier屏蔽率): 涨落被系统频率-强度对冲机制吸收的整数比例。w_shielded = 2/3是自然阈值:低于此值exposed sector主导,Le Chatelier缓冲失效。

Var(A)(步长方差): Var(A) = 1.0标志涨落达到O(1),属有序相内部特征。

3.3 五个Crossover点

步间指标的crossover通过相邻Ω值的线性内插计算。谱指标z/√j通过j到Ω的近似映射(Ω_typ ≈ ln j)定位。

P(J>0) = 50%: Ω = 2 + (50 - 28.7)/(57.1 - 28.7) = 2.750*。萌芽。

z/√j peak(谱crossover): z/√j在j=23处达到峰值5.03(Paper 57数据),对应Ω_typ = ln 23 ≈ 3.14。乘法屏蔽开始压制加法惯性积累。涨落谱强度的peak先于净步长的零交叉——susceptibility先于序参量,和物理相变结构一致。

w_shielded = 2/3: Ω = 3 + (70.5 - 66.7)/(70.5 - 65.1) = 3.710*。Le Chatelier缓冲失效。

E[A] = 0: Ω = 3 + 0.326/(0.326 + 0.087) = 3.790*。翻转(物理上最有意义的crossover)。

h(Ω) = 0: Ω = 4 + 0.004/(0.004 + 0.329) = 4.012*。确立。

指标 阈值 Ω* 阶段 来源
P(J>0) = 50% 多数 2.750 萌芽 Paper 44
z/√j peak peak ≈ 3.14 谱翻转 Paper 57
w_shielded = 2/3 2/3 3.710 缓冲失效 Paper 44
E[A] = 0 3.790 翻转 anticorr.c
h(Ω) = 0 4.012 确立 presieve

3.4 四阶段结构

阶段一:萌芽(Ω ≈ 2.75)。 乘法路径首次在多数整数上胜出,但净效果仍为负。加法惯性开始积累。

阶段二:谱翻转(Ω ≈ 3.14)。 z/√j达到peak。加法惯性积累最大值,乘法屏蔽开始介入但还不占优——"散热器刚开机"。涨落控制权从correlation-dominated转为screening-dominated。发生在步间指标翻转之前。

阶段三:翻转(Ω ≈ 3.79)。 E[A] = 0,净效果从负变正。Le Chatelier屏蔽率几乎同时跌破2/3(Ω ≈ 3.71)。

阶段四:确立(Ω ≈ 4.01)。 h = 0,successor路径失去局部竞争力。有序相完全确立。

3.5 不对称比的提取

萌芽到翻转:3.79 - 2.75 = 1.04 翻转到确立:4.01 - 3.79 = 0.22 不对称比:r = 1.04 / 0.22 ≈ 4.7(取整约5)

窗口总宽度:1.26。翻转-确立占窗口17.5%。

3.6 比值的地位

r ≈ 5是ZFCρ模型内的数值结果。本文的核心论证只依赖更弱的条件:r >> 1。具体的r值在不同系统中可能不同,需要独立估计。ZFCρ给出的r ≈ 5是先验预测,其跨领域有效性是待检验的经验问题。

不对称性的结构性直觉:Le Chatelier缓冲在萌芽期全力运转,积累到翻转需要长时间对抗。一旦缓冲被突破,确立很快——因为缓冲机制本身已跌破维持旧pattern的最低要求。

四、核心定理:信号稀释

设每个受试者i有潜在状态Z_i(穿透深度),受随机分组和个体因素共同影响。结局Y_i = g(Z_i) + ε_i,ε_i ~ N(0, σ²)。

定理(信号稀释)。 在二元赋值加二元分析下,ITT平均处理效应ATE = E[g(Z_i) | treated] - E[g(Z_i) | control]。当对照组Z_i ≈ 0时,ATE = E[g(Z_i) | treated]。由于处理组中仅π_cross比例的受试者越过F,ATE被萌芽区的零效果受试者系统性稀释。r越大,π_cross越小,稀释越严重。

推论1(样本量的局限性)。 增加n使ATE估计更精确(标准误下降),但不改变ATE本身的被稀释程度。当π_cross很小时,被稀释的ATE可能小到需要不切实际的n才能达到足够power。

推论2(best responder信号)。 条件于Z_i ≥ F的子群,平均效应接近δ。这解释了事后子群分析常阳性而整体试验阴性的矛盾。

推论3("有前景但证据不足"模式)。 小样本研究中π_cross因selection on Z偏高,效应接近δ;大规模RCT中π_cross回归群体基率,效应被稀释。这与publication bias表现相似但机制不同。

Monte Carlo验证(附录B): r=5, depth=0.3, n=500/arm时,binary power = 9.3%(真实d = 0.8),TIZ power = 97.7%。1000人的"充分powered"RCT对大效应几乎完全失明。

五、主体条件

核心定理的成立需要以下条件同时满足:

条件1:响应函数具有阈值结构。 不是所有干预都有相变响应。纯线性因果关系(如抗体滴度对疫苗剂量)、一次性急性干预(效应迅速且treatment version统一)、外科/器械类明确procedure vs no-procedure试验,不在本文适用范围内。

条件2:处理组内部的realized exposure高度异质。 如果所有受试者达到相同深度,则无论r多大都不存在稀释问题。异质性来源包括依从性差异、生理反应差异、执行质量差异。这在代谢干预、心理治疗、教育改革等复杂干预中几乎必然存在。

条件3:研究不测量或不利用连续暴露状态。 如果试验已经做了exposure-response分析或adaptive titration,稀释问题至少部分被缓解。问题出在二元赋值加二元分析的组合上,不是二元随机化本身。

条件4:结局测量在涌现层而非构层。 干预作用于基础层(构),结局在更高层级(涌现)显现。如果结局直接在构层测量(如直接测血酮水平而非肿瘤大小),则可能看到连续而非阈值型响应。

当四个条件同时成立时,信号稀释定理适用。典型强适用领域:代谢干预(酮症饮食、营养干预),康复治疗,心理治疗中的复杂干预,教育中的课程/教学改革,数字健康和行为改变项目,just-in-time adaptive interventions。

六、射线

射线1:方法学建议

6.1.1 从binary暴露到time-in-zone。 分析变量应包括"在候选翻转点以上的状态空间中累积停留了多长时间"。因果推断限定:time-in-zone是post-randomization变量,直接替代treatment assignment会牺牲因果可识别性。三种合法用法:

(a)mechanistic secondary analysis,在ITT主分析之外做探索性exposure-response分析。 (b)adaptive titration design,在设计层面对暴露强度随机化。 (c)principal stratum或CACE框架,估计compliers的因果效应。需monotonicity/exclusion restriction假设。

额外注意immortal time bias:如果time-in-zone以存活或高依从性为前提,则TIZ与未观测正向混杂因素相关,需用时变协变量Cox模型或landmark分析处理。

6.1.2 监测密度匹配翻转-确立距离。 翻转到确立仅占窗口17.5%,时间窗口可能很短。监测间隔如果大于这个窗口,无法区分萌芽和已过翻转。监测频率应由翻转-确立距离决定,不由行政便利决定。

6.1.3 阴性结论的暴露验证前置条件。 在宣布干预无效之前必须先回答:多大比例的受试者越过了翻转点?维持了多久?如果这个比例很低或不确定,阴性结论说明的是实施方案未达到有效暴露,不是干预机制无效。暴露达标后仍阴性才是对机制的有力证伪。

6.1.4 先做相变结构的先验检验。 大规模RCT前用小样本密集监测:追踪状态变量和结局的联合轨迹,用分段回归/threshold regression/change-point detection检验拐点,估计r,据此设计大规模试验。

射线2:诊断清单

提示性信号(特异性有限,也可能由publication bias或fidelity缺失产生): (1)小样本强效果,大RCT弱/无效果。 (2)best responder子群总是阳性。 (3)干预强度跨研究差异大,效果方向不一致。 (4)领域长期"有前景但证据不足"。 (5)机制研究强支持但现场试验不复现。

确认性信号(特异性更高): (6)处理组内部存在可测连续状态变量,outcome对其呈breakpoint/segmented关系。 (7)best responder与事前定义的"越过翻转点者"高度重合。 (8)控制fidelity/adherence后,threshold模型系统性优于线性模型。

射线3:安全性方向

不对称性同样适用于毒性。如果慢性暴露的毒性响应也有r >> 1结构,Phase III中81%受试者可能未越过毒性翻转点,试验报告"安全"。真实世界长期暴露推更多人过翻转点,造成Phase IV failure。暴露验证建议同时适用于疗效和安全性评估。

射线4:Worked Example — ERGO2

ERGO2是复发性恶性胶质瘤的RCT,比较酮症饮食(KD)+间歇性断食+再放疗 vs 标准饮食+再放疗。PFS未达主要终点,被引用为酮症饮食"缺乏临床获益"的证据。

本文框架下的重新解读:

潜在状态变量Z = Glucose-Ketone Index(GKI)。候选翻转点F = GKI ≤ 2.0。干预仅持续约数天,血酮水平跨个体方差大,许多患者未在极低GKI停留足够时间。但best responders(day-6血糖更低、血酮更高)的PFS和OS显著长于low responders——正是推论2预测的pattern。

五条提示性信号全部成立。三条确认性信号部分成立或待验证(ERGO2有GKI连续数据但未做分段回归;best responder与事前定义的GKI ≤ 2.0 crossers的重合度待检验)。

具体建议:(a)用"GKI ≤ 2.0累积天数"做secondary exposure-response分析。(b)下一个RCT应随机分配到不同GKI目标(如GKI ≤ 1.5 vs GKI ≤ 3.0 vs 标准饮食)。(c)在verified time-in-zone ≥ 4周、GKI ≤ 2.0条件下仍阴性,才是对机制本身的证伪。

七、非平凡预测

预测1:大型阴性RCT中best responder的效应量接近真实效应

在任何符合主体条件的领域中,整体阴性的大型RCT内部,条件于越过候选翻转点的子群,其效应量应接近δ(真实最大效应)。 否证条件: 在多个预注册研究中,事前定义的crossers子群效应仍然为零。

预测2:time-in-zone模型系统性优于binary模型

在符合主体条件的试验中,以连续暴露状态变量做exposure-response分析或threshold regression,应系统性优于binary ITT分析(更高R²,更小AIC/BIC,或更大likelihood ratio)。 否证条件: prospectively measured time-in-zone模型不优于binary模型,无额外解释力。

预测3:不对称比r在多数构-涌现系统中大于1

在具有构-涌现层级结构的系统中独立估计r,多数系统应显示r > 1且中位值显著大于1。ZFCρ的先验预测是r ≈ 5。 否证条件: 跨领域数据表明r不系统偏斜,r ≈ 1或反向偏斜是常态。如果成立,本文适用范围应大幅收窄。

预测4:暴露达标后的阴性试验才是真正的证伪

在现有文献中被引用为"酮症饮食对癌症无效"(或其他"干预X对结局Y无效")的大型阴性试验中,多数试验的处理组应有低π_cross——即多数受试者从未达到有效暴露。如果未来有试验在verified exposure条件下仍阴性,则构成对干预机制本身的强证伪。 否证条件: 有相当比例受试者被事前验证越过翻转点,总体效应仍稳定为零。

八、结论

本文识别并形式化了阈值两侧距离的不对称比r作为影响试验统计功效的关键参数。这个参数有一个独立的数学来源(ZFCρ相变窗口),给出了r ≈ 5的先验预测。Monte Carlo模拟显示,在r = 5、浅穿透条件下,1000人的RCT对大效应(d = 0.8)的检测力仅为9.3%,而time-in-zone设计达97.7%。

四项方法学建议(time-in-zone、监测密度、暴露验证、先验检验)不依赖r = 5这个具体值,只依赖r >> 1。本文的框架同时适用于疗效和安全性评估。

开放问题:(1)r的跨领域分布是什么?(2)在教育学和心理治疗领域,r的操作化估计方法是什么?(3)如何把本文的诊断清单嵌入现有的systematic review和meta-analysis方法中?


与SAE框架的关系:在SAE(Self-as-an-End,自我目的论框架;基础论文见Qin, DOI: 10.5281/zenodo.18528813)中,三阶段结构和不对称性来源于构-涌现关系的普遍几何。"构"(construct)指基础层结构性构件,"涌现"(emergence)指由构件组合在更高层级产生的宏观pattern。Le Chatelier屏蔽决定萌芽期长度;缓冲突破后涌现层迅速响应。这个不对称性是构-涌现层级结构的内在特征,也是本文预测r >> 1跨领域成立的理论基础——但这是一个有待检验的预测,不是已建立的事实。

代谢肿瘤学中的完整应用见 SAE Biology Note 1: Phase Transition Window in Metabolic Oncology(forthcoming)。


附录A:Worked Example完整数据(ERGO2)

见正文第六节射线4。

附录B:Monte Carlo模拟

设计

响应函数:三阶段。真实效应量Cohen's d = 0.8。噪声σ = 1.0。窗口总宽1.26。穿透深度服从指数分布。每条件2000次模拟。比较:binary(t检验)vs TIZ(回归)。总受试者数匹配。

结果1:r系统性摧毁binary power

n=200/arm,depth=50%。

r 萌芽距离 翻转-确立 越翻% Binary TIZ 倍数
1.0 0.630 0.630 37% 0.441 0.999 2.3x
2.0 0.840 0.420 26% 0.335 0.995 3.0x
5.0 1.050 0.210 19% 0.226 0.985 4.4x
10.0 1.145 0.115 16% 0.210 0.976 4.7x

结果2:样本量的局限性

r=5,depth=50%。

n/arm Binary TIZ 倍数
50 0.102 0.568 5.6x
200 0.226 0.985 4.4x
500 0.512 1.000 2.0x

TIZ在n=50时(56.8%)已超过binary在n=500时(51.2%)。

结果3:穿透深度是决定性变量

r=5,n=200/arm。

深度 越翻% Binary TIZ 倍数
0.3 6% 0.064 0.761 12.0x
0.5 19% 0.226 0.985 4.4x
1.0 43% 0.856 1.000 1.2x

关键场景

r=5,depth=0.3,n=500/arm:Binary power 9.3%,TIZ 97.7%,观测d = 0.039(真实0.8)。

代码

见附件 monte_carlo_sim.py。

Han Qin April 7, 2026 SAE Methodology Paper VI

1. The Problem

An intervention may be effective yet invisible to the trial designed to detect it — not because the intervention fails, but because the trial's design is structurally mismatched to the phenomenon.

Specifically: if the response function has threshold structure, if realized exposure within the treatment arm is highly heterogeneous, and if the study neither measures nor exploits this continuous exposure state, then binary assignment combined with binary analysis will systematically dilute the signal from subjects who crossed the threshold.

This problem is not new. Exposure-response analysis in pharmacology (FDA Guidance; ICH E4), MCP-Mod's continuous dose-response estimation, threshold regression and change-point analysis, adaptive enrichment designs, and the implementation fidelity literature in education and rehabilitation science all address aspects of "what to do when thresholds exist."

But these frameworks share a blind spot: they care about where the threshold is and what the effect difference across it looks like. They do not typically ask a more fundamental geometric question: are the distances on each side of the threshold symmetric?

If the distance from intervention onset to threshold (the sprouting distance) is much larger than the distance from threshold to full effect establishment (the establishment distance), i.e. the asymmetry ratio r >> 1, then signal dilution is not accidental but structural. Increasing sample size cannot recover the diluted effect size itself — it can only estimate a near-zero number more precisely.

This asymmetry ratio r is the central object of this paper. Its existence, and the prior prediction that r >> 1, derive from the mathematical structure of phase-transition windows in the dynamic programming recursion of ZFCρ (an integer complexity theory built on ZFC set theory; foundational framework in Qin, DOI: 10.5281/zenodo.18914682).

This paper's contributions were independently derived from ZFCρ's mathematical structure. They overlap with parts of the territory covered by the literatures named above, but the knowledge paths are different. This paper acknowledges these overlaps and positions its incremental contributions, but does not claim inheritance.

2. Definitions

Definition 1: Three-Phase Response Structure

Let the intervention response function g(z) depend on the subject's penetration depth z in the state space, with three segments:

g(z) = 0, for z < F (sprouting zone: microscopic perturbation present, but macroscopic net effect is zero or negative)

g(z) = δ · (z − F) / (E − F), for F ≤ z < E (flip to establishment: net effect ramps from zero to maximum)

g(z) = δ, for z ≥ E (post-establishment: full effect)

where F is the flip point, E is the establishment point, and δ is the true maximum effect size.

Definition 2: Asymmetry Ratio r

r = F / (E − F)

r measures the ratio of sprouting distance to establishment distance. r = 1 is symmetric; r >> 1 means the sprouting distance is much longer than the establishment distance.

Definition 3: Crossing Fraction π_cross

π_cross = P(Z_i ≥ F | treated)

The proportion of treatment-arm subjects who actually cross the flip point. When Z_i follows an exponential distribution Exp(μ) within the treatment arm, π_cross = exp(−F/μ). As r increases, F increases (for fixed total window width), and π_cross decreases.

3. Mathematical Source: The Ω Phase Transition in ZFCρ

3.1 Background

In the ZFCρ framework, the integer complexity function ρ_E(n) measures the minimum cost of representing integer n using two path types: addition (successor, n→n+1) and multiplication (factorization and recombination). When integers are stratified by Ω (number of prime factors with multiplicity), the dynamic programming recursion exhibits a phase transition in Ω-space from a disordered phase (successor paths dominant) to an ordered phase (multiplicative paths dominant). This transition is a gradual window, not a sharp boundary.

3.2 Data

Data come from three independent sources: mean h from presieve computation at N = 10^{10} scale, remaining step-level indicators from autocorrelation analysis (anticorr.c) at N = 10^7 scale — both released with ZFCρ Paper 44 (DOI: 10.5281/zenodo.19247859) — and the sixth indicator z/√j from ZFCρ Paper 57's spectral analysis (N = 10^{10}). All code and data available at the respective Zenodo pages.

Ω mean h P(J>0) E[A] w_shielded Var(A) η
2 +0.697 28.7% −0.701 76.0% 0.314 0.103
3 +0.361 57.1% −0.326 70.5% 0.558 0.131
4 +0.004 77.1% +0.087 65.1% 0.720 0.144
5 −0.329 88.4% +0.503 60.3% 0.833 0.155
6 −0.615 93.1% +0.896 57.4% 0.923 0.168
8 −1.056 96.8% +1.572 55.4% 1.093 0.204

mean h (local convexity): h > 0 means successor retains local competitiveness; h < 0 means multiplicative dominates locally. h = 0 marks full establishment of the ordered phase.

P(J>0) (multiplicative win rate): Proportion of integers where multiplicative net saving J > 0. P(J>0) = 50% is the minority-to-majority crossover.

E[A] (expected net step): Average net saving of multiplicative over successor. E[A] = 0 is the flip point where net effect turns positive.

w_shielded (Le Chatelier shielding rate): Proportion of integers whose fluctuations are absorbed by the system's frequency-intensity hedging. w_shielded = 2/3 is the natural threshold below which the exposed sector dominates and the Le Chatelier buffer fails.

Var(A) (step variance): Var(A) = 1.0 marks O(1) fluctuations, an interior characteristic of the ordered phase.

3.3 Five Crossover Points

Step-level crossovers are computed by linear interpolation. The spectral indicator z/√j is located via the approximate mapping Ω_typ ≈ ln j.

P(J>0) = 50%: Ω = 2 + (50 − 28.7)/(57.1 − 28.7) = 2.750*. Sprouting.

z/√j peak (spectral crossover): z/√j peaks at 5.03 at j = 23 (Paper 57 data), corresponding to Ω_typ = ln 23 ≈ 3.14. Multiplicative shielding begins to suppress additive-inertia accumulation. The fluctuation spectral peak precedes the net-step zero crossing — susceptibility before order parameter, consistent with standard phase-transition structure.

w_shielded = 2/3: Ω = 3 + (70.5 − 66.7)/(70.5 − 65.1) = 3.710*. Le Chatelier buffer failure.

E[A] = 0: Ω = 3 + 0.326/(0.326 + 0.087) = 3.790*. Flip (the most physically meaningful crossover).

h(Ω) = 0: Ω = 4 + 0.004/(0.004 + 0.329) = 4.012*. Establishment.

Indicator Threshold Ω* Phase Source
P(J>0) = 50% majority 2.750 Sprouting Paper 44
z/√j peak peak ≈ 3.14 Spectral flip Paper 57
w_shielded = 2/3 2/3 3.710 Buffer failure Paper 44
E[A] = 0 zero 3.790 Flip anticorr.c
h(Ω) = 0 zero 4.012 Establishment presieve

3.4 Four-Phase Structure

Phase 1: Sprouting (Ω ≈ 2.75). Multiplicative paths first win on a majority of integers, but net effect remains negative. Additive inertia begins accumulating.

Phase 2: Spectral flip (Ω ≈ 3.14). z/√j peaks. Additive inertia reaches maximum; multiplicative shielding begins but does not yet dominate. Fluctuation control shifts from correlation-dominated to screening-dominated. Occurs before step-level indicators flip.

Phase 3: Flip (Ω ≈ 3.79). E[A] = 0; net effect turns positive. Le Chatelier shielding nearly simultaneously drops below 2/3 (Ω ≈ 3.71).

Phase 4: Establishment (Ω ≈ 4.01). h = 0; successor path loses local competitiveness. Ordered phase fully established.

3.5 Extraction of the Asymmetry Ratio

Sprouting to flip: 3.79 − 2.75 = 1.04 Flip to establishment: 4.01 − 3.79 = 0.22 Asymmetry ratio: r = 1.04 / 0.22 ≈ 4.7 (rounded to ~5)

Total window width: 1.26. Flip-to-establishment is 17.5% of the window.

3.6 Status of the Ratio

r ≈ 5 is a numerical result within the ZFCρ model. The core argument depends only on the weaker condition r >> 1. Specific r values may differ across systems and must be independently estimated. ZFCρ's r ≈ 5 is a prior prediction whose cross-domain validity is an empirical question.

Structural intuition: Le Chatelier buffering operates at full strength during sprouting, requiring prolonged effort to reach the flip. Once the buffer is breached, establishment is rapid — the buffer itself has already dropped below the minimum required to maintain the old pattern.

4. Core Theorem: Signal Dilution

Let each subject i have latent state Z_i (penetration depth), jointly determined by random assignment and individual factors. Outcome Y_i = g(Z_i) + ε_i, ε_i ~ N(0, σ²).

Theorem (Signal Dilution). Under binary assignment combined with binary analysis, the ITT average treatment effect ATE = E[g(Z_i) | treated] − E[g(Z_i) | control]. When control-arm Z_i ≈ 0, ATE = E[g(Z_i) | treated]. Since only fraction π_cross of the treatment arm crosses F, the ATE is systematically diluted by sprouting-zone subjects contributing zero effect. Larger r means smaller π_cross means worse dilution.

Corollary 1 (Limitation of sample size). Increasing n improves ATE precision (lower SE) but does not change the diluted ATE itself. When π_cross is small, the diluted ATE may require impractical n for adequate power.

Corollary 2 (Best-responder signal). Conditional on Z_i ≥ F, subgroup mean effect approaches δ. This explains why post-hoc subgroup analyses show effects while overall trials are negative.

Corollary 3 ("Promising but insufficient evidence" pattern). In small studies, selection on Z inflates π_cross and yields effects near δ. In large RCTs, π_cross reverts to population base rate and the effect is diluted. This resembles publication bias but has a different mechanism.

Monte Carlo verification (Appendix B): At r = 5, depth = 0.3, n = 500/arm: binary power = 9.3% (true d = 0.8); TIZ power = 97.7%. A 1000-subject "well-powered" RCT is functionally blind to the true large effect.

5. Subject Conditions

The core theorem requires the following conditions to hold simultaneously:

Condition 1: The response function has threshold structure. Not all interventions have phase-transition responses. Purely linear causal relationships, one-time acute interventions, and surgical procedure-vs-no-procedure trials fall outside this paper's scope.

Condition 2: Realized exposure within the treatment arm is highly heterogeneous. If all subjects reach the same depth, no dilution occurs regardless of r. Sources of heterogeneity include adherence variation, physiological response differences, and implementation quality differences. This is nearly inevitable in complex interventions: metabolic therapy, psychotherapy, educational reform.

Condition 3: The study does not measure or exploit the continuous exposure state. If exposure-response analysis or adaptive titration is already being done, dilution is at least partially mitigated. The problem lies in the combination of binary assignment with binary analysis, not in binary randomization per se.

Condition 4: Outcomes are measured at the emergence layer, not the construct layer. The intervention acts on the foundational layer (construct); the outcome manifests at a higher level (emergence). If the outcome is measured directly at the construct layer (e.g., blood ketone levels rather than tumor size), the response may be continuous rather than threshold-type.

When all four conditions hold, the signal dilution theorem applies. Strongly applicable domains: metabolic interventions, rehabilitation, complex psychological interventions, educational reform, digital health and behavior change, just-in-time adaptive interventions.

6. Rays

Ray 1: Methodological Recommendations

6.1.1 From binary exposure to time-in-zone. The analysis variable should include cumulative time spent above the candidate flip point. Causal caveat: time-in-zone is a post-randomization variable; directly substituting it for treatment assignment sacrifices causal identifiability. Three legitimate uses:

(a) Mechanistic secondary analysis alongside the primary ITT. (b) Adaptive titration design with exposure intensity randomized at the design level. (c) Principal stratum or CACE framework, estimating causal effects for compliers. Requires monotonicity/exclusion restriction assumptions.

Additional note on immortal time bias: if time-in-zone requires being alive or highly adherent, TIZ correlates with unobserved positive confounders. Must be addressed via time-varying covariates in Cox models or landmark analysis.

6.1.2 Monitoring density must match flip-to-establishment distance. The flip-to-establishment window is only 17.5% of the total window. If monitoring intervals exceed this window, sprouting and post-flip states become indistinguishable. Monitoring frequency should be determined by the estimated flip-to-establishment distance, not by administrative convenience.

6.1.3 Exposure verification as prerequisite for negative conclusions. Before declaring an intervention ineffective, one must first establish: what fraction of subjects crossed the flip point, and for how long? Low or undetermined crossing fractions mean the negative conclusion speaks to implementation, not mechanism. Negative results after verified exposure attainment constitute genuine mechanism falsification.

6.1.4 Prior testing for phase-transition structure. Before large-scale RCTs, use small-sample intensive monitoring: track state-outcome joint trajectories, test for breakpoints via segmented regression or change-point detection, estimate r, and design the large trial accordingly.

Ray 2: Diagnostic Checklist

Suggestive signals (limited specificity; may also arise from publication bias or fidelity deficits): (1) Small studies report strong effects; large RCTs report weak/null effects. (2) Best-responder subgroup analyses consistently positive. (3) Intervention intensity varies widely across studies; effect directions inconsistent. (4) The field is chronically "promising but insufficient." (5) Mechanistic research strongly supports efficacy; field trials fail to replicate.

Confirmatory signals (higher specificity): (6) A measurable continuous state variable exists within the treatment arm, and the outcome shows a breakpoint/segmented relationship with it. (7) Best responders overlap substantially with prospectively defined flip-point crossers. (8) After controlling for fidelity and adherence, threshold/segmented models systematically outperform linear models.

Ray 3: Safety Direction

The asymmetry applies equally to toxicity. If a chronic exposure's toxicity response has r >> 1, Phase III may report "safe" because 81% of subjects never crossed the toxicity flip point. Real-world chronic exposure pushes more individuals past the threshold, producing Phase IV failure. The exposure verification recommendations apply to safety assessment as well as efficacy.

Ray 4: Worked Example — ERGO2

ERGO2 was an RCT in recurrent malignant glioma comparing ketogenic diet (KD) plus intermittent fasting plus re-irradiation versus standard diet plus re-irradiation. The primary endpoint (PFS) was not met; the trial is cited as evidence for "lack of clinical benefit."

Under this paper's framework: latent state Z = Glucose-Ketone Index (GKI); candidate flip point F = GKI ≤ 2.0. The intervention lasted only several days; blood ketone levels had high inter-individual variance; many patients did not spend sufficient time at very low GKI. But best responders (lower day-6 glucose, higher day-6 ketones) had significantly longer PFS and OS — precisely the pattern predicted by Corollary 2.

All five suggestive signals are met. Confirmatory signals are partially met or need testing (ERGO2 has GKI-continuous data but segmented regression has not been conducted; overlap between best responders and prospectively defined GKI ≤ 2.0 crossers needs verification).

Recommendations: (a) Secondary analysis using "cumulative days at GKI ≤ 2.0" as exposure variable. (b) Future RCT should randomize patients to different GKI targets rather than KD-vs-standard. (c) Only a trial with verified time-in-zone ≥ 4 weeks at GKI ≤ 2.0 that is still negative would constitute mechanism falsification.

7. Non-Trivial Predictions

Prediction 1: Best-responder effect sizes approximate the true effect in large negative RCTs

In any domain satisfying the subject conditions, within an overall-negative large RCT, the subgroup conditional on crossing the candidate flip point should show effect sizes approaching δ. Falsification: In multiple preregistered studies, prospectively defined crossers show null effects.

Prediction 2: Time-in-zone models systematically outperform binary models

In trials satisfying the subject conditions, exposure-response analysis using the continuous state variable should systematically outperform binary ITT analysis (higher R², lower AIC/BIC, larger likelihood ratio). Falsification: Prospectively measured TIZ models provide no additional explanatory power over binary models.

Prediction 3: The asymmetry ratio r > 1 in most construct-emergence systems

Independently estimated in systems with construct-emergence hierarchical structure, most should show r > 1 with median substantially above 1. ZFCρ's prior: r ≈ 5. Falsification: Cross-domain data show r is not systematically skewed; r ≈ 1 or reverse-skewed is the norm. If so, this paper's scope should be sharply narrowed.

Prediction 4: Exposure-verified negative trials are the true falsifiers

Among large negative trials currently cited as evidence that "intervention X is ineffective for outcome Y," most treatment arms should have low π_cross. Only trials with verified exposure attainment that remain negative constitute strong mechanism falsification. Falsification: A substantial fraction of subjects are prospectively verified to have crossed the flip point, yet the overall effect remains stably null.

8. Conclusion

This paper identifies and formalizes the asymmetry ratio r between distances on each side of the threshold as a key parameter affecting trial statistical power. This parameter has an independent mathematical source (the ZFCρ phase-transition window), yielding a prior prediction of r ≈ 5. Monte Carlo simulation demonstrates that at r = 5 with shallow penetration, a 1000-subject RCT achieves only 9.3% power for a large true effect (d = 0.8), while a time-in-zone design achieves 97.7%.

The four methodological recommendations (time-in-zone, monitoring density, exposure verification, prior testing) do not depend on r = 5 specifically, only on r >> 1. The framework applies to both efficacy and safety assessment.

Open questions: (1) What is the cross-domain distribution of r? (2) How should r be operationally estimated in education and psychotherapy? (3) How can this paper's diagnostic checklist be embedded in existing systematic review and meta-analysis methods?


Relationship to the SAE framework: Within SAE (Self-as-an-End; foundational paper: Qin, DOI: 10.5281/zenodo.18528813), the three-phase structure and asymmetry originate from the general geometry of construct-emergence relations. "Construct" refers to structural components at the foundational layer; "emergence" to macroscopic patterns arising from construct combinations at higher levels. Le Chatelier shielding determines the length of the sprouting phase; once the buffer is breached, emergence-layer response establishes rapidly. This asymmetry is an intrinsic feature of construct-emergence hierarchical structure — but whether it appears across domains is a prediction to be tested, not an established fact.

Full metabolic oncology application: SAE Biology Note 1 (forthcoming).


Appendix A: Worked Example Full Data (ERGO2)

See Section 6, Ray 4.

Appendix B: Monte Carlo Simulation

Design

Three-phase response function. True effect d = 0.8. Noise σ = 1.0. Window width 1.26. Penetration depths exponentially distributed. 2,000 simulations per condition. Binary design (t-test) vs TIZ (regression). Total subjects matched.

Result 1: r systematically destroys binary power

n = 200/arm, depth = 50%.

r Sprout dist. Flip-est. % crossed Binary TIZ Ratio
1.0 0.630 0.630 37% 0.441 0.999 2.3×
2.0 0.840 0.420 26% 0.335 0.995 3.0×
5.0 1.050 0.210 19% 0.226 0.985 4.4×
10.0 1.145 0.115 16% 0.210 0.976 4.7×

Result 2: Sample size has limited effect

r = 5, depth = 50%.

n/arm Binary TIZ Ratio
50 0.102 0.568 5.6×
200 0.226 0.985 4.4×
500 0.512 1.000 2.0×

TIZ at n = 50 (56.8%) surpasses binary at n = 500 (51.2%).

Result 3: Penetration depth is decisive

r = 5, n = 200/arm.

Depth % crossed Binary TIZ Ratio
0.3 6% 0.064 0.761 12.0×
0.5 19% 0.226 0.985 4.4×
1.0 43% 0.856 1.000 1.2×

Critical scenario

r = 5, depth = 0.3, n = 500/arm: Binary power 9.3%, TIZ 97.7%, observed d = 0.039 (true 0.8).

Code

See attached: monte_carlo_sim.py.