The Agent–Habitat Dynamics (AHD) series has developed a framework for Physical AI deployment built around specific constructs: the habitat as a first-class architectural object, the agent–habitat pairing as the unit of analysis, regimes defined by stability margin and write-back, the Trust-Maturity Ladder (TML) of earned operating authority, the Habitat Viability Ladder, the Digital Organism's five functional groups, and consequence learning as the mechanism by which operational experience enriches the environment. This paper is the first structured attempt to apply those constructs, through a coded diagnostic instrument, to real startup–corporate collaborations in the field. The question is whether the framework's vocabulary is adequate to describe what practitioners actually encounter and whether its predicted patterns hold.
The paper's principal finding is that the framework passed two distinct tests. First, semantic modelability: the framework's vocabulary placed every case on its two primary axes. Fourteen startup–corporate collaborations drawn from Physical AI deployments between 2017 and 2026 were scored against the full AHD diagnostic instrument and against the Trust-Maturity Ladder. On the AHD regime axis, the 14 cases distribute as 8 in Fragility, 4 in Managed Usefulness, and 2 in Compounding Resonance. On the TML rung axis, they distribute as 7 at Rung 1, 4 at Rung 2, 1 at Rung 3, and 2 at Rung 4. Every case was placeable on both axes without gap or ambiguity. Second, construct adequacy: the cases labeled with a given regime or rung actually displayed the real-world behavior the framework predicts that label should carry. Fragility cases uniformly showed low stability, weak or absent write-back, low pilot-to-production rates (10–40%), no field crystallization, and consequence learning limited or absent. Managed Usefulness cases uniformly showed moderate stability, pilot-to-production rates (70–80%) above Fragility but below Compounding Resonance, consequence learning limited or absent, and field not yet crystallizing. Compounding Resonance cases uniformly showed high stability, active consequence learning, active field crystallization, and pilot-to-production rates around 90%. The framework's labels carry real behavioral content; the classification is not an empty scheme. Read together as a regime × rung cross-tabulation, the two axes also reveal structural features of the sample that neither axis alone shows: most notably that Fragility never co-occurs with Rung 3 or Rung 4 in this sample, indicating that earned operating authority does not appear where stability margin is low.
Three further patterns the instrument surfaced extended the framework's vocabulary without replacing it. First, a dynamic the series did not formalize: many collaborations enter the sample's lower-left region (Fragility at low TML rungs) in a state of joint sub-adequacy (the Twin Deficit) in which agent capability and habitat readiness are simultaneously below the thresholds the pairing requires, and each side's sub-adequacy masks and amplifies the other's. Escape from this dynamic, in this sample, occurred only through joint maturation on both sides. Second, organism completeness as a property of the pairing rather than the agent alone: the habitat can supply functional groups the agent lacks. Third, the decoupling of nominal authority scope from evidence-earned Trust-Maturity Ladder rung progression: authority scope can advance while rung, stability margin, and binding-constraint diagnosis all indicate the collaboration has not genuinely escaped low-regime status. Each pattern is a natural extension of the framework's existing vocabulary rather than a correction of it, and each is developed as a downstream finding of the same diagnostic instrument that produced the principal result.
This paper's contribution is not proof of the AHD framework. It is the first structured attempt to translate the Agent–Habitat Dynamics (AHD) framework into an observable diagnostic instrument, apply it across real-world collaborations, and report what the instrument reveals: both where the framework's constructs succeed in describing the cases and where the data extends the framework in ways the original formulation did not fully anticipate. All reported figures are descriptive patterns from a small, non-random sample and are not offered as causal or statistically generalizable.