Tal Cohen

Publications

Agent–Habitat Dynamics (AHD) Series Roadmap

Abstract

Agent–Habitat Dynamics: Roadmap to a Working Paper Series on Physical AI Deployment.

Physical AI is failing at scale. Pilots succeed in demonstration and stall in production. Independent research from MIT NANDA, RAND, and Gartner all converge on the same diagnosis: enterprise AI deployments are not failing because the models are weak. They are failing because the operating environments around the models have not been built to receive consequential autonomy.

The Agent–Habitat Dynamics (AHD) working paper series formalizes the missing layer. It names the habitat as the trusted operating environment in which Physical AI takes consequential action under institutional governance, evidence accumulates, oversight is exercised, and learning crystallizes into reusable institutional knowledge. The series argues, with reference to causal inference (Pearl), institutional substance (Spinoza), and ecological resilience theory, that the habitat is the decisive variable in Physical AI deployment, not the model.

This Roadmap introduces the full fourteen-paper series. It maps the constructs developed across the series: Consequence Field Theory, the Trust Maturity Ladder, the Zone of Hyper-Anticipation, Habitat Debt, Governance Inheritance, Consequence-Based Apprenticeship, and the three-wave economic structure of habitat value (Service, Asset, Franchise).

Read this Roadmap first to orient. Then read the papers. The deployment problem is not where the field has been looking. The field has been looking at the agent. The decisive variable is the habitat.


Habitat as the Missing Layer in Physical AI Deployment

Abstract

Physical AI deployment, defined here as autonomous systems that perceive, decide, and act in the physical world under real consequence, is failing at the pilot-to-production boundary at non-trivial rates across multiple industrial settings. Existing integration disciplines, developed over decades for deterministic software and traditional industrial systems, are structurally mismatched to the demands of probabilistic, learning-based agents operating where errors are irreversible and institutional acceptance acts as the binding constraint on scaling.

This paper proposes the Physical AI habitat as the architectural layer through which many agents, built by different teams and operating under real institutional constraints, can coordinate, earn trust incrementally, preserve transferable learning, and be governed under consequence. The habitat is defined formally as a six-tuple H = ⟨Π, Χ, Γ, Ω, Σ, Θ⟩ of communication protocols, challenge definitions, governance rules, observability channels, accumulated challenge state, and trust evidence, with each component specified in measurable content.

A structural necessity claim is established: three minimal functional requirements (trust protocol, shared challenge architecture, runtime governance layer) map in a minimally complete way onto the six components, and no proper subset of the six supports all three. Four testable propositions are advanced, each stated with an explicit falsification condition, including the claim that deployment reliability is better explained by the agent–habitat pairing than by model capability alone.

Evidence is drawn from two sources: publicly documented deployment patterns (NTSB investigations, regulatory proceedings, technical briefings) and structured field observations from fourteen industrial AI deployments observed through a corporate innovation platform over nine years; three are re-read through the formal definition. The field observations are motivating rather than confirmatory. This paper is self-contained; companion papers in the Agent–Habitat Dynamics series extend specific aspects but are not required to evaluate the argument presented here.


Intelligentization

Abstract

Three times in the last two centuries, civilizations went through transitions deep enough that everyday life became unrecognizable to the generation before. Electrification abstracted energy. Telecommunications abstracted distance. Digitization abstracted memory. A fourth transition appears to be underway. This paper argues that it is structurally different from the three before it, and that the difference matters.

The transition is not artificial intelligence in general. It is Physical AI: systems that observe, interpret, act, and learn from the consequences of their actions in the physical world, where the action cannot be taken back. What gets abstracted this time is not a resource. It is the act of consequential choice itself. The key operating zone is the consequence-hardening window: the interval between the earliest machine-detectable signal that a consequence is forming and the moment that consequence becomes irreversible. Value, risk, and institutional difficulty all concentrate in this window. Acting inside it is what makes Physical AI economically powerful. It is also what makes the five institutional questions civilizations have historically used to govern consequential choice (when must action occur, who may act, why may that actor be trusted, how should outcomes shape future conduct, and who answers when harm occurs) lose reach in their inherited form.

The paper shows why the five answers lose reach together rather than one at a time, identifies the human position that holds the architecture together when they do, names the new form of power the same architecture can be turned into against unaware targets, and closes with what builders and operators can responsibly commit to.


Constraint Architecture of Physical AI Deployment: A Coupled Interaction Model

Abstract

Physical AI deployment, defined here as autonomous systems that perceive, decide, and act in the physical world under real consequence, is failing at the pilot-to-production boundary at non-trivial rates across multiple industrial settings. This pattern of pilot-stage success followed by production-stage failure is widely documented across industrial AI; what distinguishes Physical AI within that broader space is that errors are irreversible and institutional trust constrains what the system may do.

This paper formalizes four structural properties of Physical AI deployment as four laws: real-time consequence, physical irreversibility, environmental non-cooperation, and institutional selection. Each is stated as a constraint, traced to its engineering lineage, and shown to be unsatisfiable by model improvement alone. The term "laws" is used in the sense that the constraints are imposed by physics and institutions rather than chosen by designers.

The central contribution is the Constraint Interaction Model (CIM), which demonstrates that the four laws form a fully coupled system: every law interacts with every other, producing six pairwise interactions that are directionally asymmetric in four of the six cases, yielding up to twelve distinct diagnostic signatures. Addressing one constraint without the others does not reduce total deployment risk but transfers it to a different failure mode. Three independent convergence checks (the four-interface test, the exhaustion test, and convergence with IEC 61508) support the structural argument. A severity-calibrated framework reframes the four laws as a portfolio to be managed continuously at deployment-specific thresholds rather than as a pass/fail checklist.

Three propositions for empirical testing are advanced, each paired with an explicit falsification condition. This paper is self-contained; companion papers in the Agent–Habitat Dynamics series extend specific aspects but are not required to evaluate the present argument.


Agent–Habitat Dynamics: A Diagnostic Framework for Physical AI Deployment

Abstract

Physical AI deployment, defined here as autonomous systems that perceive, decide, and act in live physical environments where consequences are irreversible, exploration is costly, and institutional trust constrains what the system may do, is failing at the pilot-to-production boundary at high rates across multiple industrial settings. Model-centric diagnosis (accuracy, precision, latency) is frequently insufficient to explain why technically sound systems stall, lose institutional trust, or succeed in pilot and fail in production.

This paper introduces Agent–Habitat Dynamics (AHD), a diagnostic framework that treats deployment not as a static evaluation of model performance but as a coupled dynamical system in which the agent and its operating environment co-evolve across three timescales. The framework proposes diagnostic quantities (Stability Margin Mₛ, Write-Back Rate W̄, and Constraint Leverage Λᵢ) that together map a deployment into one of the candidate regimes: Fragility, Managed Usefulness, Compounding Resonance, False Comfort Plateau, and Depletion. Each regime implies a structurally different optimal intervention; misdiagnosing the regime is argued to be a recurrent source of wasted engineering effort.

The framework grounds the regime map in critical transitions theory from ecology (Scheffer, 2009; Scheffer et al., 2009) and structured-observation methodology from intensive care medicine (Vincent et al., 1996). The paper's central practitioner tool is the Perfect Model Test: a single diagnostic question that separates model-side constraints from habitat-side constraints before resources are misallocated.

Derivable failure types are presented, each illustrated with a public deployment case. Propositions for empirical testing are advanced, each stated formally and paired with an explicit falsification condition. This paper is self-contained; companion papers in the Agent–Habitat Dynamics series extend specific aspects but are not required to evaluate the present argument.


The Digital Organism

Abstract

Physical AI is often discussed as a race to build better foundation models, world models, and robotic policies. This paper argues that model capability is necessary, but it is not the deployable subject of Physical AI. The physical world admits bounded entities that can sense, decide, act, remain safe, maintain continuity across operating cycles, and be governed as coherent units. I call this entity a digital organism: the smallest bounded unit of matter-and-model co-adaptation that can survive, earn authority, and improve through physical consequence.

This paper makes four contributions to that argument. First, it separates the organism boundary from component assembly through an Organism Boundary Test. Second, it introduces an Organism Completeness Test, deriving a required functional architecture (perception, cognition, action, safety/trust, and reach, plus the energy substrate of metabolism) from the irreducible constraints of real-time consequence, physical irreversibility, environmental non-cooperation, and institutional selection. Third, it explains why safety, reach, and the metabolic substrate must be treated as architectural conditions rather than auxiliary add-ons, and why the organism–habitat pairing, not the startup product alone, is the correct unit of deployment. Fourth, it proposes falsifiable predictions linking functional incompleteness to deployment failure and linking write-back to compounding advantage. The central claim is that Physical AI scales reliably when model intelligence is organized into a bounded, governable, consequence-learning entity.


Trust Maturity Ladder

Abstract

Existing frameworks classify AI capability (SAE J3016) or risk (EU AI Act), but none specifies how a Physical AI system (AI deployed in live physical environments where consequences are irreversible, exploration is costly, and institutional trust constrains what the system may do) earns, maintains, loses, and transports operational authority over time. This paper introduces the Trust Maturity Ladder (TML), a staged governance architecture for machine autonomy in physical environments. The ladder defines five rungs from advisory-only operation to ecosystem-level deployment, governed by four named gates: the entry gate (what evidence is required to advance), the maintenance gate (what must remain true to stay), the regression gate (what events force rollback), and the re-entry gate (the higher threshold required after regression, reflecting the paper's central governance innovation: that regained authority requires more evidence than originally earned authority, because institutional memory of failure is asymmetric).

The TML's novelty is not that staged advancement, evidence thresholds, or authority regression are new in isolation. They are not. Its novelty lies in integrating them into a single governance state machine for Physical AI, combining evidence-gated advancement, regression with hysteresis, AHD diagnostic conditioning, evidence portability, and design for machine autonomy. The paper evaluates crosswalk compatibility with NIST AI RMF, ISO/IEC 42001, and EU AI Act requirements, and presents four testable propositions.

The paper grounds the ladder primarily in precedent analysis across four high-consequence domains, and uses immunological tolerance as a structural parallel showing why graduated trust recurs in complex systems that face uncertainty, asymmetric error costs, accumulating evidence, and changing conditions.


The Minimum Viable Habitat

Abstract

The companion organism paper asked what must exist inside the organism boundary. This paper asks the prior question: what must exist outside it? The organism boundary is binary: a system either crosses into organismhood or it does not. Habitat viability is graded: every deployment creates some habitat, and the quality gradient determines what class of operation is possible.

This paper's central claim is that the Minimum Viable Habitat is the minimum environment in which an organism's outputs stop being suggestions and begin becoming evidence: attributable, governable, reviewable, and accumulable. The paper introduces the Habitat Viability Ladder, five thresholds from Demo through Ecosystem viability. Threshold S describes the service stage where many deployments operate today: delivering value but not retaining what they prove. Threshold T1 is Evidence Viability, the point at which AI spend stops buying only service and starts building an institutional asset.

The paper proposes the doctrine that, for consequence-bearing Physical AI deployments, no first autonomous action should be taken before T1. It establishes the Dual Adequacy Principle: deployment outcomes are bounded by the weaker of organism capability and habitat maturity, and the diagnostic question is which side is the binding constraint. It further argues that the habitat is among the few deployment assets that can compound with use, while models and hardware depreciate. If this argument holds, the question to ask of a stalled deployment shifts from "is the model good enough?" to "can the habitat convert this organism's outputs into evidence?" Many pilot failures are not model failures. They are habitat failures of challenge definition, observability, governance, feedback, and commitment.


Multi–Agent Habitat Dynamics

Abstract

A manufacturing plant deploys a predictive maintenance system. It works. Six months later, the plant adds quality inspection. Then energy optimization. Then human-robot collaboration. Each system is deployed independently, governed independently, and evaluated independently. Each works in isolation. Together, they produce a cascade failure that none of them caused and none of them can diagnose.

This paper extends the Agent–Habitat Dynamics framework from single-organism to multi-agent environments. It introduces three primary constructs and several derived mechanisms. The Coupling Taxonomy classifies four channels through which organisms interact via shared habitat: physical, informational, governance, and consequence. Habitat Debt names the hidden liability that accumulates when local optimization or local governance around one organism degrades the operating conditions, evidence quality, governance capacity, or trust environment of another. This liability is invisible in any individual scorecard. It becomes visible only when the shared habitat is the unit of analysis. Collective Write-Back describes the conditions under which multiple organisms enrich a shared habitat constructively rather than chaotically. The scenario then derives Governance Inheritance, coupling cascades, authority conflicts, and the consequence attribution problem as recurring failure mechanisms.

The paper demonstrates these constructs through a single extended scenario: a manufacturing facility that deploys four AI systems over fifteen months, observing how coupling emerges, debt accumulates, cascades propagate, and governance fails. The paper also identifies what architectural design would have prevented each failure. The central finding is that in multi-agent environments above a threshold of coupling density, shared habitat architecture becomes structurally necessary for governability. Without shared visibility, each organism optimizes locally while remaining effectively blind to how it shapes the operating conditions of others.


The Empirical Test: A Structured Field Study of Agent–Habitat Regimes in Industrial AI Deployment

Abstract

The Agent–Habitat Dynamics (AHD) series has developed a framework for Physical AI deployment built around specific constructs: the habitat as a first-class architectural object, the agent–habitat pairing as the unit of analysis, regimes defined by stability margin and write-back, the Trust-Maturity Ladder (TML) of earned operating authority, the Habitat Viability Ladder, the Digital Organism's five functional groups, and consequence learning as the mechanism by which operational experience enriches the environment. This paper is the first structured attempt to apply those constructs, through a coded diagnostic instrument, to real startup–corporate collaborations in the field. The question is whether the framework's vocabulary is adequate to describe what practitioners actually encounter and whether its predicted patterns hold.

The paper's principal finding is that the framework passed two distinct tests. First, semantic modelability: the framework's vocabulary placed every case on its two primary axes. Fourteen startup–corporate collaborations drawn from Physical AI deployments between 2017 and 2026 were scored against the full AHD diagnostic instrument and against the Trust-Maturity Ladder. On the AHD regime axis, the 14 cases distribute as 8 in Fragility, 4 in Managed Usefulness, and 2 in Compounding Resonance. On the TML rung axis, they distribute as 7 at Rung 1, 4 at Rung 2, 1 at Rung 3, and 2 at Rung 4. Every case was placeable on both axes without gap or ambiguity. Second, construct adequacy: the cases labeled with a given regime or rung actually displayed the real-world behavior the framework predicts that label should carry. Fragility cases uniformly showed low stability, weak or absent write-back, low pilot-to-production rates (10–40%), no field crystallization, and consequence learning limited or absent. Managed Usefulness cases uniformly showed moderate stability, pilot-to-production rates (70–80%) above Fragility but below Compounding Resonance, consequence learning limited or absent, and field not yet crystallizing. Compounding Resonance cases uniformly showed high stability, active consequence learning, active field crystallization, and pilot-to-production rates around 90%. The framework's labels carry real behavioral content; the classification is not an empty scheme. Read together as a regime × rung cross-tabulation, the two axes also reveal structural features of the sample that neither axis alone shows: most notably that Fragility never co-occurs with Rung 3 or Rung 4 in this sample, indicating that earned operating authority does not appear where stability margin is low.

Three further patterns the instrument surfaced extended the framework's vocabulary without replacing it. First, a dynamic the series did not formalize: many collaborations enter the sample's lower-left region (Fragility at low TML rungs) in a state of joint sub-adequacy (the Twin Deficit) in which agent capability and habitat readiness are simultaneously below the thresholds the pairing requires, and each side's sub-adequacy masks and amplifies the other's. Escape from this dynamic, in this sample, occurred only through joint maturation on both sides. Second, organism completeness as a property of the pairing rather than the agent alone: the habitat can supply functional groups the agent lacks. Third, the decoupling of nominal authority scope from evidence-earned Trust-Maturity Ladder rung progression: authority scope can advance while rung, stability margin, and binding-constraint diagnosis all indicate the collaboration has not genuinely escaped low-regime status. Each pattern is a natural extension of the framework's existing vocabulary rather than a correction of it, and each is developed as a downstream finding of the same diagnostic instrument that produced the principal result.

This paper's contribution is not proof of the AHD framework. It is the first structured attempt to translate the Agent–Habitat Dynamics (AHD) framework into an observable diagnostic instrument, apply it across real-world collaborations, and report what the instrument reveals: both where the framework's constructs succeed in describing the cases and where the data extends the framework in ways the original formulation did not fully anticipate. All reported figures are descriptive patterns from a small, non-random sample and are not offered as causal or statistically generalizable.