"We are macroscopic observers in a microscopic reality." As macroscopic observers, we cannot directly access the microscopic reality we occupy. The Conservation-Congruent Encoding (CCE) framework seeks to formalise this limitation, treating measurement and observation as physical acts bounded by conservation laws.
Google Scholar | GitHub | Blog
I'm building Corca Health, an AI-assisted transdiagnostic screener for mental health and neurodevelopmental concerns. More broadly, my work is centred around leveraging AI to improve mental and physical wellbeing.
My research develops a physical theory of intelligence to establish rigorous constraints for the safe development of artificial intelligence. Through the Conservation-Congruent Encoding (CCE) framework, I anchor computation in physical conservation laws. This work derives physical metrics for intelligence and consciousness, exposes the Platonic Observer Fallacy inherent in modeling microscopic realities with macroscopic equations, and seeks to show how observer information constraints relate to established physical theories.
While often treated as abstract, substrate-independent algorithmic properties, intelligence and computation are ultimately physical processes constrained by conservation laws. The core hypothesis of this framework is that information processing emerges when open systems undergo irreversible transitions, carving out stable macroscopic states from underlying reversible micro-dynamics. Under this physical lens:
To measure information physically rather than abstractly, we utilize the Conservation-Congruent Encoding (CCE) framework. Traditional information theory (e.g., Shannon entropy) measures logical uncertainty, but is blind to equilibrium asymmetries and reservoir exchanges. CCE grounds information in matter by defining a macroscopic distinction as a family of protected regions (metastable basins of attraction) in state space whose boundaries are stabilized by conservation laws.
Under CCE, informational state changes are mapped directly to physical fluxes:
By defining both intelligence (χ) and consciousness (κ) as ratios of goal-directed work (W) to their respective informational currencies (Iirr and Irev), we isolate how an agent utilizes physical resources to influence its environment.
Solving the consciousness relation for work yields W = κ Irev. Substituting this into the definition of intelligence gives the identity:
This identity shows that an agent's operational intelligence is the product of its structural consciousness and its processing efficiency (the ratio of preserved to destroyed distinctions). A system can achieve high intelligence either by executing massive, energy-intensive irreversible computations (high Iirr) or by relying on complex, pre-compiled internal models that advect information reversibly (high κ and Irev).
AI safety is traditionally treated as a normative alignment problem. However, developing a formal framework to measure intelligence shifts the focus to physical boundaries and resource constraints. By defining intelligence as a measurable capacity to perform goal-directed work, we can establish the physical limits within which an agent operates.
Safety constraints can be studied mathematically via boundary dynamics and symbiotic coupling rather than isolated value alignment.
In classical physics and computational theory, observers frequently treat macroscopic equations of motion as absolute and decoupled from the microscopic substrates they describe. This assumption—which we call the Platonic Observer Fallacy—imagines that coarse-grained states and continuous field variables are objective entities that can be measured, tracked, and simulated without physical cost. By neglecting that observation is a physical act bounded by conservation laws, standard modeling hides a growing informational deficit.
The difference between Shannon's formulation and the Conservation-Congruent Encoding (CCE) framework lies in the distinction between pure logical accounting and dynamical accounting.
Under Shannon's formulation, information is logical and substrate-independent. For a discrete random variable X representing symbolic states, the Shannon entropy measures uncertainty purely from the state probabilities:
If a bit transitions from an initial distribution p to a final distribution p_{\text{final}}, Shannon's logical accounting reports the information change purely as the difference in logical entropy:
This accounting does not care about the physical characteristics of the underlying states, making it symmetric and blind to the physical effort required to maintain, transition, or merge them.
CCE, by contrast, relies on dynamical accounting. It grounds information in matter by mapping logical states to protected phase-space volumes stabilized by conservation laws. Let the unconstrained thermal equilibrium distribution of the physical substrate be \pi. In CCE accounting, the physical information content of a macroscopic state distribution p is measured using the Kullback-Leibler (KL) divergence relative to this equilibrium distribution:
This divergence represents the physical distinction held by the observer's encoding relative to the ambient substrate dynamics. The minimum irreversible cost I_{\text{irr}} of transitioning the system from distribution p to p_{\text{final}} is determined by the difference in their physical distinctions:
To see the impact of this dynamical ledger, consider an asymmetric bit with two macroscopic basins: Left (L) and Right (R). Due to differences in energy levels, boundary geometry, or coupling to the environment, the phase-space volumes corresponding to these states are unequal: V_R = \alpha V_L (with \alpha \neq 1). The prior equilibrium probabilities of the basins are:
If we prepare the system in a distribution where p(L) = p and p(R) = 1-p, and subsequently execute a logically irreversible Reset-to-Left operation (yielding p_{\text{final}} = (1, 0)), the logical Shannon cost is H_S(p). However, the CCE dynamical cost is:
The CCE dynamical accounting recovers the logical Shannon term H_S(p) plus a physical asymmetry penalty (1-p)\ln\alpha.
Crucially, the asymmetry parameter \alpha is a fixed physical property of the hardware (the geometric or energetic ratio of the basins), whereas how this asymmetry penalizes or discounts the transition is determined by the direction of the operation. Here, because \alpha > 1 (meaning the Right basin is larger), sweeping probability out of the larger basin into the smaller Left basin requires compressing the phase space. This direction-dependent physical effort is why the operation pays a penalty of +(1-p)\ln\alpha.
If we instead executed a Reset-to-Right operation (sweeping the system into the larger basin, so p_{\text{final}} = (0, 1)), the CCE dynamical cost would be:
In this direction, the penalty becomes a thermodynamic discount of -p\ln\alpha. Because the operation allows the system to expand from the smaller, constrained Left basin into the larger Right basin, the physical substrate assists the transition. Under pure Shannon accounting, both operations cost the same logical H_S(p) nats. Under CCE, the logical ledger is bound to the physical hardware: compressing data against the substrate's natural asymmetry costs extra work, while expanding with it recovers work.
We now scale this concept from a single bit to a macroscopic equation modeling a high-dimensional microscopic reality. Let the true microscopic state of a system be x in a high-dimensional phase space \mathcal{M}, with its probability distribution evolving under the microscopic Liouville or Fokker-Planck flow as P_{\text{micro}}(x, t).
A macroscopic observer uses a coarse-graining projection operator \Pi: \mathcal{M} \to \mathcal{C} to map these microstates to a reduced, continuous field variable \phi(t) \in \mathcal{C}. The observer models the system using a macroscopic equation of motion:
By relying solely on \phi(t), the observer implicitly assumes a microscopic density constructed via the maximum-entropy (or local equilibrium) lift operator \Pi^*:
However, the true microscopic distribution P_{\text{micro}}(x, t) evolves under the full chaotic and coupled microscopic laws. As time progresses, microscopic interactions generate fine-grained fluctuations, correlations, and gradients across the boundaries of the coarse-grained cells. These details are neglected by P_{\text{macro}}(x, t), which assumes local equilibrium within the macroscopic states.
The difference between prediction and retrodiction under a macroscopic model can be illustrated using a classical physical system: a particle of mass m moving in a viscous fluid with friction coefficient \gamma.
Let the true microscopic dynamics of the particle's velocity v be stochastic due to collisions with fluid molecules (thermal noise), described by the Langevin equation:
where D is the diffusion coefficient and W_t is a standard Wiener process. The true microscopic probability density P_{\text{micro}}(v, t) evolves under the Fokker-Planck equation, tending toward the thermal equilibrium variance \sigma_{\text{eq}}^2 = D/\gamma.
A macroscopic observer ignores the thermal fluctuations and models the velocity using the deterministic decay equation:
The observer's measurement instrument has a finite resolution represented by a narrow Gaussian of variance \sigma_0^2 (where \sigma_0^2 \ll \sigma_{\text{eq}}^2).
Suppose the observer prepares the particle in a highly localized velocity state v(0) = v_0 with variance \sigma_0^2 at t = 0. After a time interval T, the true microscopic distribution spreads due to diffusion:
Meanwhile, the macroscopic model predicts v_{\text{macro}}(T) = v_0 e^{-\gamma T} and assumes the measurement resolution remains \sigma_0^2, yielding P_{\text{macro}}(v, T) = \mathcal{N}(v_0 e^{-\gamma T}, \sigma_0^2).
The predictive divergence in nats is the Kullback-Leibler divergence between these two distributions:
where \sigma_T^2 = \sigma_0^2 e^{-2\gamma T} + \sigma_{\text{eq}}^2(1 - e^{-2\gamma T}). As T becomes large, the true variance approaches the thermal variance \sigma_{\text{eq}}^2, and the predictive divergence plateaus at a constant level:
This represents the information lost by ignoring environmental fluctuations, which is bounded by the ratio of thermal noise to observer resolution.
Now suppose the observer measures the velocity at time T to be v(T) = v_T. To reconstruct the past state at t = 0, the macroscopic modeler runs the deterministic equation backward in time, yielding the retrodictive estimate v_{\text{macro}}(0) = v_T e^{\gamma T} with resolution variance \sigma_0^2.
However, the true microscopic retrodiction is the Bayesian posterior distribution of the initial state given the measurement. Under a thermal prior P(v(0)) = \mathcal{N}(0, \sigma_{\text{eq}}^2), the posterior distribution is:
Because high-energy states are exponentially rare under the thermal prior, observing a velocity v_T today does not mean the particle started with a massive velocity v_T e^{\gamma T} that slowly dissipated. Rather, it is overwhelmingly more probable that the particle was near thermal equilibrium (near zero) in the past, and a recent random thermal fluctuation pushed it to v_T. The macroscopic model completely ignores this thermal prior, hallucinating a physically exorbitant, high-energy history.
The retrodictive divergence in nats between this true historical distribution and the macroscopic model's backward projection is:
As T grows, the difference between the true posterior mean and the macroscopic model's retrodiction grows exponentially. The divergence is dominated by the mean mismatch:
Unlike the prediction divergence which plateaus, the retrodictive divergence grows exponentially with T. This asymmetry represents the informational debt of coarse-graining: the macroscopic model throws away phase-space volume in the forward direction, which requires exponential precision (information) to reconstruct in reverse.
These informational divergences provide a direct bridge to physical dynamics and energy under CCE. If we take the macroscopic equations as they are and project them backward in time under an assumption of costless resolution, the observer must pack an exponentially growing amount of energy to track and re-instantiate the lost historical details, mirroring the divergence of a physical singularity. Conversely, in the forward predictive limit, the complete export of the observer's organized energy and signal into the thermal bath matches the dynamics of heat death.
Although these asymptotic limits diverge from physical reality—because real observers are finite and bounded by resource constraints—the analogy highlights a key CCE principle: thermodynamic events like singularities and heat death can be understood as the operational boundaries of macroscopic coarse-graining when pushed to its limits.
Rather than combining these divergences, it is the individual terms and the deep asymmetry between them that expose the Platonic Observer Fallacy. The predictive divergence, D_{\text{pred}}(T), plateaus at a constant level, indicating that the observer's loss of forward predictive capacity is bounded by the finite scrambling capacity of the thermal bath. Conversely, the retrodictive divergence, D_{\text{retro}}(T), grows exponentially over time, representing the severe informational debt incurred when trying to reconstruct history from a dissipative macroscopic representation. This asymmetry demonstrates that macroscopic models are not self-contained descriptions of reality; their forward evolution silently discards microscopic coordinates, demanding an exponentially growing amount of information to resolve in reverse.
The following research notes represent an ongoing effort to map and structure a developing physical theory. To keep pace with theoretical ideation, I utilize AI tools to rapidly generate and formalize my work into research notes.
Please read these documents with the understanding that they are living drafts:
Peter David Fagan
Preprint (v2), 2026
A conservation-congruent encoding (CCE) is a physically realized macroscopic distinction, unlike the abstract, substrate-independent notion of information used in traditional information theory. Under a chosen coarse-graining, it is represented by protected macroscopic regions and their associated world-tubes that persist under ambient fluctuations, are maintained by dynamical invariants tied to conserved quantities, and can be irreversibly merged only through dissipative export into explicitly modeled channels. This note gives a minimal definition of a CCE.
Peter David Fagan
Preprint, 2026
The foundations of standard cosmology rely on modelling reality with continuous macroscopic field equations. This note identifies an observer-resource assumption hidden by such equations and analyzes it using the Conservation-Congruent Encoding (CCE) framework. Within CCE, a projection Π is a coarse-graining of physical reality whose erasure, refinement, and maintenance carry energetic or informational costs. The note argues that several horizon-like limits also admit an observer-indexed operational reading within this ledger. Forward in time, heat death is read as Predictive Dissipation: the irreversible loss of macroscopic signal when a projection truncates the metric exhaust required to hold that signal distinct from the bath. Backward in time, the Big Bang singularity is read as Retrodictive Divergence: the divergence of the Landauer-scale cost required to re-instantiate erased branch distinctions under an assumption of costless resolution. Cosmological expansion and redshift are treated as standard geometric identities with an additional CCE bookkeeping interpretation, while gravitational singularities mark lower area-capacity limits for physically instantiated projections. The result is not a new dynamics, but a stricter operational license for using continuous models: observers do not access an infinite continuous universe at arbitrary precision, but work within a finite epistemic bubble bounded by their physical embedding.
Peter David Fagan
Preprint, 2026
We ask how apparently fundamental laws can arise from the physics of observation itself. In the Conservation-Congruent Encoding framework, an observer is a finite material device whose records must be stored and repeatedly reset. We make that bookkeeping explicit through developing an example case of a one-bit observer: a particle in a symmetric double-well potential immersed in a homogeneous thermo-acoustic medium. Incoming acoustic packets flip the bit reversibly, while a time-dependent control protocol restores the ready state. Because reset is logically irreversible and occurs while the bit remains coupled to the bath, each cycle dissipates at least kBT0ΔHcg, so under a reset rate ν the observer acts as a localized heat source with mean power P ≥ νkBT0ΔHcg. In steady state this produces a thermal halo δT(r) = P/(4πkr), which in a focusing medium induces the refractive profile n(r) = n0(1 + Γ/r). The resulting ray bending enlarges the capture cross-section and, in the weak-field limit, is mathematically equivalent to motion in an attractive 1/r potential. An external analyst restricted to the reduced event stream can therefore mistake self-induced bath distortion for an intrinsic force law. We then sketch a speculative gravitational extrapolation in which erased information is absorbed by local horizons, Newton's constant becomes Vacuum Informational Compliance, and a positive cosmological constant sets a deep-field crossover scale; with additional equilibrium assumptions, the weak-field closure can then be lifted toward the Einstein field equations.
Peter David Fagan
Preprint, 2026
This research note revisits Leibniz's mill, Turing's imitation game, and Searle's Chinese Room through the Conservation-Congruent Encoding (CCE) framework. It formalises a toy symbolic setting in which successful behaviour is measured by task performance (Wcausal,T), while the efficiency with which preserved internal structure supports that behaviour is measured by operational consciousness (κT). Within this setup, an uncompressed lookup system and a compact generative system can in principle achieve comparable behavioural success, yet diverge sharply in κT: the former relies on an expanding standing store of unreused mappings, whereas the latter reuses compact internal structure. The note therefore reframes classic disputes about understanding by separating outward performance from the organisation that sustains it, and motivates why this distinction may matter for later AI-safety analysis.
Peter David Fagan
Preprint, 2026
We apply the Conservation-Congruent Encoding (CCE) framework to the P versus NP problem by explicitly modeling the thermodynamic tradeoff between reversible information processing (Irev) and irreversible information processing (Iirr). While constructive algorithms theoretically avoid exponential candidate generation, worst-case NP-complete problems possess constraint topologies that are logically irreducible. Mapping this implicitly exponential constraint density into a poly(N) physical memory forces continuous intermediate state erasure. Under the physical identity χ = κ(Irev/Iirr), we demonstrate that processing irreducible logical structures strictly triggers an exponential Landauer tax, yielding the physical contradiction poly(N) + poly(N) ≥ Θ(2N). We present a physical constraint on scalable realizations of worst-case search on digital substrates, independent of formal mathematical shortcuts.
Peter David Fagan
Preprint, 2026
We present a physical framework for computational efficiency in AI by mapping classical complexity to informational costs measured in nats. Algorithmic time is treated as irreversible information processing (Iirr), while space is treated as preserved information processing (Irev). Using these quantities, we define operational intelligence (χ = Wachieved/Iirr), structural consciousness (κ = Wachieved/Irev), and retention (ρ = Wachieved/Wtarget) to separate task demand from architecture-level performance. We ground these metrics in lightweight Conservation-Congruent Encoding (CCE) conditions, which specify when coarse-grained informational states admit metastable physical realizations and when irreversible state collapse must export entropy through conserved channels. We then apply the framework to sorting algorithms and modern sequence models, showing that dominant scaling bottlenecks are physical routing burdens rather than software abstraction alone. Under this lens, Euclidean GPU-style layouts impose congestion costs that can preserve near-quadratic pressure even when nominal attention complexity is reduced. As an illustrative topology-only proxy, a constrained optimization with N = 128 and k ≤ 4 shifts from a planar baseline (D = 21) to an expander-like layout (D = 5), reducing hop-count routing proxy from Ihopirr = 2688 to Ihopirr = 640 (76%) and increasing proxy χ by 4.2× at fixed transceiver budget. Because this surrogate omits post-layout wire length, capacitance, and Rent-style pin-limited embedding effects, these gains are reported as upper bounds pending physical place-and-route validation.
Peter David Fagan
Preprint (v2), 2026
While often treated as abstract algorithmic properties, intelligence and computation are ultimately physical processes constrained by conservation laws. We introduce the Conservation-Congruent Encoding (CCE) framework as a unified, substrate-neutral physical framework for studying intelligence. We propose that information processing emerges when open systems undergo irreversible transitions, carving out macroscopic states from underlying reversible micro-dynamics. Generalizing Landauer's principle to arbitrary conserved quantities via metriplectic flows, we derive a universal bound for macroscopic computation. This yields physical metrics for intelligence and an operational analogue for consciousness, quantifying an agent's ability to extract work from the environment while minimizing its own dissipative dynamics. Applying CCE to the limits of physical observation, we model measurement as an active coarse-graining process rather than a passive projection. At the quantum scale, CCE recovers the Lindblad Master Equation, consistent with modelling decoherence as the dissipative exhaust required to record a measurement. Scaling to cosmological limits, we explore the hypothesis that gravity emerges as the macroscopic geometric footprint of these bounds. We show that, under this hypothesis, measurement-induced dissipation is consistent with a volumetric phase-space collapse, offering a dynamical route to the Bekenstein-Hawking area law. Equating the Landauer exhaust of this coarse-graining to horizon deformation outlines a limiting-case recovery of the Einstein Field Equations. Ultimately, by establishing a substrate-neutral link between thermodynamic dissipation, quantum measurement, and spacetime geometry, CCE provides physical constraints for understanding both natural and artificial intelligence.
This is a working manuscript that proposes a unified physical framework for studying intelligence. Several of the broader implications—particularly those bridging macroscopic information bounds with quantum and cosmological limits—are presented as formal hypotheses to be refined through extended proofs and empirical validation. The overarching goal is to anchor abstract computation in fundamental physical laws and, in doing so, establish rigorous, geometric constraints for the safe development of artificial intelligence.
Peter David Fagan
Preprint, 2025
We introduce a framework for applying keyed chaotic dynamical systems to encrypt and decrypt tensors in machine learning pipelines. This lightweight, deterministic approach enables authenticated inference without modifying model architectures or requiring retraining. Designed for privacy-first AI, this method provides a new building block at the intersection of cryptography, dynamical systems, and neural computation.
Peter David Fagan, Subramanian Ramamoorthy
Preprint, 2024
We introduce a new recurrent neural network layer that incorporates fixed nonlinear dynamics models where the dynamics satisfy the Echo State Property. We show that this neural network layer is well suited to the task of overcoming compounding errors under the learning from demonstration paradigm. Through evaluating neural network architectures with/without our layer on the task of reproducing human handwriting traces we show that the introduced neural network layer improves task precision and robustness to perturbations all while maintaining a low computational overhead.
Rimvydas Rubavicius, Peter David Fagan, Alex Lascarides, Subramanian Ramamoorthy
Preprint, 2025
In this work, we introduce an interactive task learning framework to cope with unforeseen possibilities by exploiting the formal semantic analysis of embodied conversation.
The DROID Dataset Team
Robotics: Science and Systems (R:SS), 2024
In this work, we introduce DROID (Distributed Robot Interaction Dataset), a diverse robot manipulation dataset comprising 76k demonstration trajectories or 350 hours of interaction data, collected across 564 scenes and 86 tasks by 50 data collectors in North America, Asia, and Europe over the course of 12 months.
Open X-Embodiment Team
IEEE International Conference on Robotics and Automation (ICRA), May 2024
In this work, we introduce Open X-Embodiment, a comprehensive collection of robotic learning datasets and RT-X models. These datasets and models facilitate research in embodied AI by providing large-scale, diverse, and realistic environments for training robotic systems. The datasets cover a wide range of tasks and scenarios, enabling robots to learn complex behaviors through interaction with their environment.
Peter David Fagan
Google Summer of Code, 2022
This is the official Python binding for the MoveIt 2 library.