Peter David Fagan

"We are macroscopic observers in a microscopic reality." As macroscopic observers, we cannot directly access the microscopic reality we occupy. The Conservation-Congruent Encoding (CCE) framework seeks to formalise this limitation, treating measurement and observation as physical acts bounded by conservation laws.

Google Scholar  |  GitHub

profile photo

I'm building Corca Health, an AI-assisted transdiagnostic screener for mental health and neurodevelopmental concerns. More broadly, my work is centred around leveraging AI to improve mental and physical wellbeing.

My research develops a physical theory to establish rigorous constraints for the safe development of artificial intelligence. Through the Conservation-Congruent Encoding (CCE) framework, I anchor computation in physical conservation laws. This work derives physical metrics for intelligence and consciousness, exposes the Platonic Observer Fallacy inherent in modeling microscopic realities with macroscopic equations, and shows how observer information processing/measurement relates to established physical theories.

Conceptual Notes

Quantifying Intelligence

Click to expand

The Core Hypothesis

While often treated as abstract, substrate-independent algorithmic properties, intelligence and computation are ultimately physical processes constrained by conservation laws. The core hypothesis of this framework is that information processing emerges when open systems undergo irreversible transitions, carving out stable macroscopic states from underlying reversible micro-dynamics. Under this physical lens:

  • Intelligence (χ) measures the physical efficiency of goal-directed computation—specifically, the amount of useful causal work extracted per unit of irreversible (distinction-destroying) information processing.
  • Consciousness (κ) is operationalized as the physical efficiency of internal organization—specifically, the amount of goal-directed work supported per unit of preserved (reversible) internal informational structure.

Grounding Information in Physical Substrates (CCE)

To measure information physically rather than abstractly, we utilize the Conservation-Congruent Encoding (CCE) framework. Traditional information theory (e.g., Shannon entropy) measures logical uncertainty, but is blind to equilibrium asymmetries and reservoir exchanges. CCE grounds information in matter by defining a macroscopic distinction as a family of protected regions (metastable basins of attraction) in state space whose boundaries are stabilized by conservation laws.

Under CCE, informational state changes are mapped directly to physical fluxes:

  • Irreversible Computation (Iirr): Many-to-one logical operations (such as resetting a register or merging states) compress macroscopic support. This collapse of distinctions forces a proportional export of entropy—measured in nats—through specific conserved channels (e.g., heat dissipation in a thermal bath, satisfying Landauer-type bounds).
  • Reversible Manipulation (Irev): One-to-one logical operations transport or advect the state within protected world-tubes without crossing boundaries or destroying distinctions. This preserves internal structure, incurring zero thermodynamic penalty in the quasistatic limit.
  • Controlled Expansion: One-to-many retrodictive operations reopen macroscopic support, importing conjugate conserved loads (e.g., absorbing heat) to re-instantiate the branch variables.

From Physical Information to the Core Identity

By defining both intelligence (χ) and consciousness (κ) as ratios of goal-directed work (W) to their respective informational currencies (Iirr and Irev), we isolate how an agent utilizes physical resources to influence its environment.

Solving the consciousness relation for work yields W = κ Irev. Substituting this into the definition of intelligence gives the identity:

χ = κ ( Irev / Iirr )

This identity shows that an agent's operational intelligence is the product of its structural consciousness and its processing efficiency (the ratio of preserved to destroyed distinctions). A system can achieve high intelligence either by executing massive, energy-intensive irreversible computations (high Iirr) or by relying on complex, pre-compiled internal models that advect information reversibly (utilizing Irev).

Toy Model: The Turbulent Stream (Morphological vs. Brute-Force Computation)

To study the relationship \chi = \kappa (I_{\text{rev}} / I_{\text{irr}}), we simulate two macroscopic agents swimming upstream through a channel. The environment dynamics feature a background current flowing right-to-left, carrying a synchronized stream of baseline vortices that drift with the flow.

Note: This simulation serves as a simplified, pedagogical toy model. In actual physical implementations, system-environment boundaries are often blurred (where the fluid itself performs part of the morphological computation), real-world thermodynamic efficiencies deviate significantly from the idealized Landauer limit, and defining "useful work" incorporates goal-dependent, observer-relative assumptions.

Both agents start from the left side and swim toward the goal line at the right side under the following dynamics:

Agent A: The Brute-Force Microbot

Agent A is modeled with a rigid body and operates via discrete sensory-motor feedback loops. Its dynamics are governed by:

  • Control Loop: It measures local fluid velocity, calculates a steering and thrust correction vector, and fires its thrusters.
  • Thermodynamic Cost: Every control iteration requires erasing memory buffers, exporting Landauer heat (I_{\text{irr}} \gg 0) into the fluid.
  • Fluid Interaction: The exported heat is modeled as a thermal backreaction that spawns secondary thermal vortices directly in the agent's path.
Agent B: The Morphological Eel

Agent B is modeled with a multi-segment soft body representing coupled, resonant oscillators. Its dynamics are governed by:

  • Mechanical Response: As the fluid current and vortices flow around the multi-segment body, the external forces physically shift the phases of its oscillators.
  • Thermodynamic Cost: It undergoes reversible state changes along protected symplectic phase boundaries without logical resets (I_{\text{irr}} \approx 0).
  • Fluid Interaction: The agent undulates in phase with the incoming vortices, exchanging momentum directly with the fluid flow fields.
The Environmental Shift: Adding Obstacles

Enabling the obstacles toggle introduces identical stationary basalt rock obstacles into both simulation lanes. This shifts the channel from a periodic vortex stream to an out-of-distribution environment, highlighting the epistemic trade-off between pre-compiled efficiency and active, general adaptability:

  • Spatial Constraints: The obstacles represent non-periodic physical boundaries in the fluid channel that disrupt the baseline vortex flow structures.
  • Active Steering vs. Passive Coupling: Agent A utilizes its active sensory scan cone to detect obstacles and adjust its vertical trajectory. Agent B lacks active sensors, relying on its soft body oscillators which are pre-compiled for fluid vortices.
  • Thermodynamic Trade-off: While Agent B achieves high computational efficiency (\chi) in periodic flow, it fails to adapt to non-periodic boundaries, colliding and losing goal-directed work. Agent A incurs a high irreversible computation cost (I_{\text{irr}}) through active sensing and trajectory calculation, but this cost allows it to ingest novel information and successfully navigate out-of-distribution environmental discontinuities.
Mathematical Outline of the Dynamics

The environment and agent dynamics are mathematically modeled as follows:

u(x, y) = -1.2 \cdot \text{gradient} + \sum_{j} u_{\text{vortex}}^{(j)}(x, y)
u_{\text{vortex}}^{(j)}(x, y) = \frac{-(y - y_j)}{d_j} \left(1 - \frac{d_j}{R_j}\right) \Gamma_j \quad (\text{for } d_j < R_j)

where d_j = \sqrt{(x - x_j)^2 + (y - y_j)^2} is the distance to vortex j, R_j is its radius, and \Gamma_j is its strength.

The information ledgers and goal-directed work use an exponential moving average (EMA) with a decay factor of 0.995 per frame:

W(t) = 0.995 \cdot W(t-1) + \Delta W(t) + \delta_{\text{goal}} \cdot 2000.0
I_{\text{rev}}(t) = \max(0.1, 0.995 \cdot I_{\text{rev}}(t-1) + \Delta I_{\text{rev}}(t))
I_{\text{irr}}(t) = \max(0.1, 0.995 \cdot I_{\text{irr}}(t-1) + \Delta I_{\text{irr}}(t))

For Agent A (Brute-Force Microbot), progress is steering-dependent, and sensory/control loops add to its reversible and irreversible costs:

\Delta W^A(t) = \delta_{\Delta x^A > 0} \cdot (x_t^A - x_{t-1}^A) \cdot 0.45
\Delta I_{\text{rev}}^A(t) = \delta_{t,\text{sensing}} \cdot 0.5(1+\sigma_{\text{noise}}) + \delta_{t,\text{thrust}} \cdot 1.5(1+\sigma_{\text{noise}}) + \delta_{t,\text{steering}} \cdot 0.2(1+\sigma_{\text{noise}})
\Delta I_{\text{irr}}^A(t) = \delta_{\text{erase}} \cdot k_B T \ln 2 + I_{\text{exhaust}}

For Agent B (Morphological Eel), internal wave-like undulation adds a constant baseline reversible cost, while vortex surfing provides passive energy extraction boosts. Its irreversible cost only spikes if extreme thermal noise knocks it out of its resonant phase:

\Delta W^B(t) = \delta_{\Delta x^B > 0} \cdot \left[ (x_t^B - x_{t-1}^B) \cdot 0.45 + \delta_{\text{exploiting}} \cdot 0.8(1 + \text{gradient}) \right]
\Delta I_{\text{rev}}^B(t) = 0.15 \cdot (1.0 + 0.5 \sigma_{\text{noise}})
\Delta I_{\text{irr}}^B(t) = \delta_{\text{thermal\_knock}} \cdot I_{\text{friction}}

Interactive Visualization: The Turbulent Stream Ledger

Useful Work (W) Terms:
Agent A: Brute-Force Microbot
Work Performed (W): 0.0 nats
Irreversible Cost (I_{\text{irr}}): 0.0 nats
Reversible Cost (I_{\text{rev}}): 0.0 nats
Intelligence (\chi): 0.00
Consciousness (\kappa): 0.00
Agent B: Morphological Eel
Work Performed (W): 0.0 nats
Irreversible Cost (I_{\text{irr}}): 0.0 nats
Reversible Cost (I_{\text{rev}}): 0.0 nats
Intelligence (\chi): 0.00
Consciousness (\kappa): 0.00

Two agents swimming upstream (left to right) through a turbulent fluid channel. Agent A (red microbot) utilizes discrete measurements and thrusters, exporting Landauer heat plumes (I_{\text{irr}}) that generate thermal vortices in its path. Agent B (green eel) utilizes a soft body of resonant coupled oscillators (I_{\text{rev}}) interacting with the fluid velocity fields. A toggle allows introducing identical stationary obstacles to both lanes.

AI Safety

AI safety is traditionally treated as a normative alignment problem. However, developing a formal framework to measure intelligence shifts the focus to physical boundaries and resource constraints. By defining intelligence as a measurable capacity to perform goal-directed work, we can establish the physical limits within which an agent operates.

Safety constraints can be studied mathematically via boundary dynamics and symbiotic coupling rather than isolated value alignment.

Cite this note
@misc{fagan2026quantifying,
  author = {Fagan, Peter David},
  title = {Quantifying Intelligence},
  howpublished = {\url{https://peterdavidfagan.github.io/\#concepts-equation-details}},
  year = {2026},
  note = {Online; accessed 30-May-2026}
}
Fagan, P. D. (2026). Quantifying Intelligence. Peter David Fagan's Personal Website. Retrieved May 30, 2026, from https://peterdavidfagan.github.io/#concepts-equation-details

Platonic Observer Fallacy

Click to expand

The Fallacy of the Detached Observer

In classical physics and computational theory, observers frequently treat macroscopic equations of motion as absolute and decoupled from the microscopic substrates they describe. This assumption—the Platonic Observer Fallacy—imagines that coarse-grained states and continuous field variables are objective entities that can be measured, tracked, and simulated without physical cost. By neglecting that observation is a physical act bounded by conservation laws, standard modeling hides a growing informational deficit.

To expose this fallacy, we sequence our argument as follows: First, we establish how to formally account for information, highlighting the difference between pure logical Shannon accounting and substrate-aware CCE dynamical accounting. Second, we equate this informational accounting to physical energy cost, demonstrating via an interactive asymmetric double-well simulator how substrate properties and conjugate potentials dictate the minimum energy required to execute logical operations. Finally, we scale these principles to show how deterministic macroscopic equations silently discard microscopic coordinates, causing the model to lose information and diverge over time, revealing the faulty accounting of detached observers.

1. Information Accounting: Shannon vs. CCE

The difference between Shannon's formulation and the Conservation-Congruent Encoding (CCE) framework lies in the distinction between pure logical accounting and dynamical accounting.

Under Shannon's formulation, information is logical and substrate-independent. For a discrete random variable X representing symbolic states, the Shannon entropy measures uncertainty purely from the state probabilities:

H_S(X) = -\sum_{x \in \mathcal{X}} p(x) \ln p(x)

If a logical operation maps a random variable from an initial probability distribution p to a final distribution p_{\text{final}}, Shannon's logical accounting reports the information change purely as the difference in logical entropy:

I_{\text{logical}} = H_S(p) - H_S(p_{\text{final}})

This accounting does not care about the physical characteristics of the underlying states, making it symmetric and blind to the physical effort required to maintain, transition, or merge them.

CCE, by contrast, relies on dynamical accounting. It grounds information in matter by mapping logical states to protected phase-space volumes stabilized by conservation laws. Let the unconstrained equilibrium distribution of the physical substrate be \pi. In CCE accounting, the physical information content of a macroscopic state distribution p is measured using the Kullback-Leibler (KL) divergence relative to this equilibrium distribution:

I_{\text{CCE}}(p) = D_{\text{KL}}(p \parallel \pi) = \sum_i p_i \ln \frac{p_i}{\pi_i}

This divergence represents the physical distinction held by the observer's encoding relative to the ambient substrate dynamics. The minimum irreversible cost I_{\text{irr}} of transitioning the system from distribution p to p_{\text{final}} is determined by the difference in their physical distinctions:

I_{\text{irr}} \ge I_{\text{CCE}}(p_{\text{final}}) - I_{\text{CCE}}(p) = D_{\text{KL}}(p_{\text{final}} \parallel \pi) - D_{\text{KL}}(p \parallel \pi)

To see the impact of this dynamical ledger, consider an asymmetric bit with two macroscopic basins: Left (L) and Right (R), forming the logical state space \mathcal{X} = \{L, R\}. Due to differences in energy levels, boundary geometry, or coupling to the environment, the phase-space volumes corresponding to these states are unequal: V_R = \alpha V_L (with \alpha \neq 1). The prior equilibrium distribution is:

\pi = (\pi_L, \pi_R)^T = \left(\frac{V_L}{V_L + V_R}, \frac{V_R}{V_L + V_R}\right)^T = \left(\frac{1}{1+\alpha}, \frac{\alpha}{1+\alpha}\right)^T

We represent probability distributions over \mathcal{X} as vectors in the 1-simplex, p = (p(L), p(R))^T \in \Delta^1.

A. Reset-to-Left Operation

If we prepare the system in an initial distribution p = (p, 1-p)^T and subsequently execute a logically irreversible Reset-to-Left operation (yielding a final distribution p_{\text{final}} = (1, 0)^T, where the system occupies basin L with certainty), the logical Shannon cost is I_{\text{logical}} = H_S(p) - H_S(p_{\text{final}}) = H_S(p) - 0 = H_S(p). However, the CCE dynamical cost is:

\begin{aligned} I_{\text{irr}} &\ge D_{\text{KL}}(p_{\text{final}} \parallel \pi) - D_{\text{KL}}(p \parallel \pi) \\ &= \ln\frac{1}{\pi_L} - \left( p\ln\frac{p}{\pi_L} + (1-p)\ln\frac{1-p}{\pi_R} \right) \\ &= -\ln\pi_L - p\ln p + p\ln\pi_L - (1-p)\ln(1-p) + (1-p)\ln\pi_R \\ &= -p\ln p - (1-p)\ln(1-p) - (1-p)\ln\pi_L + (1-p)\ln\pi_R \\ &= H_S(p) + (1-p)\ln\frac{\pi_R}{\pi_L} \\ &= H_S(p) + (1-p)\ln\alpha \end{aligned}

The CCE dynamical accounting recovers the logical Shannon term H_S(p) plus a physical asymmetry penalty (1-p)\ln\alpha.

Crucially, the asymmetry parameter \alpha is a fixed physical property of the hardware (the geometric or energetic ratio of the basins), whereas how this asymmetry penalizes or discounts the transition is determined by the direction of the operation. Here, because \alpha > 1 (meaning the Right basin is larger), sweeping probability out of the larger basin into the smaller Left basin requires compressing the phase space. This direction-dependent physical effort is why the operation pays a penalty of +(1-p)\ln\alpha.

B. Reset-to-Right Operation

If we instead executed a Reset-to-Right operation (sweeping the system into the larger basin, so p_{\text{final}} = (0, 1)^T), the CCE dynamical cost would be:

\begin{aligned} I_{\text{irr}} &\ge D_{\text{KL}}(p_{\text{final}} \parallel \pi) - D_{\text{KL}}(p \parallel \pi) \\ &= \ln\frac{1}{\pi_R} - \left( p\ln\frac{p}{\pi_L} + (1-p)\ln\frac{1-p}{\pi_R} \right) \\ &= -\ln\pi_R - p\ln p + p\ln\pi_L - (1-p)\ln(1-p) + (1-p)\ln\pi_R \\ &= -p\ln p - (1-p)\ln(1-p) + p\ln\pi_L - p\ln\pi_R \\ &= H_S(p) - p\ln\frac{\pi_R}{\pi_L} \\ &= H_S(p) - p\ln\alpha \end{aligned}

In this direction, the penalty becomes a thermodynamic discount of -p\ln\alpha. Because the operation allows the system to expand from the smaller, constrained Left basin into the larger Right basin, the physical substrate assists the transition. Under pure Shannon accounting, both operations cost the same logical H_S(p) nats. Under CCE, the logical ledger is bound to the physical hardware: compressing data against the substrate's natural asymmetry costs extra work, while expanding with it recovers work.

Interactive Simulator: Asymmetric Double-Well Reset

ΔU = 0.00 nats L (p = 0.50) R (1-p = 0.50) Barrier V_L = 1.0 V_R = 2.0

Wells represent Left (L) and Right (R) basins. The potential landscape fill and basin depths scale with volume asymmetry. Ball sizes scale with probability mass.

Shannon Logical Accounting
Initial Entropy H_S(p): -
Final Entropy H_S(p_{\text{final}}): -
Logical Cost I_{\text{logical}}: -
CCE Dynamical Accounting
Initial KL D_{\text{KL}}(p \parallel \pi): -
Final KL D_{\text{KL}}(p_{\text{final}} \parallel \pi): -
Irreversible Cost I_{\text{irr}}: -
Physical Cost & Conservation Laws

In the CCE framework, physical information processing costs are determined by the conservation laws stabilizing state boundaries. Under an intensive potential (such as temperature, voltage, or chemical potential), the minimum physical energy cost is given by the generalized equation:

\Delta E \ge \text{Scale} \times \text{Conjugate} \times \text{Info}
Characteristic Scales (Constants):
  • Thermal: Boltzmann constant (k_B)
  • Electronic: Elementary charge (e)
  • Biological: Unit factor (1 molecule)
Conjugate Forces (Intensive Potentials):
  • Thermal: Temperature (T)
  • Electronic: Voltage Bias (V_0)
  • Biological: Chemical Gradient (\Delta \mu)

For Thermal substrates this yields \Delta E \ge k_B T I_{\text{irr}}, whereas Electronic and Biological substrates yield the work bounds \Delta E_{\text{work}} \ge e V_0 I_{\text{irr}} and \Delta E_{\text{chem}} \ge \Delta \mu I_{\text{irr}} under their respective biases.

2. From Micro-Reality to Macro-Equations

We now scale this concept from a single bit to a macroscopic equation modeling a high-dimensional microscopic reality. Let the true microscopic state of a system be x in a high-dimensional phase space \mathcal{M}, with its probability distribution evolving under the microscopic Liouville or Fokker-Planck flow as P_{\text{micro}}(x, t).

A macroscopic observer uses a coarse-graining projection operator \Pi: \mathcal{M} \to \mathcal{C} to map these microstates to a reduced, continuous field variable \phi(t) \in \mathcal{C}. The observer models the system using a macroscopic equation of motion:

\dot{\phi}(t) = f(\phi(t))

By relying solely on \phi(t), the observer implicitly assumes a microscopic density constructed via the maximum-entropy (or local equilibrium) lift operator \Pi^*:

P_{\text{macro}}(x, t) = \Pi^* \phi(t)

However, the true microscopic distribution P_{\text{micro}}(x, t) evolves under the full chaotic and coupled microscopic laws. As time progresses, microscopic interactions generate fine-grained fluctuations, correlations, and gradients across the boundaries of the coarse-grained cells. These details are neglected by P_{\text{macro}}(x, t), which assumes local equilibrium within the macroscopic states.

Under the CCE framework, this informational mismatch D_{\text{KL}}\left(P_{\text{micro}} \parallel P_{\text{macro}}\right) is not merely an abstract distance; it represents a physical mismatch in phase-space coordinates that carries a literal energetic price tag. If the observer wishes to prevent this predictive drift—actively keeping the real physical system aligned with the macroscopic prediction P_{\text{macro}} via feedback control—they must perform corrective measurements and physical operations that dissipate energy:

\Delta E_{\text{diss}} \ge k_B T \, D_{\text{KL}}\left(P_{\text{micro}} \parallel P_{\text{macro}}\right)

This inequality establishes a direct physical link: the informational divergence in nats determines the minimum energy the observer must dissipate to maintain the validity of their macroscopic model. If they do not pay this energy bill, the model and reality will physically diverge. In the following section, we analyze the exact shape of this energy ledger by tracing the rates of divergence when projecting a macroscopic system forward and backward.

3. Nats of Divergence: A Concrete Example

The difference between prediction and retrodiction under a macroscopic model can be illustrated using a classical physical system: a particle of mass m moving in a viscous fluid with friction coefficient \gamma.

Let the true microscopic dynamics of the particle's velocity v be stochastic due to collisions with fluid molecules (thermal noise), described by the Langevin equation:

dv = -\gamma v dt + \sqrt{2D} dW_t

where D is the diffusion coefficient and W_t is a standard Wiener process. The true microscopic probability density P_{\text{micro}}(v, t) evolves under the Fokker-Planck equation, tending toward the thermal equilibrium variance \sigma_{\text{eq}}^2 = D/\gamma.

A macroscopic observer ignores the thermal fluctuations and models the velocity using the deterministic decay equation:

\dot{v}_{\text{macro}} = -\gamma v_{\text{macro}}

The observer's measurement instrument has a finite resolution represented by a narrow Gaussian of variance \sigma_0^2 (where \sigma_0^2 \ll \sigma_{\text{eq}}^2).

A. Prediction Divergence

Suppose the observer prepares the particle in a highly localized velocity state v(0) = v_0 with variance \sigma_0^2 at t = 0. After a time interval T, the true microscopic distribution spreads due to diffusion:

P_{\text{micro}}(v, T) = \mathcal{N}\left(v_0 e^{-\gamma T}, \, \sigma_0^2 e^{-2\gamma T} + \sigma_{\text{eq}}^2(1 - e^{-2\gamma T})\right)

Meanwhile, the macroscopic model predicts v_{\text{macro}}(T) = v_0 e^{-\gamma T} and assumes the measurement resolution remains \sigma_0^2, yielding P_{\text{macro}}(v, T) = \mathcal{N}(v_0 e^{-\gamma T}, \sigma_0^2).

The predictive divergence in nats is the Kullback-Leibler divergence between these two distributions:

D_{\text{pred}}(T) = D_{\text{KL}}\left(P_{\text{micro}}(v, T) \parallel P_{\text{macro}}(v, T)\right) = \ln\frac{\sigma_0}{\sigma_T} + \frac{\sigma_T^2}{2\sigma_0^2} - \frac{1}{2}

where \sigma_T^2 = \sigma_0^2 e^{-2\gamma T} + \sigma_{\text{eq}}^2(1 - e^{-2\gamma T}). As T becomes large, the true variance approaches the thermal variance \sigma_{\text{eq}}^2, and the predictive divergence plateaus at a constant level:

D_{\text{pred}}(T) \approx \frac{\sigma_{\text{eq}}^2}{2\sigma_0^2} - \ln\frac{\sigma_{\text{eq}}}{\sigma_0} - \frac{1}{2}

This represents the information lost by ignoring environmental fluctuations, which is bounded by the ratio of thermal noise to observer resolution.

B. Retrodictive Divergence

Now suppose the observer measures the velocity at time T to be v(T) = v_T. To reconstruct the past state at t = 0, the macroscopic modeler runs the deterministic equation backward in time, yielding the retrodictive estimate v_{\text{macro}}(0) = v_T e^{\gamma T} with resolution variance \sigma_0^2.

However, the true microscopic retrodiction is the Bayesian posterior distribution of the initial state given the measurement. Under a thermal prior P(v(0)) = \mathcal{N}(0, \sigma_{\text{eq}}^2), the posterior distribution is:

P_{\text{micro}}(v(0) \mid v_T) = \mathcal{N}\left(v_T e^{-\gamma T}, \, \sigma_{\text{eq}}^2(1 - e^{-2\gamma T})\right)

Because high-energy states are exponentially rare under the thermal prior, observing a velocity v_T today does not mean the particle started with a massive velocity v_T e^{\gamma T} that slowly dissipated. Rather, it is overwhelmingly more probable that the particle was near thermal equilibrium (near zero) in the past, and a recent random thermal fluctuation pushed it to v_T. The macroscopic model completely ignores this thermal prior, hallucinating a physically exorbitant, high-energy history.

The retrodictive divergence in nats between this true historical distribution and the macroscopic model's backward projection is:

D_{\text{retro}}(T) = D_{\text{KL}}\left(P_{\text{micro}}(v(0) \mid v_T) \parallel P_{\text{macro}}(v(0))\right) = \ln\frac{\sigma_0}{\sigma_{\text{post}}} + \frac{\sigma_{\text{post}}^2 + \left(v_T e^{-\gamma T} - v_T e^{\gamma T}\right)^2}{2\sigma_0^2} - \frac{1}{2}

As T grows, the difference between the true posterior mean and the macroscopic model's retrodiction grows exponentially. The divergence is dominated by the mean mismatch:

D_{\text{retro}}(T) \approx \frac{v_T^2 \left(e^{\gamma T} - e^{-\gamma T}\right)^2}{2\sigma_0^2} \approx \frac{v_T^2 e^{2\gamma T}}{2\sigma_0^2}

Unlike the prediction divergence which plateaus, the retrodictive divergence grows exponentially with T. This asymmetry represents the informational debt of coarse-graining: the macroscopic model throws away phase-space volume in the forward direction, which requires exponential precision (information) to reconstruct in reverse.

Visualizing Macroscopic Divergence (Prediction vs. Retrodiction)

Divergence at Boundaries (t = \pm 2.5\text{s}): Future (D_{\text{pred}}): 0.00 nats Past (D_{\text{retro}}): 0.00 nats

Left half (t < 0) shows the Past (Retrodiction); right half (t > 0) shows the Future (Prediction). Shaded blue plume shows the time-symmetric diffusing microscopic distribution (P_{\text{micro}}). The solid red line shows the macroscopic model (P_{\text{macro}}), which decays to 0 on the right but shoots up exponentially on the left. The divergence chart (bottom) uses a logarithmic scale to resolve both prediction and retrodiction.

4. The Platonic Observer Fallacy: Physical Boundaries of Macroscopic Models

Under the Conservation-Congruent Encoding (CCE) framework, a continuous macroscopic equation is not a passive mirror of reality; it is a physical mapping executed by an observer. The Platonic Observer Fallacy assumes this mapping can be pushed to arbitrary temporal limits (T \to \pm\infty) without exacting a physical cost.

By subjecting macroscopic models to this physical ledger, their asymptotic extremes cease to be abstract mathematical curiosities. They represent the literal operational bounds of any embedded physical system attempting to maintain a continuous projection.

A. Predictive Dissipation: The Informational Whiteout

In the forward direction, the Platonic fallacy assumes the observer can track a pristine macroscopic signal indefinitely, implicitly granting them infinite resources to actively suppress underlying environmental fluctuations. For any physically embedded observer, this active suppression is bounded by their substrate's physical capacity. As the tracked state diffuses entirely into the equilibrium of the surrounding environment, the predictive divergence plateaus. This is Predictive Dissipation—the exact boundary where the observer’s physical capacity to isolate and maintain a distinct macroscopic signal is exhausted. The mathematical model does not break, but the observer is left in a bounded informational whiteout, unable to resolve the state from the background noise.

B. Retrodictive Divergence: Physical Bankruptcy

In the backward direction, the Platonic fallacy assumes costless resolution. By ignoring the physical prior and running dissipative operations in reverse, the macroscopic model hallucinates an exponentially diverging phase-space volume to justify the system's history. To actually map this retrodictive calculation to reality, the observer must physically encode an exploding number of diverging micro-histories into their local memory substrate.

Because the observer possesses a finite phase-space capacity, they cannot indefinitely support this geometric growth. Therefore, the asymptotic failure of a backward-running macroscopic equation is not a fundamental breakdown of physical law; it is Retrodictive Divergence. It represents the exact temporal coordinate where the observer goes computationally and physically bankrupt, lacking the fundamental phase-space capacity to reconstruct the past.

Conclusion

By mapping macroscopic divergence to a physical substrate, the CCE framework exposes the Platonic Observer Fallacy as a fundamental mismatch between mathematical models and physical hardware. The asymmetry between Predictive Dissipation (which plateaus as information diffuses into environmental equilibrium) and Retrodictive Divergence (which grows exponentially) is not a mere mathematical curiosity; it is a physical ledger of the energy and phase-space constraints governing the observer. A continuous macroscopic equation is never a free, detached view of reality. Maintaining it forward requires actively dissipating energy to suppress noise, while executing it backward demands an exponentially growing memory capacity. Ultimately, the breakdown of these equations does not signal a failure of physical law, but rather the boundary where the observer’s physical hardware runs out of phase-space and energy—the point of physical bankruptcy.

Cite this note
@misc{fagan2026platonic,
  author = {Fagan, Peter David},
  title = {Platonic Observer Fallacy},
  howpublished = {\url{https://peterdavidfagan.github.io/\#concepts-platonic-observer-fallacy}},
  year = {2026},
  note = {Online; accessed 30-May-2026}
}
Fagan, P. D. (2026). Platonic Observer Fallacy. Peter David Fagan's Personal Website. Retrieved May 30, 2026, from https://peterdavidfagan.github.io/#concepts-platonic-observer-fallacy

Toward a Physical Theory of Intelligence

The following research notes represent an ongoing effort to map and structure a developing physical theory. To keep pace with theoretical ideation, I utilize AI tools to rapidly generate and formalize my work into research notes.

Please read these documents with the understanding that they are living drafts:

The Core Architecture The initial constraints, concepts, and theoretical leaps are human-authored.
The Textual Generation AI-assisted to accelerate the drafting process.
The Content is Under Active Validation I am continuously reviewing, mathematically checking, and refining these notes. Until finalized, they should be treated as developing hypotheses rather than rigorously proven theory.

Conservation-Congruent Encodings

Click to expand

Peter David Fagan

Preprint (v2), 2026

PDF

A conservation-congruent encoding (CCE) is a physically realized macroscopic distinction, unlike the abstract, substrate-independent notion of information used in traditional information theory. Under a chosen coarse-graining, it is represented by protected macroscopic regions and their associated world-tubes that persist under ambient fluctuations, are maintained by dynamical invariants tied to conserved quantities, and can be irreversibly merged only through dissipative export into explicitly modeled channels. This note gives a minimal definition of a CCE.

Epistemic Limits of the Embedded Observer

Click to expand

Peter David Fagan

Preprint, 2026

PDF

The Platonic observer is replaced by a physically embedded computational subsystem. Bounded by holographic encoding capacity and Conservation-Congruent Encoding (CCE), the observer is geometrically forced to project a massively large, finite set of discrete microscopic realities into a strictly smaller set of macroscopic equivalence classes. Consequently, state evolution under exact reversible dynamics becomes predictively intractable within the observer’s formal system. Expanding upon the physical limits of inference and the computational capacity bounds of the universe, this framework proves that computing the precise trajectory of the universe is physically impossible.

Cosmological Horizons as Epistemic Bounds of Conservation-Congruent Encodings

Click to expand

Peter David Fagan

Preprint, 2026

PDF

The foundations of standard cosmology rely on modelling reality with continuous macroscopic field equations. This note identifies an observer-resource assumption hidden by such equations and analyzes it using the Conservation-Congruent Encoding (CCE) framework. Within CCE, a projection Π is a coarse-graining of physical reality whose erasure, refinement, and maintenance carry energetic or informational costs. The note argues that several horizon-like limits also admit an observer-indexed operational reading within this ledger. Forward in time, heat death is read as Predictive Dissipation: the irreversible loss of macroscopic signal when a projection truncates the metric exhaust required to hold that signal distinct from the bath. Backward in time, the Big Bang singularity is read as Retrodictive Divergence: the divergence of the Landauer-scale cost required to re-instantiate erased branch distinctions under an assumption of costless resolution. Cosmological expansion and redshift are treated as standard geometric identities with an additional CCE bookkeeping interpretation, while gravitational singularities mark lower area-capacity limits for physically instantiated projections. The result is not a new dynamics, but a stricter operational license for using continuous models: observers do not access an infinite continuous universe at arbitrary precision, but work within a finite epistemic bubble bounded by their physical embedding.

Emergence of the Physical Laws of a Macroscopic Observer

Click to expand

Peter David Fagan

Preprint, 2026

PDF

We ask how apparently fundamental laws can arise from the physics of observation itself. In the Conservation-Congruent Encoding framework, an observer is a finite material device whose records must be stored and repeatedly reset. We make that bookkeeping explicit through developing an example case of a one-bit observer: a particle in a symmetric double-well potential immersed in a homogeneous thermo-acoustic medium. Incoming acoustic packets flip the bit reversibly, while a time-dependent control protocol restores the ready state. Because reset is logically irreversible and occurs while the bit remains coupled to the bath, each cycle dissipates at least kBT0ΔHcg, so under a reset rate ν the observer acts as a localized heat source with mean power P ≥ νkBT0ΔHcg. In steady state this produces a thermal halo δT(r) = P/(4πkr), which in a focusing medium induces the refractive profile n(r) = n0(1 + Γ/r). The resulting ray bending enlarges the capture cross-section and, in the weak-field limit, is mathematically equivalent to motion in an attractive 1/r potential. An external analyst restricted to the reduced event stream can therefore mistake self-induced bath distortion for an intrinsic force law. We then sketch a speculative gravitational extrapolation in which erased information is absorbed by local horizons, Newton's constant becomes Vacuum Informational Compliance, and a positive cosmological constant sets a deep-field crossover scale; with additional equilibrium assumptions, the weak-field closure can then be lifted toward the Einstein field equations.

Revisiting Classic Thought Experiments to Measure Consciousness for Artificial Intelligence Safety

Click to expand

Peter David Fagan

Preprint, 2026

PDF

This research note revisits Leibniz's mill, Turing's imitation game, and Searle's Chinese Room through the Conservation-Congruent Encoding (CCE) framework. It formalises a toy symbolic setting in which successful behaviour is measured by task performance (Wcausal,T), while the efficiency with which preserved internal structure supports that behaviour is measured by operational consciousness (κT). Within this setup, an uncompressed lookup system and a compact generative system can in principle achieve comparable behavioural success, yet diverge sharply in κT: the former relies on an expanding standing store of unreused mappings, whereas the latter reuses compact internal structure. The note therefore reframes classic disputes about understanding by separating outward performance from the organisation that sustains it, and motivates why this distinction may matter for later AI-safety analysis.

Physical Constraints on Realizing P=NP with Applications to Artificial Intelligence Safety

Click to expand

Peter David Fagan

Preprint, 2026

PDF

We apply the Conservation-Congruent Encoding (CCE) framework to the P versus NP problem by explicitly modeling the thermodynamic tradeoff between reversible information processing (Irev) and irreversible information processing (Iirr). While constructive algorithms theoretically avoid exponential candidate generation, worst-case NP-complete problems possess constraint topologies that are logically irreducible. Mapping this implicitly exponential constraint density into a poly(N) physical memory forces continuous intermediate state erasure. Under the physical identity χ = κ(Irev/Iirr), we demonstrate that processing irreducible logical structures strictly triggers an exponential Landauer tax, yielding the physical contradiction poly(N) + poly(N) ≥ Θ(2N). We present a physical constraint on scalable realizations of worst-case search on digital substrates, independent of formal mathematical shortcuts.

Intelligence, Consciousness and Designing for Computational Efficiency

Click to expand

Peter David Fagan

Preprint, 2026

PDF

We present a physical framework for computational efficiency in AI by mapping classical complexity to informational costs measured in nats. Algorithmic time is treated as irreversible information processing (Iirr), while space is treated as preserved information processing (Irev). Using these quantities, we define operational intelligence (χ = Wachieved/Iirr), structural consciousness (κ = Wachieved/Irev), and retention (ρ = Wachieved/Wtarget) to separate task demand from architecture-level performance. We ground these metrics in lightweight Conservation-Congruent Encoding (CCE) conditions, which specify when coarse-grained informational states admit metastable physical realizations and when irreversible state collapse must export entropy through conserved channels. We then apply the framework to sorting algorithms and modern sequence models, showing that dominant scaling bottlenecks are physical routing burdens rather than software abstraction alone. Under this lens, Euclidean GPU-style layouts impose congestion costs that can preserve near-quadratic pressure even when nominal attention complexity is reduced. As an illustrative topology-only proxy, a constrained optimization with N = 128 and k ≤ 4 shifts from a planar baseline (D = 21) to an expander-like layout (D = 5), reducing hop-count routing proxy from Ihopirr = 2688 to Ihopirr = 640 (76%) and increasing proxy χ by 4.2× at fixed transceiver budget. Because this surrogate omits post-layout wire length, capacitance, and Rent-style pin-limited embedding effects, these gains are reported as upper bounds pending physical place-and-route validation.

Toward a Physical Theory of Intelligence

Click to expand

Peter David Fagan

Preprint (v2), 2026

PDF

While often treated as abstract algorithmic properties, intelligence and computation are ultimately physical processes constrained by conservation laws. We introduce the Conservation-Congruent Encoding (CCE) framework as a unified, substrate-neutral physical framework for studying intelligence. We propose that information processing emerges when open systems undergo irreversible transitions, carving out macroscopic states from underlying reversible micro-dynamics. Generalizing Landauer's principle to arbitrary conserved quantities via metriplectic flows, we derive a universal bound for macroscopic computation. This yields physical metrics for intelligence and an operational analogue for consciousness, quantifying an agent's ability to extract work from the environment while minimizing its own dissipative dynamics. Applying CCE to the limits of physical observation, we model measurement as an active coarse-graining process rather than a passive projection. At the quantum scale, CCE recovers the Lindblad Master Equation, consistent with modelling decoherence as the dissipative exhaust required to record a measurement. Scaling to cosmological limits, we explore the hypothesis that gravity emerges as the macroscopic geometric footprint of these bounds. We show that, under this hypothesis, measurement-induced dissipation is consistent with a volumetric phase-space collapse, offering a dynamical route to the Bekenstein-Hawking area law. Equating the Landauer exhaust of this coarse-graining to horizon deformation outlines a limiting-case recovery of the Einstein Field Equations. Ultimately, by establishing a substrate-neutral link between thermodynamic dissipation, quantum measurement, and spacetime geometry, CCE provides physical constraints for understanding both natural and artificial intelligence.

This is a working manuscript that proposes a unified physical framework for studying intelligence. Several of the broader implications—particularly those bridging macroscopic information bounds with quantum and cosmological limits—are presented as formal hypotheses to be refined through extended proofs and empirical validation. The overarching goal is to anchor abstract computation in fundamental physical laws and, in doing so, establish rigorous, geometric constraints for the safe development of artificial intelligence.

Other Papers

Keyed Chaotic Dynamics For Privacy-Preserving Neural Inference

Click to expand

Peter David Fagan

Preprint, 2025

Project Page / arXiv

We introduce a framework for applying keyed chaotic dynamical systems to encrypt and decrypt tensors in machine learning pipelines. This lightweight, deterministic approach enables authenticated inference without modifying model architectures or requiring retraining. Designed for privacy-first AI, this method provides a new building block at the intersection of cryptography, dynamical systems, and neural computation.

Learning from Demonstration with Implicit Nonlinear Dynamics Models

Click to expand

Peter David Fagan, Subramanian Ramamoorthy

Preprint, 2024

Project Page / arXiv

We introduce a new recurrent neural network layer that incorporates fixed nonlinear dynamics models where the dynamics satisfy the Echo State Property. We show that this neural network layer is well suited to the task of overcoming compounding errors under the learning from demonstration paradigm. Through evaluating neural network architectures with/without our layer on the task of reproducing human handwriting traces we show that the introduced neural network layer improves task precision and robustness to perturbations all while maintaining a low computational overhead.

SECURE: Semantics-Aware Embodied Conversation Under Unawareness for Lifelong Robot Learning

Click to expand

Rimvydas Rubavicius, Peter David Fagan, Alex Lascarides, Subramanian Ramamoorthy

Preprint, 2025

Project Page / arXiv

In this work, we introduce an interactive task learning framework to cope with unforeseen possibilities by exploiting the formal semantic analysis of embodied conversation.

DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset

Click to expand

The DROID Dataset Team

Robotics: Science and Systems (R:SS), 2024

Project Page / arXiv

In this work, we introduce DROID (Distributed Robot Interaction Dataset), a diverse robot manipulation dataset comprising 76k demonstration trajectories or 350 hours of interaction data, collected across 564 scenes and 86 tasks by 50 data collectors in North America, Asia, and Europe over the course of 12 months.

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Click to expand

Open X-Embodiment Team

IEEE International Conference on Robotics and Automation (ICRA), May 2024

Project Page / arXiv

In this work, we introduce Open X-Embodiment, a comprehensive collection of robotic learning datasets and RT-X models. These datasets and models facilitate research in embodied AI by providing large-scale, diverse, and realistic environments for training robotic systems. The datasets cover a wide range of tasks and scenarios, enabling robots to learn complex behaviors through interaction with their environment.

Software

MoveIt 2 Python Library

Click to expand

Peter David Fagan

Google Summer of Code, 2022

Code

This is the official Python binding for the MoveIt 2 library.