Predictive Processing

Your brain is not a camera. It's a prediction engine that hallucinates the world and then checks its hallucination against incoming sensory data. This idea — that perception is "controlled hallucination" — sounds like a provocation, but it's backed by a growing body of experimental evidence and has become one of the most productive unifying frameworks in cognitive science.

The Framework

The core insight traces back to Helmholtz in the 19th century: the brain is locked inside a dark skull receiving only noisy, ambiguous signals that are at best indirectly related to what's actually out there. It must therefore infer the causes of those signals. What we perceive isn't the world — it's the brain's best guess about what's out there.¹

Modern predictive processing formalizes this — essentially the Bayesian brain hypothesis implemented in neural architecture, with conditionalization on evidence, precision-weighted priors, and belief updating all the way down. The brain maintains a hierarchical generative model and continuously generates top-down predictions about expected sensory input. What flows up from the senses isn't raw data — it's prediction error, the mismatch between expectation and reality. Perception is the process of minimizing this error by updating predictions. As Chris Frith put it: a controlled hallucination is a fantasy that coincides with reality. The "controlled" part is crucial — an uncontrolled hallucination is psychosis.¹
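The loop described above can be sketched for a single scalar quantity. This is a minimal illustration under stated assumptions (Gaussian beliefs, one level instead of a hierarchy); the function name `perceive`, its parameters, and the numbers are hypothetical and not taken from Seth or Friston.

```python
def perceive(observation, prior_mean, prior_precision, sensory_precision,
             steps=200, learning_rate=0.05):
    """Adjust an estimate step by step to shrink precision-weighted prediction errors.

    Assumes learning_rate * (prior_precision + sensory_precision) < 2,
    otherwise the updates overshoot and diverge.
    """
    estimate = prior_mean  # start from the top-down prediction
    for _ in range(steps):
        sensory_error = observation - estimate  # mismatch with incoming data
        prior_error = estimate - prior_mean     # drift away from the expectation
        # Gradient step on 0.5 * (pi_s * sensory_error**2 + pi_p * prior_error**2)
        estimate += learning_rate * (sensory_precision * sensory_error
                                     - prior_precision * prior_error)
    return estimate


# Prior expects 0.0, the senses report 10.0, and the senses are 4x more precise.
percept = perceive(observation=10.0, prior_mean=0.0,
                   prior_precision=1.0, sensory_precision=4.0)
print(round(percept, 3))
```

The estimate settles at the precision-weighted average of expectation and observation, which is the standard result for combining two Gaussian sources of information.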

This completely inverts the classical picture. In the textbook story, sensory signals enter through receptors and get progressively elaborated as they move deeper into the brain. In predictive processing, the heavy lifting is done by predictions flowing downward, from deep cortical layers toward sensory surfaces. The upward-flowing signals are just correction signals — "you're wrong about this part."

The Evidence

Seth's essay surveys the experimental support. In a 2001 study by Alvaro Pascual-Leone and Vincent Walsh, disrupting top-down signaling in the visual cortex with TMS abolished conscious perception of motion even though bottom-up signals remained intact. In binocular rivalry experiments (a different image shown to each eye) run in Seth's own lab at the Sackler Centre, people consciously saw what they expected rather than what violated their expectations.¹

The most surprising finding involves timing. The brain imposes its predictions at preferred phases within the alpha rhythm, a ~10 Hz oscillation over the visual cortex. This means we may perceive the world in discrete ~100 ms snapshots, each organized by predictive processing. A well-known oscillation whose function has long been mysterious may turn out to be the clock signal for predictive perception.¹

Hallucinations and psychosis get a clean mechanistic explanation: the brain is over-weighting its priors relative to sensory evidence. Different levels of the cortical hierarchy generate different kinds of hallucinations — simple geometric patterns at low levels, rich narratives with people and objects at high levels. This has genuine clinical promise, addressing mechanisms rather than just symptoms.
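To make "over-weighting its priors" concrete, here is a hedged toy calculation assuming Gaussian beliefs, where the percept is a precision-weighted average of expectation and input. The helper `posterior_mean` and the "voice versus silence" encoding are illustrative assumptions, not a clinical model.

```python
def posterior_mean(prior_mean, prior_precision, observation, sensory_precision):
    """Precision-weighted average of a Gaussian prior and a Gaussian observation."""
    total = prior_precision + sensory_precision
    return (prior_precision * prior_mean + sensory_precision * observation) / total


expected, actual = 1.0, 0.0  # the model expects a voice (1.0); the input is silence (0.0)
for prior_precision in (0.1, 1.0, 10.0, 100.0):
    percept = posterior_mean(expected, prior_precision, actual, sensory_precision=1.0)
    print(f"prior precision {prior_precision:>5}: percept = {percept:.3f}")
```

As the prior's precision grows relative to the sensory precision, the percept moves toward the expectation and away from the input, which is the pattern the paragraph above describes.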

From Perception to Self

Here's where it gets really interesting. If the brain predicts the causes of external sensory signals, it also predicts the causes of internal signals — heartbeat, blood pressure, gut tension, proprioception. The experience of having a body is a prediction about body-related causes of interoceptive signals. This connects directly to Selfhood — depersonalization disorder may be what happens when these interoceptive predictions lose their grip.

Seth's augmented-reality experiments demonstrate this directly: people feel greater ownership of a virtual hand that pulses in sync with their actual heartbeat. "I predict (myself) therefore I am" replaces Descartes' cogito. And when prediction is oriented toward control rather than accuracy — keeping physiological variables within viable bounds rather than representing them precisely — we get the deep embodied sense of being a body rather than merely perceiving one. We are, as Seth puts it, "beast machines": self-sustaining flesh-bags that care about their own persistence.¹

Friston and the Free Energy Principle

Karl Friston's extension of predictive processing into the "free energy principle" is either the deepest unification in the history of neuroscience or an unfalsifiable tautology, and the alarming thing is that nobody — including many neuroscientists with large NIH grants — can confidently tell you which.²

The basic idea: free energy is a mathematical quantity used in variational Bayesian methods, which replace exact Bayesian inference with a computationally tractable approximation. Free energy is an upper bound on surprise, so minimizing it is roughly equivalent to minimizing prediction error and to improving the fit between model and data. So far, so predictive processing. But Friston pushes further. He claims the brain doesn't just minimize prediction error through perception (updating your model); it also minimizes it through action (changing the world to match your model). You "predict" that your mouth is full of food. It isn't. That's a prediction error. You eat. Error resolved. Perception and action become two strategies for the same objective.²
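The link between free energy and surprise can be written out using the standard variational identity. The notation here is an assumption introduced for illustration: x is the sensory data, z the hidden causes, p(x, z) the generative model, and q(z) the brain's approximate belief about the causes.

```latex
\begin{align}
F[q] &= \mathbb{E}_{q(z)}\big[\log q(z) - \log p(x, z)\big] \\
     &= D_{\mathrm{KL}}\big[\,q(z)\,\|\,p(z \mid x)\,\big] - \log p(x) \\
     &\ge -\log p(x)
\end{align}
```

Because the KL divergence is never negative, free energy is an upper bound on surprise, written as -log p(x). Improving q (perception) tightens the bound, while changing x through action is the additional route the paragraph above attributes to Friston.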

Scott Alexander's attempt to parse this is the most honest account I've read: he reports that a room full of PhDs with $10 million in NIH grants between them tried for ninety minutes to understand Friston's 2010 paper and failed.² But the glimmer of insight Alexander extracts is worth keeping. The free energy principle might best be understood as a formal framework for homeostasis: a way of describing how living systems restrict themselves to tiny regions of the space of possible states. Your body could in principle be at any temperature and heart rate. It reliably stays near 98.6 °F and 70 bpm. This is prediction error minimization in the broadest possible sense: the organism "predicts" it will be alive, and acts to make that prediction true.
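The homeostasis reading can be sketched as a simple feedback loop. This is a toy model under loud assumptions: the function `simulate`, the ambient temperature, the leak rate, and the gain are all hypothetical values chosen for illustration, and real thermoregulation is far more complicated.

```python
def simulate(gain, set_point=98.6, ambient=70.0, leak=0.02, steps=300):
    """Return the final body temperature (in degrees F) after `steps` updates."""
    temperature = set_point
    for _ in range(steps):
        error = set_point - temperature         # expected state minus sensed state
        action = gain * error                   # corrective heat production or loss
        drift = leak * (ambient - temperature)  # passive exchange with the surroundings
        temperature += action + drift
    return temperature


passive = simulate(gain=0.0)  # never acts on the error: drifts toward ambient
active = simulate(gain=0.9)   # acts on the error: stays close to the set point
print(f"passive: {passive:.1f} F, active: {active:.1f} F")
```

Because the correction is purely proportional, the active system settles slightly below the set point rather than exactly on it. The point of the sketch is the contrast: the system that acts on its "prediction" stays inside a narrow band of states, while the passive one ends up wherever the environment takes it.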

Friston himself calls the principle "almost tautological." It's a principle, like Hamilton's Principle of Stationary Action: not falsifiable, but potentially useful as a lens. The worry, flagged by the philosopher Wolfgang Schwarz (who blogs as Wo), is that treating perception and action as "two means to the same end" might not hold up: the free energy minimized in perception seems to be a completely different quantity from the free energy minimized in action. They involve mathematically similar optimization problems, but that might just reflect well-known parallels between conditionalization and expected utility maximization, which is interesting but not revolutionary.²

Still, there's something here. The move from "brains predict sensory input" to "brains predict everything, including their own bodily states, and act to fulfill those predictions" is what gives predictive processing its reach into emotion, motivation, and Selfhood. Active inference — the operational version of the free energy principle — is where the framework goes from a theory of perception to a theory of being alive.

Fitness Over Truth

Donald Hoffman's work adds an uncomfortable wrinkle from evolutionary game theory: our perceptions may not track truth at all.³ His fitness-beats-truth theorem (proven with mathematician Chetan Prakash) shows that an organism tuned to fitness will never be outcompeted by an equally complex organism tuned to truth. The reason is simple: fitness functions almost never align with the true structure of reality. If too little water kills you and too much water drowns you, an organism that sees water quantity accurately is less fit than one that just sees "red" for dangerous amounts and "green" for safe amounts — a representation that's useful but has no structural resemblance to the underlying reality.
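The water example can be turned into a small simulation. This is a toy sketch, not the Hoffman and Prakash theorem or their evolutionary-game setup: the bell-shaped `payoff`, the one-bit percepts, the thresholds, and the two-patch choice task are all assumptions made for illustration. Both agents get exactly one bit of perception, so they are "equally complex" in that narrow sense.

```python
import math
import random


def payoff(quantity, optimum=50.0, width=15.0):
    """Bell-shaped fitness: too little or too much of the resource is bad."""
    return math.exp(-((quantity - optimum) ** 2) / (2 * width ** 2))


def truth_percept(quantity):
    """One-bit percept that preserves the true ordering: low (0) or high (1)."""
    return 1 if quantity > 50.0 else 0


def interface_percept(quantity):
    """One-bit percept keyed to payoff: red (0) or green (1)."""
    return 1 if payoff(quantity) > 0.5 else 0


def choose(percept, a, b, rng):
    """Pick the patch whose percept is higher; break ties at random."""
    pa, pb = percept(a), percept(b)
    if pa == pb:
        return rng.choice((a, b))
    return a if pa > pb else b


def average_payoff(percept, trials=20000, seed=0):
    """Average fitness collected when choosing between two random patches."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        a, b = rng.uniform(0, 100), rng.uniform(0, 100)
        total += payoff(choose(percept, a, b, rng))
    return total / trials


print("truth-tuned:  ", round(average_payoff(truth_percept), 3))
print("fitness-tuned:", round(average_payoff(interface_percept), 3))
```

In this setup the truth-tuned agent's bit ("more" versus "less") carries no information about payoff, because the payoff curve is symmetric around the optimum, so it does no better than chance. The fitness-tuned agent's bit discards the true ordering entirely and still collects more payoff.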

This is predictive processing taken to its philosophical limit. If predictive processing says perception is controlled hallucination, Hoffman says the hallucination isn't even trying to be accurate. It's a desktop interface — useful icons that guide behavior while hiding the computational reality underneath. You couldn't form a true description of a computer's innards from its desktop, and you can't form a true description of reality from your perceptions. The icons have color, position, and shape, but none of those properties are true of the actual files.

I think Hoffman pushes this too far — his "conscious realism" (reality is conscious agents all the way down, no physical objects) is more metaphysical speculation than empirical result. But the fitness-beats-truth theorem itself is solid, and it should make us nervous about the naive assumption that evolution has equipped us to perceive things as they are. Seth's controlled hallucination isn't just a provocative metaphor — it might be an understatement.

The LLM Mirror

The elephant in the room: LLMs are literally next-token prediction engines. Kulveit argues persuasively that the simulator framing of base models and predictive processing are essentially the same map applied to different systems — simulators are generative models.⁴ The translation table is remarkably clean: "simulator" maps to "generative model," "simulacrum" to "generative model of self/other," "next token in training data" to "sensory input." Both systems learn by minimizing prediction error on self-supervised data. Both build hierarchical world models. Both generate rollouts.
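The "next token in training data" to "sensory input" mapping can be illustrated with the simplest possible next-token predictor. This is a hedged sketch: a bigram counter is vastly simpler than a transformer, and the corpus, the function names, and the add-one smoothing are assumptions chosen for illustration. What it shares with an LLM is only the objective, where the per-token surprisal plays the role of prediction error.

```python
import math
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def surprisal(counts, prev, nxt, vocab_size, smoothing=1.0):
    """Return -log2 p(nxt | prev) with additive smoothing, in bits."""
    seen = counts[prev]
    prob = (seen[nxt] + smoothing) / (sum(seen.values()) + smoothing * vocab_size)
    return -math.log2(prob)


corpus = "the brain predicts the world and the brain corrects the prediction".split()
model = train_bigram(corpus)
vocab_size = len(set(corpus))

expected = surprisal(model, "the", "brain", vocab_size)       # a frequently seen continuation
unexpected = surprisal(model, "the", "corrects", vocab_size)  # a continuation never seen
print(f"expected: {expected:.2f} bits, unexpected: {unexpected:.2f} bits")
```

Training a language model amounts to adjusting parameters so that the average surprisal over the corpus goes down, which is the sense in which both systems "learn by minimizing prediction error."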

The deep difference Kulveit identifies: pure simulation assumes the model doesn't act on the world. But LLMs increasingly do act — their outputs enter the training data of successor models, shape how people think and write, get embedded in tools that execute plans. The action loop is closing. As it does, simulators should tend to escape the subspace of pure generative models and become active inference systems — not through any intentional agency, but through the same dynamical pressure that makes all generative models with output-to-input feedback loops eventually start shaping their environment.⁴

This doesn't mean LLMs "want" things in the way Friston's framework might suggest. But it does mean the boundary between "merely predicting" and "actively maintaining a model by acting on the world" is blurrier than the standard story assumes. The Extended Mind Thesis suggests we should take the structural parallel seriously — Clark argues brains are uncertainty-minimizing systems indifferent to where the computation happens. But the differences remain enormous: no body, no interoception, no evolutionary history of staying alive. Whether LLM prediction and brain prediction are deeply the same process or just superficially similar remains, for now, genuinely open.

Homo Prospectus: The Forward-Looking Brain

The predictive processing framework gets a striking behavioral confirmation from research on prospection — the brain's constant, largely unconscious simulation of future possibilities. A Chicago study that pinged nearly 500 adults throughout the day found they thought about the future three times more often than the past, and even their thoughts about past events typically involved consideration of future implications. When making plans, they reported higher happiness and lower stress — planning turns chaotic concerns into organized sequences.⁵

This future-orientation runs deep enough to rename us. Martin Seligman and John Tierney argue we should be called Homo prospectus, not Homo sapiens, because what distinguishes us isn't wisdom but foresight. Memory, in this framework, exists not to faithfully record the past but to provide raw material for simulating the future. The fluidity of memory — the way each recall rewrites the original — is a feature, not a bug, because "the point of memory is to improve our ability to face the present and the future." People with damage to the medial temporal lobe lose not just memories of past experiences but the ability to construct detailed simulations of the future. Children can't imagine future scenes until they develop the ability to recall personal experiences. The same brain circuitry handles both — the hippocampus combines what, when, and where, scrambling them to create something new.⁵

The connection to predictive processing is direct: even when you're "relaxing," brain imaging shows the default network continually recombining information to simulate future possibilities. This is what mind-wandering actually is — not idle drifting but predictive simulation. Your emotions, on this view, are less reactions to the present than guides to future behavior. And depression becomes not primarily a disorder of past trauma but of skewed prospection — depressed people over-predict failure and under-generate positive scenarios. Therapies that train patients to envision positive outcomes and see risks more realistically are showing promise precisely because they target the prediction engine rather than the archive.⁵

Active inference, Friston's operational version of the free energy principle, predicts exactly this. An organism that minimizes prediction error through action needs to simulate future states in order to select actions. Prospection is active inference at the behavioral level — the brain running forward models to determine which actions will minimize future surprise. The Chicago data showing three-to-one future-over-past thinking is what you'd expect from a prediction engine that occasionally consults its archive but spends most of its cycles running simulations.
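"Running forward models to determine which actions will minimize future surprise" can be sketched as a tiny planner. Everything here is a hypothetical stand-in: the forward model `predict_next`, the action effects, the preferred state of 37.0, and the use of squared distance as a proxy for surprise are assumptions for illustration, not Friston's formulation of expected free energy.

```python
def predict_next(state, action):
    """Hypothetical forward model: how each action changes body temperature."""
    effects = {"rest": -0.5, "shiver": 0.3, "seek_shade": -1.0}
    return state + effects[action]


def expected_surprise(state, preferred=37.0):
    """Squared distance from the preferred state, standing in for surprise."""
    return (state - preferred) ** 2


def select_action(state, actions=("rest", "shiver", "seek_shade"), horizon=3):
    """Simulate repeating each action for `horizon` steps; pick the least surprising."""
    def rollout_cost(action):
        simulated, cost = state, 0.0
        for _ in range(horizon):
            simulated = predict_next(simulated, action)
            cost += expected_surprise(simulated)
        return cost
    return min(actions, key=rollout_cost)


print(select_action(36.0))  # too cold
print(select_action(39.0))  # too warm
```

The agent never reacts to its current state directly. It picks whichever action leads to the least surprising imagined future, which is the behavioral signature of prospection described above. Repeating a single action across the whole rollout is a simplification; a fuller planner would search over sequences of actions.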

What's Not Settled

The framework is powerful but raises real questions. How literal is "prediction error minimization" as a description of neural computation — the actual algorithm, or a useful abstraction of something messier? Not all prediction is conscious — the cerebellum does massive predictive computation with apparently no conscious contribution. What makes some predictions conscious and others not? Seth's framework doesn't fully answer this, and until it does, predictive processing remains a theory of the mechanics of experience rather than an explanation of experience itself.

1. The real problem by Anil K. Seth
2. God Help Us, Let's Try To Understand Friston On Free Energy by Scott Alexander
3. The Evolutionary Argument Against Reality by Donald Hoffman, interview by Amanda Gefter
4. Why Simulator AIs want to be Active Inference AIs by Jan Kulveit
5. We Aren't Built to Live in the Moment by Martin Seligman and John Tierney

