Integrated information theory (IIT) 4.0: Formulating the properties of phenomenal existence in physical terms
Larissa Albantakis 1 Y , Leonardo Barbosa 1,2 Y , Graham Findlay 1,3 Y , Matteo Grasso 1 Y , Andrew M Haun 1 Y , William Marshall 1,4 Y , William GP Mayner 1,3 Y , Alireza Zaeemzadeh 1 Y , Melanie Boly 1,5 , Bjørn E Juel 1,6 , Shuntaro Sasai 1,7 , Keiko Fujii 1 , Isaac David 1 , Jeremiah Hendren 1,8 , Jonathan P Lang 1 , Giulio Tononi 1*
- 1 Department of Psychiatry, University of Wisconsin, Madison, WI 53719, USA
- 2 Fralin Biomedical Research Institute at VTC, Virginia Tech, Roanoke, VA 24016, USA
- 3 Neuroscience Training Program, University of Wisconsin, Madison, WI 53705, USA
- 4 Department of Mathematics and Statistics, Brock University, St. Catharines, ON L2S 3A1, Canada
- 5 Department of Neurology, University of Wisconsin, Madison, WI 53719, USA
- 6 Institute of Basic Medical Sciences, University of Oslo, Oslo, 0372, Norway
- 7 Araya Inc., Tokyo, 107-0052, Japan
- 8 Graduate School Language & Literature, Ludwig Maximilian University of Munich, Munich, 80799, Germany
- Y These authors contributed equally to this work.
* gtononi@wisc.edu
## Abstract
This paper presents Integrated Information Theory (IIT) 4.0. IIT aims to account for the properties of experience in physical (operational) terms. It identifies the essential properties of experience (axioms), infers the necessary and sufficient properties that its substrate must satisfy (postulates), and expresses them in mathematical terms. In principle, the postulates can be applied to any system of units in a state to determine whether it is conscious, to what degree, and in what way. IIT offers a parsimonious explanation of empirical evidence, makes testable predictions, and permits inferences and extrapolations. IIT 4.0 incorporates several developments of the past ten years, including a more accurate translation of axioms into postulates and mathematical expressions, the introduction of a unique measure of intrinsic information that is consistent with the postulates, and an explicit assessment of causal relations. By fully unfolding a system's irreducible cause-effect power, the distinctions and relations specified by a substrate can account for the quality of experience.
## Author summary
IIT aims to account for consciousness and its properties in physical terms. The theory identifies the essential properties of experience (axioms), formulates them as physical properties in terms of cause-effect power (postulates), and provides a mathematical formalism for assessing those properties. This formalism can be employed to 'unfold' a cause-effect structure from a substrate constituted of units in a state, whose interactions can be characterized as interventional conditional probabilities. According to IIT, all the properties of an experience can be accounted for, in physical terms, by
those of a cause-effect structure that satisfies its postulates. The theory is consistent with neurological data and has led to successful experimental predictions. As the latest iteration of the theory, IIT 4.0 incorporates several developments pursued over the past ten years.
## Introduction
A scientific theory of consciousness should account for experience, which is subjective, in objective terms [1]. Being conscious-having an experience-is understood to mean that 'there is something it is like to be' [2]: something it is like to see a blue sky, hear the ocean roar, dream of a friend's face, imagine a melody flow, contemplate a choice, or reflect on the experience one is having.
IIT aims to account for phenomenal properties-the properties of experience-in physical terms. IIT's starting point is experience itself rather than its behavioral, functional, or neural correlates [1]. Furthermore, in IIT 'physical' is meant in a strictly operational sense-in terms of what can be observed and manipulated.
The starting point of IIT is the existence of an experience, which is immediate and irrefutable [3,4]. From this 'zeroth' axiom, IIT sets out to identify the essential properties of consciousness-those that are immediate and irrefutably true of every conceivable experience. These are IIT's five axioms of phenomenal existence: every experience is for the experiencer (intrinsicality), specific (information), unitary (integration), definite (exclusion), and structured (composition).
Unlike phenomenal existence, which is immediate and irrefutable (an axiom), physical existence is an explanatory construct (a postulate) and it is assessed operationally from within consciousness: in physical terms, to be is to have cause-effect power (see Box 2: Principle of being). In other words, something can be said to exist physically if it can 'take and make a difference'-bear a cause and produce an effect-as judged by a conscious observer/manipulator.
The next step of IIT is to formulate the essential phenomenal properties (the axioms) in terms of corresponding physical properties (the postulates). This formulation is an 'inference to a good explanation' and rests on basic assumptions such as realism, physicalism, and atomism (see Box 1: Methodological guidelines of IIT). If IIT is correct, the substrate [5] of consciousness, beyond having cause-effect power (existence), must satisfy all five essential phenomenal properties in physical terms: its cause-effect power must be for itself (intrinsicality), specific (information), unitary (integration), definite (exclusion), and structured (composition).
On this basis, IIT proposes a fundamental explanatory identity: an experience is identical to the cause-effect structure unfolded from a maximal substrate (defined below). Accordingly, all the specific phenomenal properties of any experience must have a good explanation in terms of the specific physical properties of the corresponding cause-effect structure, with no additional ingredients.
IIT formulates the postulates in a mathematical framework that is in principle applicable to general models of interacting units (but see [6]). A mathematical framework is needed (i) to evaluate whether the theory is self-consistent and compatible with our overall knowledge about the world, (ii) to make specific predictions regarding the quality and quantity of our experiences and their substrate within the brain, and (iii) to extrapolate from our own consciousness to infer the presence (or absence) and nature of consciousness in beings different from ourselves.
Ultimately, the theory should account for why our consciousness depends on certain portions of the world and their state, such as certain regions of the brain and not others, and for why it fades during dreamless sleep, even though the brain remains active. It should also account for why an experience feels the way it does-why the sky feels
extended, why a melody feels flowing in time, and so on. Moreover, the theory makes several predictions concerning both the presence and the quality of experience, some of which have been and are being tested empirically [7].
While the main tenets of the theory have remained the same, its formal framework has been progressively refined and extended [8-11]. Compared to IIT 1.0 [8,9], 2.0 [10,12], and 3.0 [11], IIT 4.0 presents a more complete, self-consistent formulation and incorporates several recent advances [13-16]. Chief among them are a more accurate translation of the axioms into postulates and mathematical expressions, the introduction of an Intrinsic Difference (ID) measure [15,17] that is uniquely consistent with IIT's postulates, and the explicit assessment of causal relations [14].
In what follows, after introducing IIT's axioms and postulates, we provide its updated mathematical formalism. In the 'Results and discussion' section, we apply the mathematical framework of IIT to representative examples and discuss some of their implications. The article is meant as a reference for the theory's mathematical formalism, a concise demonstration of its internal consistency, and an illustration of how a substrate's cause-effect structure is unfolded computationally. A discussion of the theory's motivation, its axioms and postulates, and its assumptions and implications can be found in a forthcoming book [4] and wiki [18] as well as in several publications [1,19-24]. A survey of the explanatory power and experimental predictions of IIT can be found in [7]. The way IIT's analysis of cause-effect power can be applied to actual causation, or 'what caused what,' is presented in [13].
## From phenomenal axioms to physical postulates
## Axioms of phenomenal existence
That experience exists-that 'there is something it is like to be'-is immediate and irrefutable, as everybody can confirm, say, upon awakening from dreamless sleep. Phenomenal existence is immediate in the sense that my experience is simply there, directly rather than indirectly: I do not need to infer its existence from something else. It is irrefutable because the very doubting that my experience exists is itself an experience that exists-the experience of doubting [1,3]. Thus, to claim that my experience does not exist is self-contradictory or absurd. The existence of experience is IIT's zeroth axiom.
Existence Experience exists : there is something .
Traditionally, an axiom is a statement that is assumed to be true, cannot be inferred from any other statement, and can serve as a starting point for inferences. The existence of experience is the ultimate axiom-the starting point for everything, including logic and physics.
On this basis, IIT proceeds by considering whether experience-phenomenal existence-has some axiomatic or essential properties, properties that are immediate and irrefutably true of every conceivable experience. Drawing on introspection and reason, IIT identifies the following five:
Intrinsicality Experience is intrinsic : it exists for itself .
Information Experience is specific : it is the way it is .
Integration Experience is unitary : it is a whole , irreducible to separate experiences.
Exclusion Experience is definite : it is this whole.
Composition Experience is structured : it is composed of distinctions and the relations that bind them together, yielding a phenomenal structure .
To exemplify, if I awaken from dreamless sleep, and experience the white wall of my room, my bed, and my body, the experience not only exists, immediately and irrefutably, but 1) it exists for me, 2) it is specific (the wall is a wall and it is white), 3) it is unitary (the left side is not experienced separately from the right side, and vice versa), 4) it is definite (it includes the visual scene in front of me-neither less, say, its left side only, nor more, say, the wall behind my head), 5) it is structured by distinctions (the wall, the bed, the body) and relations (the body is on the bed, the bed in the room).
The axioms are not only immediately given, but they are irrefutably true of every conceivable experience. For example, once properly understood, the unity of experience cannot be refuted. Trying to conceive of an experience that were not unitary leads to conceiving of two separate experiences, each of which is unitary, which reaffirms the validity of the axiom. Even though each of the axioms spells out an essential property in its own right, the axioms must be considered together to properly characterize phenomenal existence.
IIT takes the above set of axioms to be complete: there are no further properties of experience that are essential. Other properties that might be considered as candidates for axiomatic status include space (experience typically takes place in some spatial frame), time (an experience usually feels like it flows from a past to a future), change (an experience usually transitions or flows into another), subject-object distinction (an experience seems to involve both a subject and an object), intentionality (experiences usually refer to something in the world, or at least to something other than the subject), a sense of self (many experiences include a reference to one's body or even to one's narrative self), figure-ground segregation (an experience usually includes some object and some background), situatedness (an experience is often bound to a time and a place), will (experience offers the opportunity for action), and affect (experience is often colored by some mood), among others. However, experiences lacking each of these candidate properties are conceivable-that is, conceiving of them does not lead to self-contradiction or absurdity. They are also achievable, as revealed by altered states of consciousness reached through dreaming, meditative practices, or drugs.
## Postulates of physical existence
To account for the many regularities of experience (Box 1), it is a good inference to assume the existence of a world that persists independently of one's experience ( realism ). From within consciousness, we can probe the physical existence of things outside of our experience operationally-through observations and manipulations. To be granted physical existence, something should have the power to 'take a difference' (be affected) and 'make a difference' (produce effects) in a reliable way ( physicalism ). IIT also assumes operational reductionism: ideally, to establish what exists in physical terms, one would start from the smallest units that can take and make a difference, so that nothing is left out ( atomism ).
By characterizing physical existence operationally as cause-effect power, IIT can proceed to translate the axioms of phenomenal existence into postulates of physical existence. This establishes the requirements for the substrate of consciousness , where 'substrate' is meant operationally as a set of units that can be observed and manipulated.
Existence The substrate of consciousness must have cause-effect power : its units must take and make a difference .
Building from this 'zeroth' postulate, IIT formulates the five axioms in terms of postulates of physical existence:
- Intrinsicality The substrate of consciousness must have intrinsic cause-effect power: it must take and make a difference within itself .
- Information The substrate of consciousness must have specific cause-effect power: it must select a specific cause-effect state .
This state is the one with maximal intrinsic information ( ii ), a measure of the difference a system takes or makes over itself for a given cause state and effect state.
- Integration The substrate of consciousness must have unitary cause-effect power: it must specify its cause-effect state as a whole set of units, irreducible to separate subsets of units.
Irreducibility is measured by integrated information ( ϕ ) over the substrate's minimum partition.
- Exclusion The substrate of consciousness must have definite cause-effect power: it must specify its cause-effect state as this set of units.
This is the set of units that is maximally irreducible, as measured by maximum ϕ ( ϕ ∗ ). This set is called a maximal substrate , also known as complex [11, 16].
- Composition The substrate of consciousness must have structured cause-effect power: subsets of its units must specify cause-effect states over subsets of units ( distinctions ) that can overlap with one another ( relations ), yielding a cause-effect structure or Φ -structure ('Phi-structure').
Distinctions and relations, in turn, must also satisfy the postulates of physical existence: they must have cause-effect power, within the substrate of consciousness, in a specific, unitary, and definite way (they do not have components, being components themselves). They thus have an associated ϕ value. The Φ -structure unfolded from a complex corresponds to the quality of consciousness. The sum total of the ϕ values of the distinctions and relations that compose the Φ -structure measures its structured information Φ ('big Phi') and corresponds to the quantity of consciousness.
According to IIT, the physical properties characterized by the postulates are necessary and sufficient for an entity to be conscious. They are necessary because they are needed to account for the properties of experience that are essential, in the sense that it is inconceivable for an experience to lack any one of them. They are also sufficient because no additional property of experience is essential, in the sense that it is conceivable for an experience to lack that property. Thus, no additional physical property is a necessary requirement for being a substrate of consciousness.
The postulates of IIT have been and are being applied to account for the location of the substrate of consciousness in the brain [7] and for its loss and recovery in physiological and pathological conditions [25,26].
## The explanatory identity between experiences and Φ -structures
Having determined the necessary and sufficient conditions for a substrate to support consciousness, IIT proposes an explanatory identity: every property of an experience is accounted for in full by the physical properties of the Φ -structure unfolded from a maximal substrate (a complex) in its current state, with no further or 'ad hoc' ingredients. That is, there must be a one-to-one correspondence between the way the experience feels and the way distinctions and relations are structured. Importantly, the
identity is not meant as a correspondence between the properties of two separate things. Instead, the identity should be understood in an explanatory sense: the intrinsic (subjective) feeling of the experience can be explained extrinsically (objectively, i.e. , operationally or physically) in terms of cause-effect power [27].
The explanatory identity has been applied to account for how space feels (spatial extendedness) and which neural substrates may account for it [14]. Ongoing work is applying the identity to provide a basic account of the feeling of temporal flow [28] and that of objects [29].
## Box 1. Methodological guidelines of IIT
## Inference to a good explanation
We should generally assume that an explanation is good if it can account for a broad set of facts ( scope ), does so in a unified manner ( synthesis ), can explain facts precisely ( specificity ), is internally coherent ( self-consistency ), is coherent with our overall understanding of things ( system consistency ), is simpler than alternatives ( simplicity ), and can make testable predictions ( scientific validation ). For example, IIT 4.0 aims at expressing the postulates of intrinsicality, information, integration, and exclusion in a self-consistent manner when applied to systems, causal distinctions, and relations (see formulas).
## Realism
We should assume that something exists (and persists) independently of our own experience. This is a much better hypothesis than solipsism, which explains nothing and predicts nothing. Although IIT starts from our own phenomenology, it aims to account for the many regularities of experience in a way that is fully consistent with realism.
## Operational physicalism
To assess what exists independently of our own experience, we should employ an operational criterion: we should systematically observe and manipulate a substrate's units and determine that they can indeed take and make a difference in a way that is reliable and persisting. Doing so demonstrates a substrate's cause-effect power-the signature of physical existence. Ideally, cause-effect power is fully captured by a substrate's transition probability matrix (TPM) (1). This assumption is embedded in IIT's zeroth postulate.
## Operational reductionism ('atomism')
Ideally, we should account for what exists physically in terms of the smallest units we can observe and manipulate, as captured by unit TPMs. Doing so would leave nothing unaccounted for. IIT assumes that in principle it should be possible to account for everything purely in terms of cause-effect power-cause-effect power 'all the way down' to conditional probabilities between atomic units. Eventually, this would leave neither room nor need to assume intrinsic properties or laws.
## Intrinsic perspective
When accounting for experience itself in physical terms, existence should be evaluated from the intrinsic perspective of an entity-what exists for the entity itself, not from the perspective of an external observer. This assumption is embedded in IIT's postulate of intrinsicality and has several consequences. One is that, from the intrinsic perspective, the quality and quantity of existence must be observer-independent and cannot be arbitrary. For instance, information in IIT must be relative to the specific state the entity is in, rather than an average of states as assessed by an external observer. Similarly, it should be evaluated based on the uniform distribution of possible states, as captured by the entity's TPM (1), rather than on an observed probability distribution. By the same token, units outside the entity should be treated as fixed background conditions that do not contribute directly to what the system is. The intrinsic perspective also imposes a tension between expansion and dilution (see below and [15,17]): from the intrinsic perspective of a system (or a mechanism within the system), having more units may increase its informativeness (cause-effect power measured as deviation from chance), while at the same time diluting its selectivity (ability to concentrate cause-effect power over a specific state).
## Overview of IIT's framework
IIT 4.0 aims at providing a formal framework to characterize the cause-effect structure of a substrate in a given state by expressing IIT's postulates in mathematical terms. In line with operational physicalism (Box 1), we characterize a substrate by the transition probability function of its constituting units.
On this basis, the IIT formalism first identifies sets of units that fulfill all required properties of a substrate of consciousness according to the postulates of physical existence. First, for a candidate system, we determine a maximal cause-effect state based on the intrinsic information ( ii ) that the system in its current state specifies over its possible cause states and effect states. We then determine the maximal substrate based on the integrated information ( ϕ s ) of the maximal cause-effect state. To qualify as a substrate of consciousness, a candidate system must specify a maximum of integrated information ( ϕ ∗ s ) compared to all competing candidate systems with overlapping units.
The second part of the IIT formalism unfolds the cause-effect structure specified by a maximal substrate in its current state, its Φ -structure . To that end, we determine the distinctions and relations specified by the substrate's subsets according to the postulates of physical existence. Distinctions are cause-effect states specified over subsets of substrate units ( purviews ) by subsets of substrate units ( mechanisms ). Relations are congruent overlaps among distinctions' cause and/or effect states. Distinctions and relations are also characterized by their integrated information ( ϕ d , ϕ r ). The Φ -structure they compose corresponds to the quality of the experience specified by the substrate; the sum of their ϕ d/r values corresponds to its quantity ( Φ ).
While IIT must still be considered as work in progress, having undergone successive refinements, IIT 4.0 is the first formulation of IIT that strives to characterize Φ -structures completely and to do so based on measures that satisfy the postulates uniquely. For a comparison of the updated framework with IIT 1.0, 2.0, and 3.0, see A.2.
## Substrates, transition probabilities, and cause-effect power
IIT takes physical existence as synonymous with having cause-effect power, the ability to take and make a difference. Consequently, a substrate U with state space Ω U is operationally defined by its potential interactions, assessed in terms of conditional probabilities (physicalism, Box 1). We denote the complete transition probability function of a substrate U over a system update u → ¯ u as
<!-- formula-not-decoded -->
A substrate in IIT can be described as a stochastic system U = { U 1 , U 2 , . . . , U n } of n interacting units with state space Ω U = ∏ i Ω U i and current state u ∈ Ω U . We assume that the system updates in discrete steps, that the state space Ω U is finite, and that the individual random variables U i ∈ U are conditionally independent from each other given the preceding state of U :
<!-- formula-not-decoded -->
Finally, we assume a complete description of the substrate, which means that we can determine the conditional probabilities in (2) for every system state, with p (¯ u | u ) = p (¯ u | do( u )) [13, 30-32], where the 'do-operator' do( u ) indicates that u is imposed by intervention. This implies that U must correspond to a causal network [13], and T U is a transition probability matrix (TPM) of size | Ω U | [33].
The TPM T U , which forms the starting point of IIT's analysis, serves as an overall description of a system's cause-effect power: what is the probability that the system will transition into each of its possible states upon an intervention that initializes it into every possible state (Figure 1)? (Notably, there is no additional role for intrinsic physical properties or laws of nature.) In practice, a causal model will be neither complete nor atomic (capturing the smallest units that can be observed and manipulated), but will capture the relevant features of what we are trying to explain and predict [34].
In the 'Results and discussion' section, the IIT formalism will be applied to extremely simple, simulated networks, rather than causal models of actual substrates. The cause-effect structures derived from these simple networks only serve as convenient illustrations of how a hypothetical substrate's cause-effect power can be unfolded.
## Implementing the postulates
In what follows, our goal is to evaluate whether a hypothetical substrate (also called 'system') satisfies all the postulates of IIT. To that end, we must verify whether the system has cause-effect power that is intrinsic, specific, integrated, definite, and structured.
## Existence
According to IIT, existence understood as cause-effect power requires the capacity to both take and make a difference (see Box 2, Principle of being). On the basis of a complete description of the system in terms of interventional conditional probabilities ( T U ) (1), cause-effect power can be measured as causal informativeness . Cause informativeness measures how much a potential cause increases the probability of the current state, and effect informativeness how much the current state increases the probability of a potential effect (as compared to chance).
## Intrinsicality
Building upon the existence postulate, the intrinsicality postulate further requires that a system exerts cause-effect power within itself . In general, the systems we want to evaluate are open systems S ⊆ U that are part of a larger 'universe' U . From the intrinsic perspective of a system S (see Box 1), the set of the remaining units W = U \ S merely act as background conditions, whose state can be considered as fixed. This is enforced by causally conditioning the larger TPM ( T U ) on the current state W = w , which makes W causally inert.
## Information
The information postulate requires that a system's cause-effect power be specific: the system in its current state must select a specific cause-effect state for its units. Based on the principle of maximal existence (Box 2), this is the state for which intrinsic information is maximal-the maximal cause-effect state . Intrinsic information ( ii ) measures the difference a system takes or makes over itself for a given cause and effect state as the product of informativeness and selectivity. As we have seen (existence), informativeness quantifies the causal power of a system in its current state as a reduction of uncertainty with respect to chance. Selectivity measures how much cause-effect power is concentrated over that specific cause or effect state. Selectivity is reduced by uncertainty in the cause or effect state with respect to other potential cause and effect states.
From the intrinsic perspective of the system, the product of informativeness and selectivity leads to a tension between expansion and dilution , whereby a system comprising more units may show increased deviation from chance but decreased concentration of cause-effect power over a specific state [15,17].
## Integration
By the integration postulate, it is not sufficient for a system to have cause-effect power within itself and select a specific cause-effect state: it must also specify its maximal cause-effect state in a way that is irreducible. This can be assessed by partitioning the set of units that constitute the system into separate parts. The system integrated information ( ϕ s ) then quantifies how much the intrinsic information specified by the maximal state is reduced due to the partition [35]. Integrated information is evaluated over the partition that makes the least difference, the minimum partition (MIP), in accordance with the principle of minimal existence (see Box 2).
Integrated information is highly sensitive to the presence of fault lines -partitions that separate parts of a system that interact weakly or directionally [16].
## Exclusion
Many overlapping sets of units may have a positive value of integrated information ( ϕ s ). However, the exclusion postulate requires that the substrate of consciousness must be constituted of a definite set of units, neither less nor more. Moreover, units, updates, and states must have a definite grain. Operationally, the exclusion postulate is enforced by selecting the set of units that maximizes integrated information over itself ( ϕ ∗ s ), based again on the principle of maximal existence (see Box 2). That set of units is called a maximal substrate , or complex . Over a universal substrate, sets of units for which integrated information is maximal compared to all competing candidate systems with overlapping units can be assessed recursively (by identifying the first complex, then the second complex, and so on).
## Composition
Once a complex has been identified, composition requires that we characterize its cause-effect structure by considering all its subsets and fully unfolding its cause-effect power.
Usually, causal models are conceived in holistic terms, as state transitions of the system as a whole (1), or in reductionist terms, as a description of the individual units of the system and their interactions (2) [36]. However, to account for the structure of experience, considering only the cause-effect power of the individual units or of the system as a whole would be insufficient [20,36]. Instead, by the composition postulate, we have to evaluate the system's cause-effect structure by considering the cause-effect power of its subsets as well as their causal relations.
To contribute to the cause-effect structure of a complex, a system subset must both take and make a difference (as required by existence) within the system (as required by intrinsicality). A subset M ⊆ S in state M = m is called a mechanism if it links a cause and effect state over subsets of units Z c/e ⊆ S , called purviews . A mechanism together with the cause and effect state it specifies is called a causal distinction . Distinctions are evaluated based on whether they satisfy all the postulates of IIT (except for composition). For every mechanism, the cause-effect state is the one having maximal intrinsic information ( ii ), and the cause and effect purviews are those yielding the maximum value of integrated information ( ϕ d ) within the complex-that is, those that are maximally irreducible.
Distinctions whose cause or effect states overlap congruently within the system (over the same subset of units in the same state) are bound together by causal relations . Relations also have an associated value of integrated information ( ϕ r ), corresponding to their irreducibility.
The distinctions (and associated relations) that exist for the complex are only those whose cause-effect state is congruent with the cause-effect state of the complex as a whole. Together, those distinctions and relations compose the cause-effect structure of the complex in its current state. The cause-effect structure specified by a complex is called a Φ -structure . The sum of its distinction and relation integrated information amounts to the structured information ( Φ ) of the complex.
In the following, we will provide a formal account of the IIT analysis. The first part demonstrates how to identify complexes. This requires that we (a) determine the cause-effect state of a system in its current state, (b) evaluate the system integrated information ( ϕ s ) over that cause-effect state, and (c) search iteratively for maxima of integrated information ( ϕ ∗ s ) within a universe. The second part describes how the postulates of IIT are applied to unfold the cause-effect structure of a complex. This requires that we identify the causal distinctions specified by subsets of units within the complex and the causal relations determined by the way distinctions overlap, yielding the system's cause-effect structure and its structured information Φ .
## Box 2. Ontological principles of IIT
## Principle of being
The principle of being states that to be is to have cause-effect power . In other words, in physical, operational terms, to exist requires being able to take and make a difference. The principle is closely related to the so-called Eleatic principle, as found in Plato's Sophist dialogue [37]: 'I say that everything possessing any kind of power, either to do anything to something else, or to be affected to the smallest extent by the slightest cause, even on a single occasion, has real existence: for I claim that entities are nothing else but power.' A similar principle can be found in the work of the Buddhist philosopher Dharmak¯ ırti: 'Whatever has causal powers, that really exists. ' [38] Note that the Eleatic principle is enunciated as a disjunction (either to do something... or to be affected...), whereas IIT's principle of being is presented as a conjunction (take and make a difference).
## Principle of maximal existence
The principle of maximal existence states that, with respect to an essential requirement for existence, what exists is what exists the most . The principle is offered by IIT as a good explanation for why the system state specified by the complex and the cause-effect states specified by its mechanisms are what they are. It also provides a criterion for determining the set of units constituting a complex-the one with maximally irreducible cause-effect power, for determining the subsets of units constituting the distinctions and relations that compose its cause-effect structure, and for determining the units' grain. To exemplify, consider a set of candidate complexes overlapping over the same substrate. By the postulates of integration and exclusion, a complex must be both unitary and definite. By the maximal existence principle, the complex should be the one that lays the greatest claim to existence as one entity, as measured by system integrated information ( ϕ s ). For the same reason, candidate complexes that overlap over the same substrate but have a lower value of ϕ s are excluded from existence. In other words, if having maximal ϕ s is the reason for assigning existence as a unitary complex to a set of units, it is also the reason to exclude from existence any overlapping set not having maximal ϕ s .
## Principle of minimal existence
Another key principle of IIT is the principle of minimal existence , which complements that of maximal existence. The principle states that, with respect to an essential requirement for existence, nothing exists more than the least it exists . The principle is offered by IIT as a good explanation for why, given that a system can only exist as one system if it is irreducible, its degree of irreducibility should be assessed over the partition across which it is least irreducible (the minimum partition). Similarly, a distinction within a system can only exist as one distinction to the extent that it is irreducible, and its degree of irreducibility should be assessed over the partition across which it is least irreducible. Moreover, a set of units can only exist as a system, or as a distinction within the system, if it specifies both an irreducible cause and an irreducible effect, so its degree of irreducibility should be the minimum between the irreducibility on the cause side and on the effect side [39].
## Identifying substrates of consciousness through existence, intrinsicality, information, integration, and exclusion
Our starting point is a substrate U in current state U = u with TPM T U (1). We consider any subset s ⊆ u as a possible complex and refer to a set of units S ⊆ U as a candidate system. (Note that s , in a slight abuse of notation, refers to both a subset of units in a state, as well as the state itself.)
By the intrinsicality postulate, the units W = U \ S are fixed in their current state w ∈ Ω W throughout the analysis of the candidate system S ( causal conditioning ). Accordingly, we obtain the TPM T S of a candidate system S from its intrinsic transition probability function
<!-- formula-not-decoded -->
where w = u \ s .
The intrinsic information ii c/e is a measure of the intrinsic cause/effect power exerted by a system S in its current state s over itself by selecting a specific cause/effect state ¯ s . The cause-effect state for which intrinsic information ( ii c and ii e ) is maximal is called the maximal cause-effect state s ′ = { s ′ c , s ′ e } . The integrated information ϕ s is a measure of the irreducibility of a cause-effect state, compared to the directional system partition θ ′ that affects the maximal cause-effect state the least (minimum partition, or MIP). Systems for which integrated information is maximal ( ϕ ∗ s ) compared to any competing candidate system with overlapping units are called maximal substrates, or complexes.
The IIT 4.0 formalism to measure a system's integrated information ϕ s and to identify maximal substrates was first presented in [16]. An example of how to identify complexes in a simple system is given in Fig. 1, while a comparison with prior accounts (IIT 1.0, IIT 2.0, and IIT 3.0) can be found in A.2. An outline of the IIT algorithm is included in A.4.
## Existence, intrinsicality, and information: Determining the maximal cause-effect state of a candidate system
Given a causal model T S (3), we wish to identify the maximal cause-effect state specified by a system in its current state over itself and to quantify the causal power with which it does so. In doing so, we quantify the cause-effect power of a system from its intrinsic perspective, rather than from the perspective of an outside observer (see Box 1).
## System intrinsic information ii
Intrinsic information ii ( s, ¯ s ) measures the causal power of a system S over itself, for its current state s , over a specific cause/effect state ¯ s . Intrinsic information depends on interventional conditional probabilities and unconstrained probabilities of cause/effect states and is the product of selectivity and informativeness.
On the effect side, intrinsic effect information ii e of the current state s over a possible effect state ¯ s is defined as:
<!-- formula-not-decoded -->
Above, p e (¯ s | s ) = p (¯ s | s ) (3) is the interventional conditional probability that the current state S = s produces the effect state ¯ s , as indicated by T S .
Fig 1. Identifying substrates of consciousness through the postulates of existence, intrinsicality, information, integration, and exclusion.(A) The substrate S = aBC in state ( -1 , 1 , 1) (lowercase letters for units indicated state ' -1', uppercase letters state '+1') is the starting point for applying the postulates. The substrate updates its state according to the depicted transition probability matrix (TPM) (each unit follows a logistic equation (see Results for definition) with k = 4.0 and connection weights as indicated in the causal model). Existence requires that the substrate must have cause-effect power, meaning that the TPM among substrate states must differ from chance. (B) Intrinsicality requires that a candidate substrate, for example, units aB , has cause-effect power over itself. Units outside the candidate substrate (in this case, unit C ) are treated as background conditions by pinning them in their current state. (C) Information requires that the candidate substrate aB selects a specific cause-effect state ( s ′ ). This is the cause state and effect state for which intrinsic information ( ii ) is maximal. Dark-colored and gray bars represent the quantities for informativeness (constrained and unconstrained), and light colored bars for selectivity. (D) Integration requires that the substrate specifies its cause-effect state irreducibly ('as one'). This is established by identifying the minimum partition (MIP) and measuring the integrated information of the system ( ϕ s )-the minimum between cause integrated information ( ϕ c ) and effect integrated information ( ϕ e ). Here, gray bars represent the partitioned probability. (E) Exclusion requires that the substrate of consciousness is definite, including some units and excluding others. This is established by identifying the candidate substrate with the maximum value of system integrated information ( ϕ ∗ s )-the maximal substrate, or complex. In this case, aB is a complex since its system integrated information ( ϕ s = 0 . 17) is higher than the one of all other overlapping systems (for example, subset a with ϕ s = 0 . 06 and superset aBC with ϕ s = 0 . 13).
<details>
<summary>Image 1 Details</summary>

### Visual Description
\n
## Diagram: Causal Structures & Information Flow
### Overview
The image presents five distinct diagrams (A-E) illustrating different causal structures and information flow concepts. Each diagram features a network of nodes (a, b, c) connected by directed edges, accompanied by a matrix representing the transition probabilities between states. The diagrams are labeled "Existence", "Intrinsicality", "Information", "Integration", and "Exclusion". Each diagram also includes a small matrix representing the effect/current state.
### Components/Axes
Each diagram shares the following components:
* **Nodes:** Represented by circles labeled 'a', 'b', and 'c'.
* **Directed Edges:** Arrows indicating the direction of causal influence. Edge weights are indicated numerically.
* **Transition Matrix (T(ABC)):** A 4x4 matrix representing the probabilities of transitioning between states ABC, Abc, aBC, and abc. Rows and columns are labeled with these states.
* **Effect/Current State Matrix:** A 2x2 matrix representing the effect/current state.
* **Labels:** Each diagram has a title indicating the concept it illustrates.
* **Color Coding:** Red indicates activation/influence, while black indicates no influence.
### Detailed Analysis or Content Details
**A) Existence**
* **Edge Weights:** a->b: 0.2, b->c: 0.2, c->a: -0.2.
* **Transition Matrix T(ABC):**
* ABC: 0.11, 0.02, 0.03, 0.02
* Abc: 0.00, 0.45, 0.10, 0.00
* aBC: 0.00, 0.00, 0.60, 0.00
* abc: 0.00, 0.00, 0.00, 0.50
* **Current State:** aBC (highlighted in blue).
* **Effect/Current State Matrix:** S' = 1.1, ii<sub>c</sub> = 1.8
**B) Intrinsicality**
* **Edge Weights:** a->b: 0.2, b->c: 0.2, c->a: -0.2. (Same as A)
* **Transition Matrix T(ABC):** (Identical to A)
* ABC: 0.11, 0.02, 0.03, 0.02
* Abc: 0.00, 0.45, 0.10, 0.00
* aBC: 0.00, 0.00, 0.60, 0.00
* abc: 0.00, 0.00, 0.00, 0.50
* **Current State:** abc (highlighted in blue).
* **Effect/Current State Matrix:** S' = 0.6, ii<sub>c</sub> = 0.6
**C) Information**
* **Edge Weights:** a->b: (red), b->c: (red), a->c: (red).
* **Transition Matrix:** Not fully visible, but shows values like 0.71, 0.44.
* **Current State:** Ab (highlighted in blue).
* **Effect/Current State Matrix:** ii<sub>max</sub> = 1.1, ii<sub>c</sub> = 1.8
**D) Integration**
* **Edge Weights:** a->b: (red), b->c: (red), c->a: (red). Angle θ' is indicated.
* **Φ<sub>s</sub>:** 0.17
* **Φ<sub>c</sub>:** 0.93
* **Transition Matrix:** Not fully visible, but shows values like 97.61.
* **Current State:** aB (highlighted in blue).
**E) Exclusion**
* **Edge Weights:** a->b: (red), b->c: (red).
* **Φ<sub>s</sub>(aB):** 0.17
* **Φ<sub>c</sub>(a):** 0.06
* **Φ<sub>c</sub>(a,b):** 0.29
* **Φ<sub>c</sub>(a,b,c):** 0.13
* **Transition Matrix:** Not fully visible.
* **Current State:** ab (highlighted in blue).
### Key Observations
* Diagrams A and B share the same network structure and transition matrix, differing only in the highlighted current state.
* The "Information" diagram (C) shows all nodes influencing each other.
* The "Integration" diagram (D) introduces an angle θ' and uses Φ values (likely representing some form of correlation or influence strength).
* The "Exclusion" diagram (E) introduces multiple Φ values, suggesting a more complex relationship between the nodes.
* The matrices are not fully visible, making precise data extraction difficult.
### Interpretation
These diagrams appear to be exploring different types of causal relationships within a simple three-node system. The transition matrices quantify the probabilities of the system being in different states, while the edge weights represent the strength of causal influence.
* **Existence & Intrinsicality (A & B):** These diagrams demonstrate how the same underlying causal structure can lead to different system states. The difference in current state (aBC vs. abc) highlights the sensitivity to initial conditions.
* **Information (C):** This diagram suggests a fully connected system where information can flow freely between all nodes.
* **Integration (D):** The introduction of the angle θ' and Φ values suggests a more nuanced relationship, potentially representing the degree of integration or synergy between the nodes. The high Φ<sub>c</sub> value (0.93) indicates a strong correlation in the current state.
* **Exclusion (E):** The multiple Φ values in this diagram suggest that the influence of one node on another is contingent on the state of other nodes. The lower Φ<sub>c</sub>(a) value (0.06) suggests that node 'a' has a limited influence on the system when considered in isolation.
The diagrams collectively illustrate how different causal structures and information flow patterns can emerge within a simple system, and how these patterns can be quantified using probabilistic models. The use of color coding (red for activation, black for no influence) provides a visual representation of the causal relationships. The matrices provide a quantitative measure of the system's dynamics. The diagrams are likely part of a larger theoretical framework exploring the foundations of information processing and causality.
</details>
The interventional unconstrained probability p e (¯ s )
<!-- formula-not-decoded -->
is defined as the marginal probability of ¯ s , averaged across all possible current states of S with equal probability (where | Ω S | denotes the cardinality of the state space Ω S ).
On the cause side, intrinsic cause information ii c of the current state s over a possible cause state ¯ s is defined as:
<!-- formula-not-decoded -->
Above, p c (¯ s | s ) is the interventional conditional probability that the current state S = s was produced by ¯ s . The latter is derived from T S using Bayes' rule, where we again assign a uniform prior to the states of ¯ s ,
<!-- formula-not-decoded -->
<!-- formula-not-decoded -->
## Informativeness (over chance)
In (4) and (6), the logarithmic term (in base 2 throughout) is called informativeness . Note that informativeness is expressed in terms of effect probabilities for both ii e (4) and ii c (6). However, ii e (4) evaluates the increase in probability of the effect state due to the current state, while ii c (6) evaluates the increase in probability of the current state due to the cause state.
In line with the existence postulate, a system S in state s has cause-effect power (it takes and makes a difference) if it raises the probability of a possible effect state compared to chance, which is to say compared to its unconstrained probability,
<!-- formula-not-decoded -->
and if the probability of the current state is raised above chance by a possible cause state,
<!-- formula-not-decoded -->
Informativeness is additive over the number of units: if a system specifies a cause or effect state with probability p = 1, its causal power increases additively with the number of units whose states it fully specifies ( expansion ), given that the chance probability of all states decreases exponentially.
## Selectivity (over states)
From the intrinsic perspective of a system, cause-effect power over a specific cause or effect state depends not only on the deviation from chance it produces, but also on how that deviation is concentrated on that state, rather than being diluted over other states. This is measured by the selectivity term in front of the logarithmic term in (4) and (6), corresponding to the conditional probability p c/e (¯ s | s ) of that specific cause or effect state. Selectivity means that if p < 1, the system's causal power becomes subadditive ( dilution ) (see [17] for details). For example, as shown in [15], if an unconstrained unit is added to a fully specified unit, intrinsic information does not just stay the same, but decreases exponentially. From the intrinsic perspective of the system, the such that
informativeness of a specific cause/effect state is diluted because it is spread over multiple possible states, yet the system must select only one state.
Altogether, taking the product of informativeness and selectivity leads to a tension between expansion and dilution: a larger system will tend to have higher informativeness than a smaller system because it will deviate more from chance, but it will also tend to have lower selectivity because it will have a larger repertoire of states to select from.
Because of the selectivity term, intrinsic information is reduced by indeterminism and degeneracy. As shown in [16], indeterminism decreases the probability of the selected effect state because it implies that the same state can lead to multiple states. In turn, degeneracy decreases the probability of the selected cause state because it implies that multiple states can lead to the same state, even in a deterministic system.
The intrinsic information ii is quantified in units of intrinsic bits , or ibits , to distinguish it from standard information-theoretic measures (which are typically additive). Formally, the ibit corresponds to a point-wise information value (measured in bits) weighted by a probability.
## The maximal cause-effect state
Taking the product of informativeness and selectivity on the system's cause and effect sides captures the postulates of existence (taking and making a difference) and intrinsicality (taking and making a difference over itself) for each possible cause/effect state, as measured by intrinsic information. However, the information postulate further requires that the system selects a specific cause/effect state. Which one is determined based on the principle of maximal existence (Box 1): the cause/effect specified by the system should be the one that maximizes intrinsic information. On the effect side (and similarly for the cause side),
<!-- formula-not-decoded -->
The system's intrinsic effect information is the value of ii e (4) for its maximal effect state:
<!-- formula-not-decoded -->
We have made the dependency of s ′ and ii e on T S explicit in (11) and (12) to highlight that for intrinsic information to properly assess cause-effect power, all probabilities must be derived from the system's interventional transition probability function, while imposing a uniform prior distribution over all possible system states. If there is no state with ii e ( T S , s ) > 0, the system S in state s has no causal power (and likewise for ii c ( T S , s )). Note also that a system's intrinsic cause/effect state does not necessarily correspond to the actual cause/effect state (what actually happened before / will happen after) in the dynamical evolution of the system, which typically also depends on extrinsic influences. (For an account of actual causation according to the causal principles of IIT, see [13].)
Because consciousness is the way it is, the translation of its properties in physical, operational terms should be unique and based on quantities that uniquely satisfy the postulates [15,40]. Intrinsic information (12) is formally equivalent to a measure of intrinsic difference [17], which uniquely satisfies three desired properties (causality, specificity, and intrinsicality) that align with the postulates of IIT, but also have independent justification.
## Integration: Determining the irreducibility of a candidate system
Having identified the maximal cause-effect state s ′ = { s ′ c , s ′ e } of a candidate system S in its current state s , the next step is to evaluate whether the system specifies the cause-effect state of its units in a way that is irreducible , as required by the integration postulate: a candidate system can only be a substrate of consciousness if it is one system-that is, if it cannot be subdivided into subsets of units that exist separately from one another.
## Directional system partitions
To that end, we define a set of directional system partitions Θ( S ) that divide S into k ≥ 2 parts { S ( i ) } k i =1 , such that
/negationslash
<!-- formula-not-decoded -->
In words, each part S ( i ) must contain at least one unit, there must be no overlap between any two parts S ( i ) and S ( j ) , and every unit of the system must appear in exactly one part. For each part S ( i ) , the partition removes the causal connections of that part with the rest of the system in a directional manner: either the part's inputs, outputs, or both are replaced by independent 'noise' (they are 'cut' by the partition in the sense that their causal powers are substituted by chance). Directional partitions are necessary because, from the intrinsic perspective of a system, a subset of units that cannot affect the rest of the system, or cannot be affected by it, cannot truly be a part of the system. In other words, to be a part of a system, a subset of units must be able to interact with the rest of the system in both directions (cause and effect).
A partition θ ∈ Θ( S ) thus has the form
<!-- formula-not-decoded -->
where δ i ∈ {← , → , ↔} indicates whether the inputs ( ← ), outputs ( → ), or both ( ↔ ) are cut for a given part. For each part S ( i ) , we can then identify a set of units X ( i ) ⊆ S whose inputs to S ( i ) have been cut by the partition, and the complementary set Y ( i ) = S \ X ( i ) whose inputs to S ( i ) are left intact. Specifically,
/negationslash
<!-- formula-not-decoded -->
Given a partition θ ∈ Θ( S ), we define a partitioned transition probability matrix T θ S in which all connections affected by the partition are 'noised.' This is done by combining the independent contributions of each unit S j ∈ S in line with the conditional independence assumption (2), such that
<!-- formula-not-decoded -->
where the partitioned probability of a unit S j ∈ S ( i ) is defined as
<!-- formula-not-decoded -->
This means that all connections to unit S j that are affected by the partition are causally marginalized (replaced by independent noise).
## System integrated information ϕ s
The integrated effect information ϕ e measures how much the partition θ ∈ Θ S reduces the probability with which a system S = s specifies its effect state s ′ e (11),
<!-- formula-not-decoded -->
Note that ϕ e has the same form as the intrinsic information ii e ( s, ¯ s ) (4), with the partitioned effect probability taking the place of the unconstrained (marginal) probability. Likewise, the integrated cause information ϕ c is defined as
<!-- formula-not-decoded -->
(By the principle of maximal existence, if two or more cause-effect states are tied for maximal intrinsic information, the system specifies the one that maximizes ϕ c/e .)
By the zeroth postulate, existence requires cause and effect power, and the integration postulate requires that its cause-effect power be irreducible. By the principle of minimal existence (Box 2), then, system integrated information for a given partition is the minimum of its irreducibility on the cause and effect sides:
<!-- formula-not-decoded -->
Accordingly, the system is reducible if at least one partition θ ∈ Θ S makes no difference to the cause or effect probability.
Moreover, again by the principle of minimal existence, the integrated information of a system is given by its irreducibility over its minimum partition (MIP) θ ′ ∈ Θ S , such that
<!-- formula-not-decoded -->
The MIP is defined as the partition θ ∈ Θ S that minimizes the system's integrated information, relative to the maximum possible value it could take for an arbitrary TPM T ′ S over the units of system S
<!-- formula-not-decoded -->
The normalization ensures that ϕ s ( T S , s ) is evaluated fairly over a system's fault lines by assessing integration relative to its maximum possible value over a given partition. Using the relative integrated information quantifies the strength of the interactions between parts in a way that does not depend on the number of parts and their size. As proven in [16], the maximal value of ϕ ( T S , s, θ ) for a given partition θ is the
<!-- formula-not-decoded -->
maximal possible number of 'connections' (pairwise interactions) affected by θ . For example, as shown in [16], the MIP will correctly identify the fault line dividing a system into two large subsets of units linked through a few interconnected units (a 'bridge'), rather than defaulting to partitions between individual units and the rest of the system. Once the minimum partition has been identified, the integrated information across it is an absolute quantity, quantifying the loss of intrinsic information due to
cutting the minimum partition of the system. (If two or more partitions θ ∈ Θ( S ) minimize Eqn. (22), we select the partition with the largest unnormalized ϕ s value as θ ′ , applying the principle of maximal existence.) Defining θ ′ as in (22), moreover, ensures that ϕ s ( T S , s ) = 0 if the system is not strongly connected in graph-theoretic terms.
In summary, the system integrated information ( ϕ s ( T S , s )) quantifies the extent to which system S in state s has cause-effect power over itself as one system ( i.e. , irreducibly). ϕ s ( T S , s ) is thus a quantifier of irreducible existence.
## Exclusion: Determining maximal substrates (complexes)
In general, multiple candidate systems with overlapping units may have positive values of ϕ s ( T S , s ). By the exclusion postulate, the substrate of consciousness must be definite; that is, it must comprise a definite set of units. But which one? Once again, we employ the principle of maximal existence (Box 2): among candidate systems competing over the same substrate with respect to an essential requirement for existence, in this case irreducibility, the one that exists is the one that exists the most. Accordingly, the maximal substrate, or complex, is the candidate substrate with the maximum value of system integrated information ( ϕ ∗ s ), and overlapping substrates with lower ϕ s are excluded from existence.
## Determining maximal substrates recursively
Within a universal substrate U 0 in state u 0 , subsets of units that specify maxima of irreducible cause-effect power (complexes) can be identified by an iterative search for the system S ∗ k ⊆ U k with
<!-- formula-not-decoded -->
<!-- formula-not-decoded -->
and U k +1 = U k \ S ∗ k until U k +1 = ∅ or U k +1 = u k (the units in U 0 \ U k +1 still serve as background conditions). If the maximal substrate S ∗ k is not unique, and all tied systems overlap, the next best system that is unique is chosen instead; for details see [16], and also [41].
For any complex S ∗ = s ∗ , overlapping substrates that specify less integrated information ( ϕ s < ϕ s ( T S ∗ , s ∗ )) are excluded. Consequently, specifying a maximum of integrated information ϕ ∗ s compared to all overlapping systems
/negationslash
<!-- formula-not-decoded -->
/negationslash is a sufficient requirement for a system S ⊆ U to be a complex.
As shown in [16], this recursive search for maximal substrates 'condenses' the universe U 0 = u 0 into a disjoint (non-overlapping) and exhaustive set of complexes-the first complex, second complex, and so on.
## Determining maximal unit grains
Above, we presented how we can determine the borders of a complex within a larger system U , assuming a particular spatio-temporal grain for the units U i ∈ U . In principle, however, all possible grains should be considered [42,43]. In the brain, for example, the grain of units could be brain regions, groups of neurons, individual neurons, sub-cellular structures, molecules, atoms, quarks, or anything finer, down to hypothetical atomic units of cause-effect power [3,7]. For any unit grain-neurons, for such that
example-the grain of updates could be minutes, seconds, milliseconds, micro-seconds, and so on. And the grain of states could be 2 states ('no spikes / any number of spikes per neuron over one hundred milliseconds'), 4 states ('no spikes, 1 spike, 2-5 spikes, bursts of > 5 spikes'), or 256 states (from 0 to 255 spikes in intervals of 1 spike), and so on. However, by the exclusion postulate, the units that constitute a system S must also be definite, in the sense of having a definite spatio-temporal grain.
Once again, the grain is defined by the principle of maximal existence: across the possible micro- and macroscopic levels, the 'winning' grain is the one that ensures maximally irreducible existence ( ϕ ∗ s ) for the entity to which the units belong [42,43].
As constituents of a complex upon which its cause-effect power rests, the units themselves should comply with the postulates of IIT [4]. Otherwise it would be possible to 'make something out of nothing.' Accordingly, units themselves must also be maximally irreducible, as measured by the unit's integrated information ( ϕ j ≡ ϕ s ); otherwise, they would not be units but 'disintegrate' into their constituents. However, in contrast to systems, units only need to be maximally irreducible within, because they do not exist as entities in their own right: a unit J qualifies as a candidate unit of a larger system S if its integrated information ϕ ( j ) is higher than that of any of its subsets
<!-- formula-not-decoded -->
Out of all possible sets of such candidate units, the set of (macro) units that actually exists is the one that maximizes the existence of the complex to which they belong, rather than their own existence.
## Unfolding the cause-effect structure of a complex through composition
Once a maximal substrate and the associated maximal cause-effect state have been identified, we must unfold its cause-effect power to reveal its cause-effect structure of distinctions and relations, in line with the composition postulate. As components of the cause-effect structure, distinctions and relations must also satisfy the postulates of IIT (save for composition).
## Composition and causal distinctions
Causal distinctions capture how the cause-effect power of a substrate is structured by subsets of units that specify irreducible causes and effects over subsets of its units. A candidate distinction d ( m ) consists of (1) a mechanism M ⊆ S in state m ∈ Ω M inherited from the system state s ∈ Ω S ; (2) a maximal cause-effect state z ∗ = { z ∗ c , z ∗ e } over the cause and effect purviews ( Z c , Z e ⊆ S ) linked by the mechanism; and (3) an associated value of irreducibility ( ϕ d > 0). A distinction d ( m ) is thus represented by the tuple
<!-- formula-not-decoded -->
For a given mechanism m , our goal is to identify the maximal cause Z ∗ c = z ∗ c and effect Z ∗ e = z ∗ e of m within the system, where Z ∗ c , Z ∗ e ⊆ S , z ∗ c ∈ Ω Z ∗ c , z ∗ e ∈ Ω Z ∗ e .
As above, in line with existence, intrinsicality, and information, we determine the maximal cause or effect state specified by the mechanism over a candidate purview within the system based on the value of intrinsic information ii ( m,z ). Next, in line with integration, we determine the value of integrated information ϕ d ( m,Z,θ ) over the minimum partition θ ′ . In line with exclusion, we determine the maximal cause-effect purviews for that mechanism over all possible purviews Z ⊆ S based on the associated
value of irreducibility ϕ d ( m,Z,θ ′ ). Finally, we determine whether the maximal cause-effect state specified by the mechanism is congruent with the system's overall cause-effect state ( z ∗ c ⊆ s ∗ c , z ∗ e ⊆ s ∗ e ), in which case we conclude that it contributes a distinction to the overall cause-effect structure.
The updated formalism to identify causal distinctions within a system S in state s was first presented in [15]. Here we provide a summary with minor adjustments on selecting z ∗ c and z ∗ e , the cause integrated information ϕ c ( m,Z ), and the requirement that causal distinctions must be congruent with the system's maximal cause-effect state (see A.2).
## Existence, intrinsicality, and information: Determining the cause and effect state specified by a mechanism over candidate purviews
Like the system as a whole, its subsets must comply with existence, intrinsicality, and information. As for the system, we begin by quantifying, in probabilistic terms, the difference a subset of units M ⊆ S , in its current state m ⊆ s takes and makes from and to subsets of units Z ⊆ S (cause and effect purview). As above, we start by establishing the interventional conditional probabilities and unconstrained probabilities from the TPM T S .
When dealing with a mechanism constituted by a subset of system units, it is important to capture the constraints on a purview state z that are exclusively due to the mechanism in its state ( m ), removing any potential contribution from other system units. This is done by causally marginalizing all variables in X = S \ M , which corresponds to imposing a uniform distribution as p ( X t ) [11, 13, 15] (note [44]). In addition, product probabilities π ( z | m ) are used instead of conditional probabilities p ( z | m ) to discount correlations from units in X = S \ M with divergent outputs to multiple units in Z ⊆ S [11, 13, 45]. In this way, causal marginalization maintains the conditional independence between units (2) by applying independent noise to individual connections. The assumption of conditional independence distinguishes IIT's causal analysis from standard information-theoretic analyses of information flow [13,31] and corresponds to an assumption that variables are 'physical' units in the sense that they are irreducible within and can be observed and manipulated independently.
The effect probability of a single unit Z i ∈ Z conditioned on the current state m is thus defined as
<!-- formula-not-decoded -->
and the effect probability over a set Z of | Z | units is defined as the product of the effect probabilities over individual units
<!-- formula-not-decoded -->
From Eq. (29) we can also define an unconstrained effect probability
<!-- formula-not-decoded -->
Given the set Y = S \ Z , the cause probability for a mechanism m with | M | units is computed using Bayes' rule over the product distributions
<!-- formula-not-decoded -->
where p e ( m i | z ) = | Ω Y | -1 ∑ y ∈ Ω Y p ( m i | z, y ) in line with (28).
To correctly quantify intrinsic causal constraints, the unconstrained cause probability is again set to the uniform distribution π c ( z ) = | Ω Z | -1 , z ∈ Ω Z (unlike the unconstrained effect probability, the unconstrained cause distribution does not depend on the mechanism). Note that the product in (31) is over units of M , not of Z . For details see [13,15]. As above, all probabilities p are obtained from the TPM T S (3) and thus correspond to interventional probabilities throughout.
Having defined cause and effect probabilities, we can now evaluate the intrinsic information of a mechanism m over a purview state Z = z analogously to the system intrinsic information (4) and (6). The intrinsic effect information that a mechanism in a state m specifies about a purview state z is
<!-- formula-not-decoded -->
The intrinsic cause information that a mechanism in a state m specifies about a purview state z is
<!-- formula-not-decoded -->
As with system intrinsic information, the logarithmic term is the informativeness, which captures how much causal power is exerted by the mechanism m on its potential effect z (how much it increases the probability of that state above chance), or by the potential cause z on the mechanism m . The first term corresponds to the mechanism's selectivity, which captures how much the causal power of the mechanism m is concentrated on a specific state of its purview (as opposed to other states). In the following we will again focus on the effect side, but an equivalent procedure applies on the cause side.
Based on the principle of maximal existence, the maximal effect state of m within the purview Z is defined as
<!-- formula-not-decoded -->
which corresponds to the specific effect of m on Z . Note that z ′ e is not always unique (see A.1). The maximal intrinsic information of mechanism m over a purview Z is then
<!-- formula-not-decoded -->
The intrinsic information of a candidate distinction, like that of the system as a whole, is sensitive to indeterminism (the same state leading to multiple states) and degeneracy (multiple states leading to the same state) because both factors decrease the probability of the selected state. Moreover, the product of selectivity and informativeness leads to a tension between expansion and dilution: larger purviews tend to increase informativeness because conditional probabilities will deviate more from chance, but they also tend to decrease selectivity because of the larger repertoire of states.
## Integration: Determining the irreducibility of a candidate distinction
To comply with integration, we must next ask whether the specific effect of m on Z is irreducible. As for the system, we do so by evaluating the integrated information ϕ e ( m,Z ). To that end, we define a set of 'disintegrating' partitions Θ( M,Z ) as
/negationslash
<!-- formula-not-decoded -->
where { M ( i ) } is a partition of M and { Z ( i ) } is a partition of Z , but the empty set may also be used as a part ( P denotes the power set). As introduced in [13,15], a disintegrating partition θ ∈ Θ( M,Z ) either 'cuts' the mechanism into at least two independent parts if | M | > 1, or it severs all connections between M and Z , which is always the case if | M | = 1 (we refer to [13,15] for details).
Given a partition θ ∈ Θ( M,Z ), we can define the partitioned effect probability
<!-- formula-not-decoded -->
with π ( ∅ | m ( i ) ) = π ( ∅ ) = 1. In the case of m ( i ) = ∅ , π e ( z ′ ( i ) e | ∅ ) corresponds to the fully partitioned effect probability
<!-- formula-not-decoded -->
The integrated effect information of mechanism m over a purview Z ⊆ S with effect state z ′ e for a particular partition θ ∈ Θ( M,Z ) is then defined as
<!-- formula-not-decoded -->
The effect of m on z ′ e is reducible if at least one partition θ ∈ Θ( M,Z ) makes no difference to the effect probability. In line with the principle of minimal existence, the total integrated effect information ϕ e ( m,Z ) again has to be evaluated over θ ′ , the minimum partition (MIP)
<!-- formula-not-decoded -->
which requires a search over all possible partitions θ ∈ Θ( M,Z ):
<!-- formula-not-decoded -->
As in (22), the minimum partition is evaluated against its maximum possible value across all possible system T ′ S , which again corresponds to the number of possible pairwise interactions affected by the partition.
The integrated cause information is defined analogously, as
<!-- formula-not-decoded -->
where the partitioned probability π θ e ( m | z ) is again a product distribution over the parts in the partition, as in (37).
Taken together, the intrinsic information (35) determines what cause or effect state the mechanism m specifies. Its integrated information quantifies to what extent m specifies its cause or effect in an irreducible manner. Again, ϕ ( m,Z ) is a quantifier of irreducible existence.
## Exclusion: Determining causal distinctions
Finally, to comply with exclusion, a mechanism must select a definite effect purview, as well as a cause purview, out of a set of candidate purviews. Resorting again to the principle of maximal existence, the mechanism's effect purview and associated effect is the one having the maximum value of integrated information across all possible purviews Z ⊆ S in state z ′ e ( m,Z ) (34)
<!-- formula-not-decoded -->
The integrated effect information of a mechanism m within S is then
<!-- formula-not-decoded -->
The integrated cause information ϕ c ( m ) and the maximally irreducible cause z ∗ c ( m ) are defined in the same way. Based again on the principle of minimal existence, the irreducibility of the distinction specified by a mechanism is given by the minimum between its integrated cause and effect information
<!-- formula-not-decoded -->
## Determining the set of causal distinctions that are congruent with the system cause-effect state
As required by composition, unfolding the full cause-effect structure of the system S in state s requires assessing the irreducible cause-effect power of every subset of units within S (Fig. 2). Any m ⊆ s with ϕ d > 0 specifies a candidate distinction d ( m ) = ( m,z ∗ , ϕ d ) (27) within the system S in state s . However, in order to contribute to the cause-effect structure of a system, distinctions must also comply with intrinsicality and information at the system level. This means that the cause-effect state they specify over subsets of the system ( z ∗ = { z ∗ c , z ∗ e } ) must be congruent with the cause-effect state specified over itself by the system as a whole s ′ .
We thus define the set of all causal distinctions within S in state s as
<!-- formula-not-decoded -->
Altogether, distinctions can be thought of as irreducible 'handles' through which the system can take and make a difference to itself by linking an intrinsic cause to an intrinsic effect over subsets of itself. As components within the system, causal distinctions have no inherent structure themselves. Whatever structure there may be between the units that make up a distinction is not a property of the distinction but due to the structure of the system, and thus captured already by its compositional set of distinctions. Similarly, from an extrinsic perspective, one may uncover additional causes and effects, both within the system and across its borders, at either macro or micro grains. However, from the intrinsic perspective of the system causes and effects that are excluded from its cause-effect structure do not exist [20,36].
Fig 2. Composition and causal distinctions. Identifying the irreducible causal distinctions specified by a substrate in a state requires evaluating the specific causes and effects of every system subset. The candidate substrate is constituted of two interacting units S = aB (see Fig. 1) and updates its state according to the depicted transition probability matrix. In addition to the two first-order mechanisms a and B , the second-order mechanism aB specifies its own irreducible cause and effect, as indicated by ϕ d > 0.
<details>
<summary>Image 2 Details</summary>

### Visual Description
## Diagram: Causal Network and Irreducible Distinctions
### Overview
The image depicts a causal network with three nodes (a, b, and c) and their interactions, alongside a representation of irreducible distinctions between variables a and B. The network's weights are visualized, and a matrix shows the probabilities of different state combinations. The right side of the image illustrates the "Set of irreducible distinctions D(aB)" with a hierarchical structure (1st and 2nd order) and associated probabilities. A blue arrow connects the network to the distinction set.
### Components/Axes
The diagram consists of three main sections:
1. **Causal Network:** Nodes labeled 'a', 'b', and 'c'. Arrows indicate causal relationships with weights. A scale on the left shows the weight mapping (0.2 to -0.8).
2. **Probability Matrix:** A 4x4 matrix with row and column labels: 'ab', 'Ab', 'aB', 'AB'. The matrix contains numerical values representing probabilities.
3. **Irreducible Distinctions:** A boxed area labeled "Set of irreducible distinctions D(aB)". This section is divided into "2nd order" and "1st order" subsections, each containing diagrams and probability values. A legend on the right defines color-coding for "cause purview", "mechanism", and "effect purview".
### Detailed Analysis or Content Details
**Causal Network:**
* Node 'a' is black, Node 'b' is black, and Node 'c' is grey.
* An arrow points from 'a' to 'b' with a weight of approximately -0.8.
* An arrow points from 'a' to 'c' with a weight of approximately 0.2.
* An arrow points from 'c' to 'b' with a weight of approximately 0.2.
**Probability Matrix:**
The matrix values are as follows (row = first variable, column = second variable):
* ab: 0.440, 0.060, 0.000, 0.000
* Ab: 0.330, 0.100, 0.480, 0.000
* aB: 0.010, 0.490, 0.000, 0.000
* AB: 0.080, 0.560, 0.020, 0.170
**Irreducible Distinctions:**
* **2nd Order:** Two arrows labeled 'Ab' (red) point upwards. The probability φ<sub>d</sub>(aB) = 0.07 is given.
* **1st Order:**
* An arrow labeled 'a' (red) points downwards, and an arrow labeled 'b' (red) points downwards.
* An arrow labeled 'Ab' (red) points to the right.
* An arrow labeled 'B' (red) points downwards, and an arrow labeled 'Ab' (red) points to the right.
* The probability φ<sub>d</sub>(a) = 0.33 is given.
* The probability φ<sub>d</sub>(B) = 0.86 is given.
**Legend:**
* Cause purview: Red
* Mechanism: Blue
* Effect purview: Green
### Key Observations
* The causal network shows a strong negative influence from 'a' to 'b' and weaker positive influences from 'a' to 'c' and 'c' to 'b'.
* The probability matrix shows that the combination 'ab' has the highest probability (0.440), while 'AB' has a non-negligible probability (0.170).
* The irreducible distinctions section highlights the importance of 'B' (φ<sub>d</sub>(B) = 0.86) compared to 'a' (φ<sub>d</sub>(a) = 0.33) in distinguishing aB.
* The 2nd order distinctions have a much lower probability (0.07) than the 1st order distinctions.
### Interpretation
This diagram illustrates a causal model and its decomposition into irreducible distinctions. The network represents a system where 'a' directly influences 'b' and 'c', and 'c' influences 'b'. The probability matrix quantifies the likelihood of different states of 'a' and 'b' occurring. The irreducible distinctions section, D(aB), aims to identify the minimal set of distinctions needed to differentiate between 'a' and 'B'. The higher probability associated with φ<sub>d</sub>(B) suggests that 'B' is a more significant factor in distinguishing 'aB' than 'a' itself. The hierarchical structure of the irreducible distinctions (1st and 2nd order) suggests a refinement process, where the 1st order distinctions provide a broader separation, and the 2nd order distinctions offer a more nuanced understanding. The color-coding (cause, mechanism, effect) helps to understand the flow of influence within the system. The diagram suggests a system where the effect of 'a' on 'b' is mediated by 'c', and the distinction between 'a' and 'B' is primarily driven by the state of 'B'.
</details>
For example, as shown in Fig. 3(A), a system may have a mechanism through which it specifies, in a maximally irreducible manner, the effect state of a triplet of units ( e.g. , z ∗ e = abc , a third-order purview; again lowercase letters for units indicate state ' -1', uppercase letters state '+1'). However, if the system lacks a mechanism through which it can specify the effect state of single units, each taken individually (say, unit a , a first-order effect purview), then, from its intrinsic perspective, that unit does not exist as a single unit. By the same token, if the system can specify individually the state of unit a , b , and c , but lacks a way to specify irreducibly the state of abc together, then, from its intrinsic perspective, the triplet abc does not exist as a triplet (see Fig. 3(B)). Finally, even if the system can distinguish the single units a , b , and c , as well as the triplet abc , if it lacks handles to distinguish pairs of units such as ab and bc , it cannot order units in a sequence.
## Composition and causal relations
Causal relations capture how the causes and/or effects of a set of distinctions within a complex overlap with each other. Just as a distinction specifies which units/states constitute a cause purview and the linked effect purview, a relation specifies which units/states correspond to which units/states among the purviews of a set of distinctions. Relations thus reflect how the cause-effect power of its distinctions is 'bound together' within a complex. The irreducibility due to this binding of cause-effect power is measured by the relations' irreducibility ( ϕ r > 0). Relations between distinctions were first described in [14] (for differences with the initial presentation see A.2).
A set of distinctions d ⊆ D ( s ) is related if the cause, or effect, or both the cause and effect of each distinction d ∈ d overlap congruently over a set of shared units. Below we will denote the cause of a distinction d as z ∗ c ( d ) and its effect as z ∗ e ( d ). For a given set of distinctions d ⊆ D ( s ), there are potentially many 'relating' sets of causes and/or
Fig 3. Composition of intrinsic effects. From the intrinsic perspective of the system, a specific cause or effect is only available to the system if it is selected by a causal distinction d ∈ D ( s ). In (A), only the top-order effect is specified. From the intrinsic perspective, the system cannot distinguish the individual units. In (B), only first-order effects are specified. The system has no 'handle' to select all three units together. (C) If both first- and third-order effects are specified, but no second-order effects, the system can distinguish individual units and select them together, but has no way of ordering them sequentially. (D) The system can distinguish individual units, select them altogether, as well as order them sequentially, in the sense that it has a handle for ab and bc , but not ac . The ordering becomes apparent once the relations among the distinctions are considered (see below).
<details>
<summary>Image 3 Details</summary>

### Visual Description
\n
## Diagram: Effects of Multiple Genetic Variants
### Overview
The image presents a diagram illustrating four different scenarios for the effects of multiple genetic variants (m1, m2, m3, m4, m5, m6) on a trait. Each scenario (A, B, C, D) is represented visually using overlapping circles, with the letters 'a', 'b', and 'c' representing individual effects within those variants. The diagram aims to demonstrate how the combined effect of multiple variants can manifest in different ways.
### Components/Axes
The diagram consists of four panels labeled (A) through (D). Each panel depicts a different model of genetic effect. The panels are arranged in a 2x2 grid. Each panel contains circles representing genetic variants and letters within the circles representing individual effects. Lines connect the circles to the variant labels (m1, m2, m3, m4, m5, m6).
### Detailed Analysis or Content Details
**Panel A: One "lump effect"**
* A single large, light blue circle labeled "m1" encompasses three smaller green circles labeled "a", "b", and "c".
* This suggests that the effects 'a', 'b', and 'c' are all part of a single, combined effect associated with variant m1.
**Panel B: Three individual effects**
* Four separate, light blue circles are labeled "m2", "m3", "m4", and "m5".
* Each circle contains a single green circle labeled "a", "b", and "c" respectively.
* This indicates that each effect ('a', 'b', 'c') is associated with a distinct variant (m2, m3, m4).
**Panel C: Unordered effects**
* A large, light blue circle labeled "m1" contains three smaller green circles labeled "a", "b", and "c".
* Lines connect the green circles to variant labels "m2", "m3", and "m4".
* This suggests that the effects 'a', 'b', and 'c' are associated with variant m1, but also have connections to variants m2, m3, and m4. The connections are not ordered.
**Panel D: Ordered effects**
* Overlapping light blue circles are labeled "m1", "m2", "m3", "m4", "m5", and "m6".
* Three green circles labeled "a", "b", and "c" are positioned within the overlapping regions.
* Lines connect the green circles to variant labels "m1", "m2", "m3", "m4".
* This indicates that the effects 'a', 'b', and 'c' are associated with multiple variants (m1, m2, m3, m4) in an ordered or overlapping manner.
### Key Observations
* The size and arrangement of the circles vary significantly across the panels, reflecting different models of genetic effect.
* The use of overlapping circles in panels C and D suggests interactions or shared effects between variants.
* The lines connecting the green circles to the variant labels emphasize the association between individual effects and specific variants.
### Interpretation
The diagram illustrates different ways in which multiple genetic variants can contribute to a trait. Panel A represents a scenario where multiple effects are bundled into a single variant. Panel B shows independent effects of each variant. Panel C suggests that effects can be associated with multiple variants in an unordered fashion, while Panel D demonstrates an ordered or overlapping relationship between effects and variants. The diagram highlights the complexity of genetic effects and the importance of considering interactions between variants when studying complex traits. The diagram does not provide quantitative data, but rather a qualitative representation of different genetic models. It is a conceptual illustration rather than a data-driven analysis.
</details>
effects z such that
<!-- formula-not-decoded -->
with maximal overlap
/negationslash
/negationslash
<!-- formula-not-decoded -->
All possible sets z specify unique aspects about a relation r ( d ) and constitute the various 'faces' of the relation. The maximal overlap o ∗ ( z ) (48) is also called the 'face purview.' Note that (47) includes the case z = { z ∗ c ( d ) , z ∗ e ( d ) } , which indicates a 'self-relation' over the cause and effect of a single distinction d ∈ D ( s ).
A relation r ( d ) thus consists of a set of distinctions d ⊆ D ( s ), with an associated set of faces f ( d ) = { f ( z ) } d and irreducibility ϕ r > 0,
<!-- formula-not-decoded -->
A relation that binds together h = | d | distinctions is a h -degree relation. A relation face f ( z ) ∈ f ( d ) consists of a set of causes and effects z (as in 47), with associated face purview o ∗ ( z )(48)
<!-- formula-not-decoded -->
A relation face over k = | z | purviews is a k -degree face. Because z may include either the cause, or the effect, or both the cause and effect of a distinction d ∈ d , a relation r ( d ) with | d | > 1 may comprise up to 3 | d | faces. If a set of distinctions d ∈ D ( s ) does not overlap congruently, it is not related (in that case o ∗ ( z ) = ∅ for all possible f ( z ) ∈ f ( d )).
Causal relations inherit existence from the cause-effect power of the distinctions that compose them. They inherit intrinsicality because the causes and effects that compose their faces are specified within the substrate. Moreover, relations are specific because the joint purviews of their faces must be congruent for all causes and effects z ∗ ∈ z . Note that relation purviews are necessarily congruent with the overall cause and effect state specified by the system as a whole, because the causes and effects of the distinctions composing a relation must themselves be congruent.
The irreducibility of a causal relation is measured by 'unbinding' distinctions from their joint purviews, taking into account all faces of the relation. Distinctions d ∈ D ( s ) are already established as maximally irreducible components, characterized by their value of integrated information ϕ d . To assess the irreducibility of a relation, we thus assume that the integrated information ϕ d of a distinction is distributed uniformly across unique cause and effect purview units, such that
<!-- formula-not-decoded -->
is the average irreducible information ϕ d per unique purview unit for an individual distinction d ∈ d with cause-effect state z ∗ ( d ) = { z ∗ c ( d ) , z ∗ e ( d ) } (congruent units on the cause and effect side count as one, while incongruent units are counted separately).
Since distinctions are related by constraining common units (though they may constrain them in different ways), the effect of 'unbinding' a distinction must be proportional to the number of units jointly constrained in the relation, i.e. the number of unique units over the joint purviews of all faces in the relation:
<!-- formula-not-decoded -->
This union of the face purviews o ∗ f is also called the 'relation purview' or the 'joint purview' of the relation. While any partition of one or more distinctions from the relation will 'unbind' the set of distinctions d , by the principle of minimal existence, a relation can only be as irreducible as the minimal amount of integrated information specified by any one distinction in the relation. Therefore, the relation integrated information ϕ r ( d ) is defined as
<!-- formula-not-decoded -->
Defining ϕ r in this way guarantees that the integrated information of a relation cannot exceed the integrated information of its weakest distinction. A maximal relation is thus a relation in which the cause and effect of each distinction is fully overlapped by other distinctions in the relation (in that case, ϕ r = min d ∈ d ϕ d ). Note also that a relation satisfies exclusion in that its integrated information is naturally maximized over the maximally congruent overlap o ∗ f for each relation face (48) (taking subsets of these overlaps could only reduce the integrated information of the relation).
In summary, just as distinctions link a cause with an effect, relations bind various combinations of causes and effects that are congruent over the same units, i.e. , constraining those units to be in the same state (Fig. 4). And just as a distinction captures the irreducibility of an individual cause-effect linked by a mechanism, a relation captures the irreducibility of a set of distinctions bound by the joint purviews of their causes and/or effects.
For a set of distinctions D , we define the set of all relations among them as
<!-- formula-not-decoded -->
Fig 4. Composition and causal relations. Relations between distinctions specify joint causes and/or effects. The two distinctions d ( a ) and d ( aB ) each specify their own cause and effect. In this example, their cause and effect purviews overlap over the unit b and are congruent, which means that they all specify b to be in state '-1' . The relation r ( { a, ab } ) thus binds the two distinctions together over the same unit. Relation faces are indicated by the blue lines and surfaces between the distinctions' causes and/or effects. Because all four purviews overlap over the same unit, all nine possible faces exist. Note that the fact that the two distinctions overlap irreducibly can only be captured by a relation and not by a high-order distinction.
<details>
<summary>Image 4 Details</summary>

### Visual Description
## Diagram: Causal Network Representation
### Overview
The image depicts a causal network diagram illustrating relationships between variables 'a', 'b', and 'c'. The left side shows a directed graph with nodes representing these variables and arrows indicating causal influence. The right side presents a 3D geometric representation, likely a simplex, with vertices labeled 'a', 'b', and 'aB'. Color-coding is used to represent different types of relationships: mechanism (pink), cause purview (orange), effect purview (green), and relation (blue). A numerical value, φλ({a, aB}) = 0.035, is provided at the bottom-right.
### Components/Axes
* **Nodes:** 'a', 'B', 'c'. 'a' is represented by a white circle with a red arc. 'B' is represented by a black circle within a larger grey circle, with a red arc. 'c' is represented by a grey circle with a circular arrow indicating self-influence.
* **Edges (Arrows):**
* 'a' -> 'B' (black, thick arrow)
* 'B' -> 'a' (black, thick arrow)
* 'B' -> 'c' (light orange, dashed arrow)
* 'a' -> 'c' (light grey, dashed arrow)
* 'c' -> 'c' (circular, grey arrow)
* **3D Geometric Representation:** A tetrahedron-like shape (simplex) with vertices labeled 'a', 'b', and 'aB'. Edges are colored according to the legend.
* **Legend (Top-Right):**
* Mechanism: Pink
* Cause purview: Orange
* Effect purview: Green
* Relation: Blue
* **Numerical Value:** φλ({a, aB}) = 0.035
### Detailed Analysis or Content Details
* **Node 'a':** Has incoming influence from 'B' and outgoing influence to 'B' and 'c'. The red arc around 'a' suggests a potential feedback loop or internal state.
* **Node 'B':** Has incoming influence from 'a' and outgoing influence to 'a' and 'c'. The grey circle around 'B' might indicate a broader context or latent variable. The red arc around 'B' suggests a potential feedback loop or internal state.
* **Node 'c':** Has incoming influence from 'B' and 'a', and a self-loop indicating self-influence.
* **3D Representation:**
* The edge connecting 'a' to 'aB' is orange (cause purview).
* The edge connecting 'b' to 'aB' is green (effect purview).
* The edge connecting 'a' to 'b' is blue (relation).
* The edge connecting 'aB' to 'b' is orange (cause purview).
* **Numerical Value:** φλ({a, aB}) = 0.035. This likely represents a measure of the strength or probability of a relationship between 'a' and 'aB'. The subscript λ suggests a parameter or context.
### Key Observations
* The diagram highlights a complex interplay between 'a', 'B', and 'c', with bidirectional influence between 'a' and 'B'.
* The 3D representation provides a geometric interpretation of the relationships, potentially visualizing a probability simplex or similar structure.
* The numerical value suggests a quantifiable aspect to the relationship between 'a' and 'aB', but its precise meaning requires further context.
* The dashed lines between 'a' and 'c', and 'B' and 'c' suggest weaker or more indirect influences compared to the solid lines.
### Interpretation
The diagram appears to represent a causal model, potentially within a Bayesian network or similar framework. The nodes 'a', 'B', and 'c' represent variables, and the arrows indicate causal dependencies. The 3D representation likely visualizes the joint probability distribution of these variables, with 'aB' potentially representing a combined or interaction term. The value φλ({a, aB}) = 0.035 could be a measure of conditional dependence or mutual information between 'a' and 'aB' under some parameter λ.
The color-coding provides a nuanced understanding of the relationships:
* **Mechanism (pink):** Represents the direct causal process.
* **Cause purview (orange):** Indicates the scope of influence originating from a cause.
* **Effect purview (green):** Indicates the scope of influence impacting an effect.
* **Relation (blue):** Represents a general association or correlation.
The self-loop on 'c' suggests that 'c' has an inherent dynamic or feedback mechanism. The bidirectional influence between 'a' and 'B' indicates a potential feedback loop or reciprocal causation. The diagram suggests a system where 'a' and 'B' mutually influence each other, and both influence 'c', which in turn influences itself. The low value of φλ({a, aB}) = 0.035 suggests a relatively weak or infrequent relationship between 'a' and 'aB', which could be significant depending on the context of the model.
</details>
In practice, the total number of relations and their ∑ R ( D ) ϕ r can be determined analytically for a given set of distinctions D , which greatly reduces the necessary computations (see A.3). Together, a set of distinctions D and its associated set of relations R ( D ) compose a cause-effect structure.
Fig 5. Structuring of intrinsic effects by relations. (A) A single undifferentiated effect has no relations. (B) Likewise, there are no relations among multiple non-overlapping effects. (C) The set of three first-order effects and one third-order effect supports three relations, which bind the effects together. (D) The set of first, second, and third-order effects supports a large number of relations (10 2-relations (between two effects), 6 3-relations, and 1 4-relation), which bind the effects in a structure that is ordered sequentially.
<details>
<summary>Image 5 Details</summary>

### Visual Description
\n
## Diagram: Effects of Mutations
### Overview
The image presents four diagrams (A, B, C, and D) illustrating different effects of mutations (labeled m1, m2, m3, m4) on phenotypes (labeled a, b, c). Each diagram represents a different relationship between mutations and their resulting effects. The diagrams are arranged in a 2x2 grid.
### Components/Axes
Each diagram consists of mutation nodes (m1, m2, m3, m4) and effect nodes (a, b, c). Lines connect the mutation nodes to the effect nodes, indicating the relationship between them. Each diagram has a title indicating the type of effect being illustrated: "One effect", "Unrelated effects", "Unordered effects", and "Ordered effects".
### Detailed Analysis or Content Details
**Diagram A: One effect**
* Mutation m1 is connected to effects a and b.
* The connection is represented by a single line branching into two.
**Diagram B: Unrelated effects**
* Mutation m2 is connected to effect a.
* Mutation m3 is connected to effect b.
* Mutation m4 is connected to effect c.
* Each mutation has a direct, independent connection to its respective effect.
**Diagram C: Unordered effects**
* Mutation m1 is connected to effects a and b.
* Mutation m2 is connected to effect a.
* Mutation m3 is connected to effect b.
* Mutation m4 is connected to effect c.
* The connections form a tree-like structure with m1 as the root.
**Diagram D: Ordered effects**
* Mutation m1 is connected to effects a, b, and c.
* Mutation m2 is connected to effects a and b.
* Mutation m3 is connected to effects b and c.
* Mutation m4 is connected to effect c.
* The connections form overlapping arcs, indicating an ordered relationship between mutations and effects. The overlapping regions represent combined effects (ab, bc).
### Key Observations
* Diagram A shows a single mutation causing multiple effects.
* Diagram B shows multiple mutations each causing a single, unrelated effect.
* Diagram C shows a hierarchical relationship where one mutation influences multiple effects, and other mutations contribute to those same effects.
* Diagram D shows an ordered relationship where mutations build upon each other to create combined effects.
### Interpretation
The diagrams illustrate different models of how mutations can influence phenotypes. Diagram A represents pleiotropy, where a single gene affects multiple traits. Diagram B represents a simple additive model, where each mutation contributes independently to a trait. Diagram C represents a more complex model where mutations can interact in a hierarchical manner. Diagram D represents epistasis, where the effect of one mutation depends on the presence of other mutations. The diagrams are useful for understanding the genetic basis of complex traits and for predicting the effects of mutations. The diagrams do not contain numerical data, but rather conceptual relationships. They are meant to be illustrative rather than quantitative.
</details>
## Cause-effect structures and Φ -structures
A cause-effect structure is defined as the union of the distinctions specified by a substrate and the relations binding them together:
<!-- formula-not-decoded -->
The cause-effect structure specified by a maximal substrate-a complex-is also called a Φ -structure:
<!-- formula-not-decoded -->
The sum of the values of integrated information of a substrate's distinctions and relations, called Φ ('big phi') corresponds to the structured information of the Φ -structure,
<!-- formula-not-decoded -->
In conclusion, a maximal substrate or complex is a set of units S ∗ = s ∗ that satisfies all of IIT's postulates: it has to have cause-effect power that is intrinsic, specific, irreducible, definite, and structured. By IIT, a complex S ∗ does not exist as such, but exists 'unfolded' into its Φ -structure, with all the causal distinctions and relations that compose it. In other words, a substrate is what can be observed and manipulated 'operationally' from the extrinsic perspective. From the intrinsic perspective, what truly exists is a complex with all its causal powers unfolded-an intrinsic entity that exists for itself, absolutely, rather than relative to an external observer.
According to the explanatory identity of IIT, an experience is identical to the Φ -structure of an intrinsic entity: every property of the experience should be accounted for by a corresponding property of the Φ -structure, with no additional ingredients. If a system S in state s is a complex, then its Φ -structure corresponds to the quality of the experience of S in state s , while its Φ value corresponds to its quantity-in other words, the nature and amount of intrinsic content.
## Results and discussion
In this section, we apply the mathematical framework of IIT 4.0 to several example systems. The goal is to illustrate three critical implications of IIT's postulates:
1. Consciousness and connectivity: how the way units interact determines whether a substrate can support a Φ -structure of high Φ
2. Consciousness and activity: how changes in the activity of a substrate's units change Φ -structures
3. Consciousness and functional equivalence: how substrates that are functionally equivalent may not be equivalent in terms of their Φ -structures, and thus in terms of consciousness
The following examples will feature very simple networks constituted of binary units U i ∈ U with Ω U i = {-1 , 1 } for all U i and a logistic (sigmoidal) activation function
<!-- formula-not-decoded -->
where
<!-- formula-not-decoded -->
In Eq. (58), the parameter k defines the slope of the logistic function and allows one to adjust the amount of noise or determinism in the activation function (higher values signify a steeper slope and thus more determinism). The units U i can thus be viewed as noisy linear threshold units with weighted connections among them.
As in Figs. 1 and 2, units denoted by uppercase letters are in state '1' (ON, depicted in black), units denoted by lowercase letters are in state ' -1' (OFF, depicted in white). Cause-effect structures are illustrated as geometrical shapes projected into 3D space (Fig. 6). Distinctions are depicted as mechanisms (black labels) tying a cause (red labels) and an effect (green labels) through a link (orange edges, thickness indicating ϕ d ). Relation faces of second- and third-degree relations are depicted as blue edges or triangular surfaces between the causes and effects of the related distinctions. While edges always bind pairs of distinctions (a second-degree relation), triangular surfaces may bind the causes and effects of two or three distinctions (second- or third-degree relation). Relations of higher degrees are not depicted.
All examples were computed using the 'iit-4.0' feature branch of PyPhi [46]. This branch is currently under development and will be available in the next official release of the software. An example notebook is available here.
## Consciousness and connectivity
The first set of examples highlights how the organization of connections among units impacts the ability of a substrate to support a cause-effect structure with high structured information (high Φ ). Fig. 6 shows five systems, all in the same state s = Abcdef with the same number of units, but with different connectivity among the units.
<details>
<summary>Image 6 Details</summary>

### Visual Description
\n
## Network Diagrams: Causal Inference Models
### Overview
The image presents five distinct network diagrams (labeled A through E) illustrating causal relationships between variables. Each diagram depicts a network of nodes (labeled a, b, c, d, e, f) connected by directed edges, representing causal influences. Accompanying each diagram are quantitative measures related to the network structure, including probabilities (Φ), distinctions, and weights. The diagrams appear to be exploring the relationship between network structure and causal strength.
### Components/Axes
Each diagram shares the following components:
* **Nodes:** Represent variables (a, b, c, d, e, f).
* **Directed Edges:** Indicate causal relationships (e.g., A -> B means A causes B).
* **Probabilities (Φ):** Represent the strength of causal relationships between pairs of nodes. Labeled as Φ<sub>(XY)</sub>.
* **Distinctions:** A table quantifying the number of mechanisms and effects for different variable combinations.
* **Weights:** A scale indicating the relative importance of the network structure.
* **Φ(XY):** A numerical value representing the causal effect of X on Y.
Diagram E includes a data table with two columns: "bin" and "count".
### Detailed Analysis or Content Details
**Diagram A:**
* Φ<sub>(Ab)</sub> = 0.05
* Φ<sub>(c)</sub> = 0.02
* Φ<sub>(d)</sub> = 0.02
* Φ<sub>(e)</sub> = 0.02
* Distinctions (Ab): 2/2. Mechanism: cause/effect. Values: 0.6342, 0.9575. #Relations (Ab): 2
* Distinctions (c): 1/1. Mechanism: cause/effect. Value: 0.8178. #Relations (c): 1
* Φ(Ab) = 1.38
* Φ(c) = 0.03
**Diagram B:**
* Φ<sub>(Ab)</sub> = 0.53
* Φ<sub>(cd)</sub> = 1.17
* Φ<sub>(ef)</sub> = 1.17
* Distinctions (Ab): 2/2. Mechanism: cause/effect. Values: 0.7526, 0.6635. #Relations (Ab): 2
* Distinctions (cd): 3/3. Mechanism: cause/effect. Values: 0.7551, 0.7551, 0.1604. #Relations (cd): 3
* Φ(Ab) = 2.67
* Φ(cd) = 3.42
* Φ(ef) = 3.42
**Diagram C:**
* Φ<sub>(Abcdef)</sub> = 1.74
* Distinctions (Abcdef): 6/63. Mechanism: cause/effect. Values: 0.9563, 0.9563, 0.9563, 0.9563, 0.9563, 0.9563. #Relations (Abcdef): 4
* Φ(Abcdef) = 7.65
* Φ(Abcdef) = 6746.78
**Diagram D:**
* Φ<sub>(Abcdef)</sub> = 1.09
* Φ<sub>(Abcdef)</sub> = 6463.6
* Distinctions (Abcdef): 4/4. Mechanism: cause/effect. Values: 0.9563, 0.9563, 0.9563, 0.9563. #Relations (Abcdef): 4
* Φ(Abcdef) = 6746.78
* A small text box reads: "bootstrap bias beta=0.001"
**Diagram E:**
* Φ<sub>(Abcdef)</sub> = 2.21
* Distinctions (Abcdef): 7/7. Mechanism: cause/effect. Values: 0.9563, 0.9563, 0.9563, 0.9563, 0.9563, 0.9563, 0.9563. #Relations (Abcdef): 7
* Φ(Abcdef) = 13.14
* Data Table:
| bin | count |
| :---- | :---- |
| 0.00 | 9 |
| 0.02 | 7 |
| 0.04 | 8 |
| 0.06 | 10 |
| 0.08 | 12 |
| 0.10 | 14 |
### Key Observations
* As the network complexity increases (from A to E), the values of Φ<sub>(Abcdef)</sub> and the corresponding Φ values generally increase.
* The "Distinctions" tables consistently show a "cause/effect" mechanism with values close to 1.0.
* Diagram D includes a note about "bootstrap bias," suggesting a statistical consideration in the analysis.
* Diagram E presents a histogram-like distribution of counts across different bins.
### Interpretation
These diagrams appear to be exploring the relationship between network structure and the strength of causal inference. The increasing values of Φ and the associated metrics as the network grows suggest that more complex networks can support stronger causal conclusions. The "Distinctions" tables provide a quantitative breakdown of the causal mechanisms within each network. The bootstrap bias note in Diagram D indicates that the results may be sensitive to the sampling method used. Diagram E's data table likely represents the distribution of causal effect estimates obtained through a bootstrapping procedure, providing insight into the uncertainty associated with the causal inference. The consistent values in the "Distinctions" tables suggest a robust causal mechanism at play, while the increasing Φ values indicate a strengthening of the overall causal effect with network complexity. The diagrams collectively demonstrate a systematic investigation into the interplay between network structure, causal strength, and statistical considerations in causal inference.
</details>
Fig 6. Causal analysis of various network architectures. Each panel shows the network's causal model and weights on the left. Blue regions indicate complexes with their respective ϕ s values. In all networks, k = 4 and the state is Abcdef . The Φ -structure(s) specified by the network's complexes are illustrated below or to the right (with only second- and third-degree relation faces depicted) with a list of their distinctions for smaller systems and their ∑ ϕ values for those systems with many distinctions and relations. All integrated information values are in ibits. (A) A degenerate network in which unit A forms a bottleneck with redundant inputs from and outputs to the remaining units. The first-maximal complex is Ab , which excludes all other subsets with ϕ s > 0 except for the individual units c , d , e , and f . (B) The modular network condenses into three complexes along its fault lines (which exclude all subsets and supersets), each with a maximal ϕ s value, but low Φ , as the modules each specify only two or three distinctions and at most five relations. (C) A directed cycle of six units forms a six-unit complex with ϕ s = 1 . 74 ibits, as no other subset is integrated. However, the Φ -structure of the directed cycle is composed of only first-order distinctions and few relations. (D) A specialized lattice also forms a complex (which excludes all subsets), but specifies 24 first- and high-order distinctions, with many relations ( > 6 ∗ 10 5 ) among them. Its Φ value is 6747 ibits, much larger than the structured information of the complexes that exist in the other networks. (E) A slightly modified version of the specialized lattice in which the first-maximal complex is Abef . The full system is not maximally irreducible and is excluded as a complex, despite its positive ϕ s value (indicated in gray).
## Degenerate systems, indeterminism, and specificity
Fig. 6A shows a network with medium indeterminism ( k = 4) and high degeneracy, due to the fact that unit A forms a 'bottleneck' with inputs and outputs to and from the remaining units. The network condenses into one complex of two units Ab and four complexes corresponding to the individual units c , d , e , and f (also called 'monads').
The causes and effects of the causal distinctions for the two types of complexes are shown in the middle, and the corresponding cause-effect structures are illustrated on the right. In this case, degeneracy (coupled with indeterminism) undermines the ability of the maximal substrate to grow in size, which in turn limits the richness of the Φ -structure that can be supported. Because of the bottleneck architecture, the current state of candidate system Abcdef has many possible causes and effects, leading to an exponential decrease in selectivity (the conditional probabilities of cause and effect states). This dilutes the value of intrinsic information ( ii ) for larger subsets of units, which in turn reduces their value of system integrated information ϕ s . Consequently, the maximal substrates are small, and their Φ values are necessarily low.
This example suggests that to grow and achieve high values of Φ , substrates must be constituted of units that are specialized (low degeneracy) and interact very effectively (low indeterminism). Notably, the organization of the cerebral cortex, widely considered as the likely substrate of human consciousness, is characterized by extraordinary specialization of neural units at all levels [47-49]. Moreover, if the background conditions are well controlled, neurons are thought to interact in a highly reproducible, nearly deterministic manner [50-52].
## Modular systems, fault lines, and irreducibility
Fig. 6B shows a network comprising three weakly interconnected modules, each having two strongly connected units ( k = 4). In this case, the weak inter-module connections are clear fault lines. Properly normalized, partitions along these fault lines separating
modules yield values of ϕ s that are much smaller than those yielded by partitions that cut across modules. As a consequence, the 6-unit system condenses into three complexes ( Ab , cd , and ef ), as determined by their maximal ϕ s values. Again, because the modules are small, their Φ values are low. Intriguingly, a brain region such as the cerebellum, whose anatomical organization is highly modular, does not contribute to consciousness [53,54], even though it contains several times more neurons than the cerebral cortex (and is indirectly connected to it).
Note that fault lines can be due not just to neuroanatomy but also to neurophysiological factors. For example, during early slow-wave sleep, the dense interconnections among neuronal groups in cerebral cortical areas may break down, becoming causally ineffective due to the bistability of neuronal excitability. This bistability, brought about by neuromodulatory changes [55], is associated with the loss of consciousness [56].
## Directed cycles, structural sparseness, and composition
Fig. 6C shows a directed cycle in which six units are unidirectionally connected with weight w = 1 . 0 and k = 4. Each unit copies the state of the unit before it, and its state is copied by the unit after it, with some indeterminism. The copy cycle constitutes a 6-unit complex with a maximal ϕ s = 1 . 74 ibits. However, despite the 'large' substrate, the Φ -structure it specifies has low structured information ( Φ = 7 . 65). This is because the system's Φ -structure is composed exclusively of first-order distinctions, and consequently of a small number of relations.
Highly deterministic directed cycles can easily be extended to constitute large complexes, being more irreducible than any of their subsets. However, the lack of cross-connections ('chords' in graph-theoretic terms) greatly limits the number of components of the Φ -structures the complexes specify, and thus their structured information ( Φ ). (Note also that increasing the number of units that constitute the directed cycle would not change the amount of ϕ s specified by the network as a whole.)
The brain is rich in partially segregated, directed cycles, such as those originating in cortical areas, sequentially reaching stations in the basal ganglia and thalamus, and cycling back to cortex [57,58]. These cycles are critical for carrying out many cognitive and other functions, but they do not appear to contribute directly to experience [7].
## Specialized lattices and Φ -structures with high structured information
Fig. 6D shows a network consisting of six heterogeneously connected units-a 'specialized' lattice, again with k = 4. While many subsystems within the specialized network have positive values of system integrated information ϕ s (not shown), the full 6-unit system is the maximal substrate (excluding all its subsets from being maximal substrates). Out of 63 possible distinctions, the Φ -structure comprises 24 distinctions with causes and effects congruent with the system's maximal cause-effect state. Consequently, the full 6-unit system also specifies a much larger number of causal relations compared to the copy loop system.
Preliminary work (not shown, [59]) indicates that lattices of specialized units, implementing different input-output functions, but partially overlapping in their inputs (receptive field) and outputs (projective fields), are particularly well suited to constituting large substrates that unfold into extraordinarily rich Φ -structures. The number of distinctions specified by an optimally connected, specialized system is bounded above by 2 n -1, and that of the relations among as many distinctions is bounded by 2 (2 n -1) -1.The structured information composing such structures is correspondingly large [60].
In the brain, many portions of posterior cortex appear to constitute similarly organized lattices of specialized units, which makes the posterior cortex and similarly organized areas a plausible candidate for the substrate of human consciousness [7,14,61,62]. Note that directed cycles originating and ending in such lattices typically remain excluded from the first-maximal complex because minimal partitions across such cycles yield a much lower value of ϕ s compared to minimal partitions across large lattices [59].
## Near-maximal substrates, extrinsic entities, and exclusion
Finally, Fig. 6E shows a network of six units, four of which ( Abef ) constitute a specialized lattice that corresponds to the first complex. Though integrated, the full set of 6 units happens to be slightly less irreducible than one of its 4-unit subsets ( ϕ s = 0 . 15). From the extrinsic perspective, the 6-unit system undoubtedly behaves as a highly integrated whole (nearly as much as its 4-unit subset), one that could produce complex input-output functions due to its rich internal structure. From the intrinsic perspective of the system, however, only the 4-unit subset satisfies all the postulates of existence, including maximal irreducibility (accounting for the definite nature of experience). In this example, the remaining units form a second complex with low ϕ s and serve as background conditions for the first complex.
A similar situation may occur in the brain. The brain as a whole is undoubtedly integrated (not to mention that it is integrated with the body as a whole), and neural 'traffic' is heavy throughout. However, its anatomical organization may be such that a subset of brain regions, arranged in pyramids of grids and primarily located in posterior cortex, may achieve a much higher value of integrated information than any other subset. Those regions would then constitute the first complex (the 'main complex,' [7]), and the remaining regions might condense into a large number of much smaller complexes.
Taken together, the examples in Fig. 6 demonstrate that the connectivity among the units of a system has a strong impact on what set of units can constitute a complex and, thus, on the structured information it can specify. The examples also demonstrate the role played by the various requirements that must be satisfied by a substrate of consciousness: existence (causal power), intrinsicality, specificity, irreducibility, maximal irreducibility (exclusion), and composition (structure).
## Consciousness and activity: active, inactive, and inactivated units
A substrate exerts cause-effect power by being in its current state. For the same substrate, changing the state of even one unit may have major consequences on the distinctions and relations that compose its Φ -structure: many may be lost, or gained, and many may change their value of irreducibility ( ϕ d and ϕ r ).
Fig. 7 shows a network of five binary units that interact through excitatory and inhibitory connections (weights indicated in the figure). The system is initially in state s = ABcdE (Fig. 7A) and is a maximal substrate with ϕ s = 1 . 1 ibits and a Φ -structure composed of 21 distinctions and their 4760 relations.
If we change the state of unit E from ON to OFF (in neural terms, the unit becomes inactive), the distinctions that the unit contributes to when ON, as well as the associated relations, may change (Fig. 7B). In the case illustrated by the Figure, what changes are the purviews and irreducibility of several distinctions and associated relations, the number of distinctions and ϕ s change only slightly, while the number of relations is considerably higher, leading to a larger Φ value. In other words, what a single unit contributes to intrinsic existence is not some small 'bit' of information.
Fig 7. Causal analysis of the same system with one of its units set to active, inactive, or inactivated. In all panels, the same causal model and weights are shown on the left, but in different states. For all networks k = 4. The set of distinctions D ( s ), their causes and effects, and their ϕ d values are shown in the middle. The Φ -structure specified by the network's complex is illustrated on the right (again with only secondand third-degree relation faces depicted). All integrated information values are in ibits. (A) The system in state ABcdE is a complex with 21 out of 31 distinctions and Φ = 10 . 79. (B) The same system in state ABcde , where unit E is inactive ('OFF') also forms a complex with a similar number of distinctions, but a somewhat higher Φ value due to a higher number of relations between distinctions. In addition, the system's Φ -structure differs from that in (A), as the system now specifies a different set of compositional causes and effects. (C) If instead of being inactive, unit E is inactivated (permanently fixed into the 'OFF' state), the inactivated unit cannot contribute to the complex or Φ -structure anymore. The complex is now constituted of four units ( ABcd ), with only 14 distinctions and significantly reduced structured information ( Φ = 3 . 31).
<details>
<summary>Image 7 Details</summary>

### Visual Description
\n
## Network Diagram: Dynamical Systems Analysis
### Overview
The image presents three network diagrams (labeled A, B, and C) representing dynamical systems. Each diagram consists of a five-node network with connections indicated by edges. Alongside each network is a table listing "Distinctions (ABcde)" and associated numerical values, and a separate visualization showing the "Φ(ABcde)" value with labeled network components. The diagrams appear to illustrate the influence of active/inactive states on network dynamics.
### Components/Axes
Each diagram shares the following components:
* **Nodes:** Labeled A, B, C, d, and E.
* **Edges:** Representing connections between nodes. Edges are colored blue for active connections and red for inactive connections.
* **Distinctions Table:** A table listing combinations of node states (A, B, c, d, e) and corresponding numerical values. The table has columns labeled "mechanism cause effect" and a numerical value.
* **Φ(ABcde) Visualization:** A representation of the network with labels indicating the contributions of different node combinations to the overall Φ value.
* **k Value:** A numerical value (0.8, 0.05, -0.05, 0.2) is present at the top of each diagram.
* **Φ Value:** A numerical value (1.1, 28.69, 30.21) is present at the bottom of each diagram.
* **Active/Inactive Key:** A key at the top-right indicates that blue represents "active" and red represents "inactive".
### Detailed Analysis or Content Details
**Diagram (A)**
* **Network:** The network has active connections between A-B, B-C, C-d, d-E, and E-A. There is an inactive connection between B-d.
* **Distinctions Table:**
1. A: 0.783749
2. B: 0.898722
3. C: 0.898722
4. d: 0.898722
5. E: 0.783749
6. Ad: 0.817838
7. AE: 0.817838
8. BE: 0.817838
9. ABc: 0.899392
10. ABd: 0.817838
11. ABE: 0.817838
12. AcE: 0.817838
13. AdE: 0.817838
14. BcE: 0.817838
15. BdE: 0.817838
16. cDE: 0.899392
17. ABcDE: 0.817838
18. ABcEd: 0.817838
19. ABdE: 0.897669
20. ABcd: 0.898332
21. ABcde: 0.898781
* **Φ(ABcde):** 10.79. The visualization shows contributions from various node combinations (e.g., ABcde, ABcd, AcE, etc.).
**Diagram (B)**
* **Network:** The network has active connections between A-B, B-C, C-d, d-E, and E-A. There is an inactive connection between B-d.
* **Distinctions Table:**
1. A: 0.783749
2. B: 0.898722
3. C: 0.898722
4. d: 0.898722
5. E: 0.783749
6. Ad: 0.817838
7. AE: 0.817838
8. BE: 0.817838
9. ABc: 0.899392
10. ABd: 0.817838
11. ABE: 0.817838
12. AcE: 0.817838
13. AdE: 0.817838
14. BcE: 0.817838
15. BdE: 0.817838
16. cDE: 0.899392
17. ABcDE: 0.817838
18. ABcEd: 0.817838
19. ABdE: 0.897669
20. ABcd: 0.898332
21. ABcde: 0.898781
* **Φ(ABcde):** 28.69. The visualization shows contributions from various node combinations (e.g., ABcde, ABcd, AcE, etc.).
**Diagram (C)**
* **Network:** The network has active connections between A-B, B-C, C-d, d-E, and E-A. There is an inactive connection between B-d.
* **Distinctions Table:**
1. A: 0.783749
2. B: 0.898722
3. C: 0.898722
4. d: 0.898722
5. E: 0.783749
6. Ad: 0.817838
7. AE: 0.817838
8. BE: 0.817838
9. ABc: 0.899392
10. ABd: 0.817838
11. ABE: 0.817838
12. AcE: 0.817838
13. AdE: 0.817838
14. BcE: 0.817838
15. BdE: 0.817838
16. cDE: 0.899392
17. ABcDE: 0.817838
18. ABcEd: 0.817838
19. ABdE: 0.897669
20. ABcd: 0.898332
21. ABcde: 0.898781
* **Φ(ABcde):** 30.21. The visualization shows contributions from various node combinations (e.g., ABcde, ABcd, AcE, etc.).
### Key Observations
* The "Distinctions" tables are identical across all three diagrams.
* The Φ(ABcde) value increases from Diagram A (10.79) to Diagram B (28.69) to Diagram C (30.21).
* The k values are different for each diagram, suggesting a parameter variation.
* The network topology (active/inactive connections) is the same across all three diagrams.
### Interpretation
The diagrams likely represent a sensitivity analysis of a dynamical system. The identical "Distinctions" tables suggest that the fundamental relationships between node states remain constant. The varying k values and resulting Φ(ABcde) values indicate that the overall system behavior is sensitive to the parameter 'k'. The increasing Φ values suggest that as 'k' changes (0.8 -> 0.05 -> -0.05 -> 0.2), the system's overall dynamic response or complexity increases. The Φ(ABcde) visualization highlights which combinations of node states contribute most to this dynamic response. The consistent network topology suggests that the changes in Φ are not due to structural alterations but rather to the influence of the 'k' parameter on the interactions within the network. The diagrams are demonstrating how a single parameter can dramatically alter the system's behavior, even with a fixed network structure.
</details>
Instead, a unit contributes an entire sub-structure, composed of a very large number of distinctions and relations. The set of distinctions to which a subset of units contributes as a mechanism, either alone or in combination with other units, together with their associated relations, is called a mechanism Φ -fold . With respect to the neural substrate of consciousness in the brain, this means that even a change in the state of a single unit is typically associated with a change in an entire Φ -fold within the overall Φ -structure, with a corresponding change in the structure of the experience.
In Fig. 7, we see what happens if unit E , instead of just turning inactive (OFF) is inactivated (abolishing its cause-effect power because it no longer has any counterfactual states). In this case, all the distinctions and relations to which that unit contributes as a mechanism cease to exist (its mechanism Φ -fold collapses). Moreover, all the distinctions and relations to whose purviews that unit contributes-its purview Φ -fold-also collapse or change. The complex also shrinks because it cannot include that unit. With respect to the neural substrate of consciousness, this means that while an inactive unit contributes to a different experience, an inactivated unit ceases to contribute to experience altogether. The fundamental difference between inactive and inactivated units leads to the following corollary of IIT: unlike a fully inactivated substrate which, as would be suspected, cannot support any experience, an inactive substrate can. If a maximal substrate is in working order and specifies a large Φ -structure, it will support a highly structured experience, such as the experience of empty space [14] or the feeling of 'pure presence' [63].
## Consciousness and functional equivalence: being is not doing
By the intrinsicality postulate, the Φ -structure of a complex depends on the causal interactions between system subsets, not on the system's interaction with its environment. In general, different physical systems with different internal causal structure may perform the same input-output functions.
Fig. 8 shows three simple deterministic systems with binary units (here the 'OFF' state is 0, and 'ON' is 1) that perform the same input-output function, treating the internal dynamics of the system as a black box. The function could be thought of, for example, as an electronic tollbooth 'counting 8 valid coins' (8 times input I = 1) before opening the gate [64]. Each system receives one binary input ( I ) and has one binary output ( O ). The output unit switches 'ON' on a count of eight positive inputs I = 1 (when the global state with label '0' is reached in the cycle), upon which the system resets (Fig. 8A).
In addition to being functionally equivalent in their outward behavior, the three systems share the same internal global dynamics, as their internal states update according to the same global state-transition diagram (Fig. 8B). Given an input I = 1, the system updates its state, cycling through all its 8 global states (labeled 0-7) over 8 updates. For an input of I = 0, the system remains in its present state. Moreover, all three systems are constituted of three binary units whose joint states map one-to-one onto the systems' global state labels (0-7). However, the mapping is different for different systems (Fig. 8C, left). This is because the internal binary update sequence depends on the interactions among the internal units [36,64], which differ in the three cases, as can easily be determined through manipulations and observations.
For consistency in the causal analysis, in all three cases, the global state '0' that activates the output unit if I = 1 is selected such that it corresponds to the binary state 'all OFF' (000), which is followed by 1 := 100 and 2 := 010. Also, the Φ -structure of each system is unfolded in state 1 := 100 in all three cases.
Despite their functional equivalence and equivalent global dynamics, the systems differ in how they condense into complexes and in the cause-effect structures they specify.
Fig 8. Functionally equivalent networks with different Φ -structures. (A) The input-output function realized by three different systems (shown in (C)): a count of eight instances of input I = 1 leads to output O = 1. (B) The global state-transition diagram is also the same for the three systems: if I = 0, the systems will remain in their current global state, labeled as 0-7; if I = 1, the systems will move one state forward, cycling through their global states, and activate the output if S = 0. (C) Three systems constituted of three binary units but differing in how the units are connected and interact. As a consequence, the one-to-one mapping between the 3-bit binary states and the global state labels differ. However, all three systems initially transition from 000 to 100 to 010. Analyzed in state 100, the first system (top) turns out to be a single complex that specifies a Φ -structure with six distinctions and many relations, yielding a high value of Φ . The second system (middle) is also a complex, with the same ϕ s value, but it specifies a Φ -structure with fewer distinctions and relations, yielding a lower value of Φ . Finally, the third system (bottom) is reducible ( ϕ s = 0) and splits into two smaller complexes (entities) with minimal Φ -structures and low Φ .
<details>
<summary>Image 8 Details</summary>

### Visual Description
\n
## Diagram: Complex Systems Analysis - 8-Counter & State Transitions
### Overview
The image presents a complex diagram analyzing an 8-counter system. It combines elements of a state diagram, a truth table, and network visualizations to illustrate state transitions, distinctions between states, and associated probabilities. The diagram is divided into sections labeled (A), (B), and (C), each focusing on a different aspect of the system.
### Components/Axes
The diagram consists of the following components:
* **(A) 8-Counter:** Shows the input and output of an 8-counter system. The input is a repeating binary sequence, and the output is a longer binary sequence.
* **(B) I=1 & I=0 State Diagrams:** Two state diagrams representing the system's behavior when the input is 1 and 0, respectively. Each diagram has 7 states numbered 1-7.
* **(C) Network Visualizations & State Distinctions:** Three network visualizations, each representing a different state configuration (S). Each visualization includes a table listing distinctions between states (labeled 'Distinctions (Abc)'), mechanisms, cause effects, and the number of relations. Each network visualization is accompanied by a probability value (Φ(Abc)).
* **Tables:** Several tables displaying binary values, likely representing state transitions or system outputs.
### Detailed Analysis or Content Details
**Section (A) - 8-Counter:**
* **Input:** The input sequence is approximately ".1011101110111101...".
* **Output:** The output sequence is approximately "...0000000001000000100...".
**Section (B) - I=1 & I=0 State Diagrams:**
* **I=1 Diagram:**
* State 1 transitions to State 2.
* State 2 transitions to State 3.
* State 3 transitions to State 4.
* State 4 transitions to State 5.
* State 5 transitions to State 6.
* State 6 transitions to State 7.
* State 7 transitions to State 1.
* **I=0 Diagram:**
* State 1 transitions to State 2.
* State 2 transitions to State 3.
* State 3 transitions to State 4.
* State 4 transitions to State 5.
* State 5 transitions to State 6.
* State 6 transitions to State 7.
* State 7 transitions to State 1.
**Section (C) - Network Visualizations & State Distinctions:**
* **Top Network Visualization:**
* S: 000
* Distinctions (Abc): 6/7
* Mechanism: cause effect
* Effect: 0.4387, 0.4387, 0.4156, 0.4156, 0.4156, 1.0000
* #Relations (Abc): 60
* Φ(Abc) = 21.01
* **Middle Network Visualization:**
* S: 100
* Distinctions (Abc): 4/7
* Mechanism: cause effect
* Effect: 0.4387, 0.5000, 0.5000, 0.4387
* #Relations (Abc): 9
* Φ(Abc) = 3.64
* **Bottom Network Visualization (Two Sub-Visualizations):**
* **Left Sub-Visualization:**
* S: 010
* Distinctions (A): 1/1
* Mechanism: cause effect
* Effect: 1.0000
* #Relations (A): 1
* Φ(A) = 2.0
* **Right Sub-Visualization:**
* S: 110
* Distinctions (bc): 1/1
* Mechanism: cause effect
* Effect: 0.5000
* #Relations (bc): 1
* Φ(bc) = 2.0
**Tables (IS SO):**
* **Left Table (IS SO):**
| 1 | 0 | 1 | 1 |
|---|---|---|---|
| 1 | 1 | 2 | 0 |
| 1 | 3 | 0 | 0 |
| 1 | 5 | 6 | 0 |
| 1 | 6 | 7 | 0 |
* **Right Table (IS SO):**
| 0 | 0 | 0 | 0 |
|---|---|---|---|
| 0 | 2 | 3 | 0 |
| 0 | 4 | 5 | 0 |
| 0 | 6 | 7 | 0 |
| 0 | 0 | 0 | 0 |
### Key Observations
* The state diagrams in Section (B) are identical for I=1 and I=0, suggesting the system's state transitions are independent of the input value.
* The network visualizations in Section (C) show varying degrees of distinction between states, indicated by the "Distinctions" value.
* The probability values (Φ(Abc), Φ(A), Φ(bc)) differ significantly, suggesting varying levels of certainty or predictability associated with each state configuration.
* The tables labeled "IS SO" appear to represent a mapping between states or inputs and outputs, but the exact relationship is unclear without further context.
### Interpretation
The diagram appears to be analyzing a deterministic 8-counter system, where the state transitions are predictable regardless of the input. The network visualizations and associated probabilities likely represent a probabilistic model of the system's behavior, potentially derived from empirical data or simulations. The "Distinctions" values quantify the degree of difference between states, while the probability values indicate the confidence in the system's behavior within each state. The "IS SO" tables may represent a lookup table for determining the system's output based on its current state.
The high probability values in some of the network visualizations (e.g., Φ(Abc) = 21.01) suggest a strong correlation between the state configuration and the observed behavior. The lower probability values in others (e.g., Φ(Abc) = 3.64) may indicate a more complex or uncertain relationship. The diagram as a whole suggests a sophisticated analysis of a relatively simple system, aiming to uncover underlying patterns and probabilistic relationships. The use of network visualizations and probability values indicates a focus on understanding the system's behavior in terms of information flow and uncertainty.
</details>
As shown in Fig. 8C, the first system forms a 3-unit complex with a relatively rich Φ -structure ( Φ = 21 . 01 ibits). While the second system also forms a 3-unit complex with the same ϕ s = 2 ibits, it specifies a completely different set of distinctions and has much lower structured information ( Φ = 3 . 64 ibits).
Finally, the third system is reducible ( ϕ s = 0 ibits)-in this case, because there are only feed-forward connections from unit A to units B and C -and it condenses into two complexes with small Φ -structures.
These examples illustrate a simple scenario of functional equivalence of three systems characterized by a different architecture. The equivalence is with respect to a simple input-output function, in this case coin counting, which they multiply realize. The systems are also equivalent in terms of their global system dynamics, in the sense that they go through a globally equivalent sequence of internal states. However, because of their different substrates, the three systems specify different cause-effect structures. Therefore, based on the postulates of IIT, they are not phenomenally equivalent. In other words, they are equivalent in what they do extrinsically, but not in what they are intrinsically.
This dissociation between phenomenal and functional equivalence has important
implications. As we have seen, a purely feed-forward system necessarily has ϕ s = 0. Therefore, it cannot support a cause-effect structure and cannot be conscious, whereas systems with a recurrent architecture can. On the other hand, the behavior (input-output function) of any (discrete) recurrent system can also be implemented by a system with a feed-forward architecture [65]. This implies that any behavior performed by a conscious system supported by a recurrent architecture can also be performed by an unconscious system, no matter how complex the behavior is. More generally, digital computers implementing programs capable of artificial general intelligence may in principle be able to emulate any function performed by conscious humans and yet, because of the way they are physically organized, they would do so without experiencing anything, or at least anything resembling, in quantity and quality, what each of us would experience [23].
The examples also show that the overall system dynamics, while often revealing relevant aspects of a system's architecture, typically do not and cannot exhaust the richness of its current cause-effect structure. For example, a system in a fixed point is dynamically 'dead' (and 'does' nothing), but it may be phenomenally quite 'alive,' for example, experiencing 'pure presence' [63]. Of course, the system's causal powers can be fully unfolded, and revealed dynamically, by extensive manipulations and observations of subsets of system units because they are implicitly captured by the system's causal model and ultimately by its transition probability matrix [36].
## Conclusions
IIT attempts to account for the presence and quality of consciousness in physical terms. It starts from the existence of experience, and proceeds by characterizing its essential properties-those that are immediate and irrefutably true of every conceivable experience (axioms). These are then translated into essential properties of physical existence (postulates), the necessary and sufficient conditions that a substrate must satisfy to support an experience-to constitute a complex. 'Physical' is understood purely as cause-effect power, and 'substrate' is meant in purely operational terms-as a set of units that a conscious observer can observe and manipulate.
The postulates can be assessed based purely on a substrate's transition probability matrix, as was illustrated by a few idealized causal models. Thus, a substrate of consciousness must be able to take and make a difference upon itself (existence and intrinsicality), it must be able to specify a cause and an effect state that are highly informative and selective (information), and it must do so in a way that is both irreducible (integration) and definite (exclusion). Finally, it must specify its cause and effect in a structured manner (composition), where the causal powers of its subsets over its subsets compose a cause-effect structure of distinctions and relations-a Φ -structure. Thus, a complex does not exist as such but only 'unfolded' as a Φ -structure-an intrinsic entity that exists for itself, absolutely, rather than relative to an external observer.
As shown above, these requirements constrain what substrates can and cannot support consciousness. Substrates that lack in specificity, due to indeterminism and/or degeneracy, cannot grow to be large complexes. Substrates that are weakly integrated, due to architectural or functional fault lines in their interactions, are less integrated than some of their subsets. Because they are not maximally irreducible, they do not qualify as complexes. This is the case even though they may 'hang together' well enough from an extrinsic perspective (having a respectable value of ϕ s ). Furthermore, even substrates that are maximally integrated may support Φ -structures that are extremely sparse, as in the case of directed cycles. Based on the postulates of IIT, a universal substrate ultimately 'condenses' into a set of disjoint (non-overlapping) complexes, each constituted of a set of macro or micro units.
The physical account of consciousness provided by IIT should be understood as an explanatory identity: every property of an experience should ultimately be accounted for by a property of the cause-effect structure specified by a substrate that satisfies its postulates, with no additional ingredients. The identity is not between two different substances or realms-the phenomenal and the physical-but between intrinsic (subjective) existence and extrinsic (objective) existence. Intrinsic existence is immediate and irrefutable, while extrinsic existence is defined operationally as cause-effect power discovered through observation and manipulation. The primacy of intrinsic existence (of experience) in IIT contrasts with standard attempts at accounting for consciousness as something 'generated by' or 'emerging from' a substrate constituted of matter and energy and following physical laws.
The physical correspondent of an experience is not the substrate as such but the Φ -structure specified by the substrate in its current state. Therefore, minor changes in the substrate state can correspond to major changes in the specified Φ -structure. For example, if the state of a single unit changes, an entire Φ -fold within the Φ -structure will change, and if a single inactive unit is inactivated, its associated Φ -fold will collapse, even though the current state of the substrate appears the same (Fig. 7).
Each experience corresponds to a Φ -structure, not a set of functions. Said otherwise, consciousness is about being, not doing [1,4,36,66]. This means that systems with different architectures may be functionally equivalent-both in terms of global input-output functions and global intrinsic dynamics-but they will not be phenomenally equivalent. For example, a feed-forward system can be functionally equivalent to a recurrent system that constitutes a complex, but feed-forward systems cannot constitute complexes because they do not satisfy maximal irreducibility. Accordingly, artificial systems powered by super-intelligent computer programs would experience nothing (or nearly nothing) because they have the wrong kind of physical architecture, even though they may be behaviorally indistinguishable from human beings [23].
Even though the entire framework of IIT is based on just a few axioms and postulates, it is not possible in practice to exhaustively apply the postulates to unfold the cause-effect power of realistic systems [40,67]. It is not feasible to perform all possible observations and manipulations to fully characterize a universal TPM, or to perform all calculations on the TPM that would be necessary to condense it exhaustively into complexes and unfold their cause-effect power in full. The number of possible systems, of system partitions, of candidate distinctions-each with their partitions and relations-is the result of multiple, nested combinatorial explosions. Moreover, these observations, manipulations, and calculations would need to be repeated at many different grains, with many rounds of maximizations. For these reasons, a full analysis of complexes and their cause-effect structure can only be performed on idealized systems of a few units [46].
On the other hand, we can simplify the computation considerably by using various assumptions and approximations, as with the 'cut one' approximation described in [46]. Also, while the number of relations vastly exceeds the number of units and of distinctions (its upper bound for a system of n units is 2 (2 n -1) -1), it can be determined analytically, and so can ∑ ϕ r for a given set of distinctions A.3. Developing tight approximations, as well as bounded estimates of a system's integrated information Φ ( s ), is one of the main areas of ongoing research related to IIT [60].
Despite the infeasibility of an exhaustive calculation of the relevant quantities and structures for a realistic system, IIT already provides considerable explanatory and predictive power in many real-world situations, making it eminently testable [7,68,69]. A fundamental prediction is that Φ should be high in conscious states, such as wakefulness and dreaming, and low in unconscious states, such as dreamless sleep and
anesthesia. This prediction has already found substantial support in human studies that have applied a crude approximation of Φ to successfully classify subjects as conscious vs. unconscious [7,25,26,70]. IIT can also account mechanistically for the loss of consciousness in deep sleep and anesthesia [7,56]. Furthermore, it can provide a principled account of why certain portions of the brain may constitute an ideal substrate of consciousness and others may not, why the borders of the main complex in the brain should be where they are, and why the units of the complex should have a particular grain (the one that yields a maximum of ϕ s ). A stringent prediction is that the location of the main complex, as determined by the overall maximum of ϕ s within the brain, should correspond to its location as determined through clinical and experimental evidence. Another prediction that follows from first principles is that constituents of the main complex can support conscious contents even if they are mostly inactive, but not if they are inactivated [7,14]. Yet another prediction is that the complete inactivation of constituents of the main complex should lead to absolute agnosia (unawareness that anything is missing).
IIT further predicts that the quality of experience should be accounted for by the way the Φ -structure is composed, which in turn depends on the architecture of the substrate specifying it. This was demonstrated in a recent paper showing how the fundamental properties of spatial experiences-those that make space feel 'extended'-can be accounted for by those of Φ -structures specified by 2D grids of units, such as those found in much of posterior cortex [14]. This prediction is in line with neurological evidence of their role in supporting the experience of space [14]. Ongoing work aims at accounting for the quality of experienced time [28] and that of experienced objects [29]. A related prediction is that changes in the strength of connections within the neural substrate of consciousness should be associated with changes in experience, even if neural activity does not change [71]. Also, similarities and dissimilarities in the structure of experience should be accounted for by similarities and dissimilarities among Φ -structures and Φ -folds specified by the neural substrate of consciousness. Further validation of IIT will depend on a systematic back-and-forth between phenomenology, theoretical inferences, and neuroscientific evidence [1].
In addition to empirical work aimed at validating the theory, much remains to be done at the theoretical level. According to IIT, all meaning is intrinsic-whether that of spatial extendedness, of temporal flow, or the feeling of an object that binds a general concept with particular features. Every conscious meaning can be considered as identical to a sub-structure within a current Φ -structure, whether it is triggered by extrinsic inputs or it occurs spontaneously during a dream. Ongoing work aims at providing a self-consistent explanation of how intrinsic meanings-whether those of spatial extendedness, of temporal flow, or of objects, to name but a few-can capture relevant features of causal processes in the environment [72]. It will also be important to explain how intersubjectively validated knowledge can be obtained despite the intrinsic and partially idiosyncratic nature of meaning.
To the extent that the theory is validated through empirical evidence obtained from the human brain, IIT can then offer a plausible inferential basis for addressing several questions that depend on an explicit theory of consciousness. As indicated in the section on phenomenal and functional equivalence, and argued in ongoing work [23], one consequence of IIT is that typical computer architectures are not suitable for supporting consciousness, no matter whether their behavior may resemble ours. By the same token, it can be inferred from IIT that animal species that may look and behave quite differently from us may be highly conscious, as long as their brains have a compatible architecture. Other inferences concern our own experience and whether it plays a causal role, or is simply 'along for the ride' while our brain performs its functions. As recently argued, IIT implies that we have true free will-that we have alternatives, make
decisions, and cause-and that we, rather than our neurons, are responsible for our actions and their consequences [21].
Finally, an ontology that is grounded in experience as intrinsic existence-an intrinsic ontology-must not only provide an account of subjective existence in objective, operational terms, but also offer a path toward a unified view of nature-of all that exists and happens. One step in this direction is the application of the same postulates that define causal powers (existence) to the evaluation of actual causes and effects ('what caused what' [13]). Another is to unify classical accounts of information (as communication and storage of signals) with IIT's notion of information as derived from the properties of experience-that is, information as causal, intrinsic, specific, maximally irreducible, and structured (meaningful) [11,73]. Yet another is the study of the evolution of a substrate's causal powers as conditional probabilities that update themselves [74].
Even so, there are many ways in which IIT may turn out to be inadequate or wrong. Are some of its assumptions, including those of a discrete, finite set of 'atomic' units of cause-effect power, incompatible with current physics [40,75] (but see [76-79])? Are its axiomatic basis and the translation of axioms into postulates sound and unique? And, most critically, can IIT survive the results of empirical investigations assessing the relationship between the quantity and quality of consciousness and its substrate in the brain?
## A Appendix
## A.1 Ties
The information postulate requires that a system's cause-effect power is specific: the system in its current state must select a specific cause-effect state for its units. Likewise, mechanisms within the system must select a specific cause and effect state over their purviews. The exclusion postulate requires that a complex must be constituted of a definite set of units, and that mechanisms within the complex specify a definite cause and effect. By the principle of maximal existence, cause-effect states, complexes, and the cause-effect states of a mechanism within the system are identified as those with maximal cause-effect power.
However, for systems with built-in symmetries in their architecture or the input-output functions of their units, multiple sets of units or cause-effect states may 'tie' for maximal intrinsic information or cause-effect power [15,41,45,80]. Here we outline how those ties should be resolved in line with IIT's postulates and principles.
Maximal substrates can be identified using an iterative algorithm (24). A maximal substrate excludes all overlapping systems with lower ϕ s from existing as complexes. If overlapping systems tie for max S ⊆ U k ϕ s ( T s , s ) those systems do not comply with the exclusion postulate and we choose the next best system that is unique (see [16] for details, and also [41] for an argument to exclude such ties).
For a system, the maximal cause-effect state s ′ = { s ′ c , s ′ e } is the one that maximizes the system's intrinsic cause and effect information. If multiple states comply with equation (11), we select the one for which the system specifies the maximal integrated information ϕ s ( T S , s, θ ′ ) (20) over its minimum partition θ ′ . Remaining ties in the system state with the same ϕ s do not matter for system selection, but need to be resolved in order to determine the system's cause-effect structure. By the maximum existence principle, we choose the cause-effect state that maximizes the system's structured information Φ . In general, any remaining ties in Φ in highly symmetrical systems specify cause-effect structures that would be identical from the intrinsic
perspective, in which case the tie would be extrinsic and not a violation of the information postulate.
The cause or effect state of a mechanism within the system for a candidate purview is first selected based on its intrinsic information ii ( m,z ) (34). Next, we compare the integrated information ϕ ( m,Z = z ′ ) (40) of all maximal cause or effect states across all possible purviews (including all possible ties in ii ( m,z ) within a candidate purview) to identify the maximally irreducible cause or effect z ∗ c/e of the mechanism within the system (43). By the maximum existence postulate, potential ties in max ϕ d ( m,Z = z ′ ) and thus in the cause-effect state z ∗ c/e of a distinction may be resolved at the level of the cause-effect structure, by selecting the z ∗ c/e that maximizes the system's structured information Φ . Accordingly, in case of state ties within the same purview, we select the state that is congruent with the system's cause-effect state s ′ . In case of ties across different purviews, the maximal cause-effect state will generally correspond to the one that supports the most relations with other distinctions, which typically favors larger purviews.
## A.2 Comparison to IIT 1.0-3.0 and subsequent publications
As highlighted in the main text, IIT is a work in progress. While the core theory has remained the same, its formal framework has been progressively refined and extended [8-11]. Compared to prior versions (IIT 1.0 [8,9], IIT 2.0 [10,12,81], and IIT 3.0 [11,46,77,82]), IIT 4.0 presents a more complete, self-consistent formulation. The most notable advances in IIT 4.0 include the introduction of an Intrinsic Difference (ID) measure [15,17] that is uniquely consistent with IIT's postulates, the explicit assessment of causal relations [14], and a more exact translation of the axioms into postulates. Because IIT 3.0 already included a comparison to IIT 1.0 and 2.0 (see [11], Supporting Information Text S1), we will mainly focus on subsequent developments.
## Axioms and postulates
While the starting point of IIT has always been phenomenology, the axioms and postulates of the theory were first explicitly presented in IIT 3.0 [7,11]. The updated 4.0 exposition of IIT's axioms explicitly separates phenomenal existence, which is not a property, from intrinsicality, which is one of the essential properties of phenomenal existence. Accordingly, the existence of experience is introduced as IIT's foundational, zeroth axiom. The remaining five axioms (intrinsicality, information, integration, exclusion, and composition) capture the essential properties that are immediate and irrefutably true of every conceivable experience.
Compared to IIT 3.0, the formulation of the axioms has been refined to avoid misunderstandings [83,84] and to highlight their immediacy and irrefutability. The formulation of IIT's postulates has been updated accordingly with the objective of tracking the phenomenal axioms as closely as possible. For example, conforming more closely to the information axiom, the information postulate requires that the system must select a specific cause-effect state over its units. The composition axiom now highlights both phenomenal distinctions and their relations. By the composition postulate, phenomenal distinctions and relations are accounted for in physical terms by causal distinctions and relations. Because only an experience that exists intrinsically, in a way that is specific, irreducible, and definite can also be structured, composition takes the final position in the ordering of the axioms and postulates.
## Identifying maximal substrates
The IIT 4.0 formalism to identify maximal substrates was first described in detail in [16]. Maximal substrates, or complexes, are identified based on their system integrated information ϕ s (as in IIT 2.0 but unlike IIT 3.0, which evaluated integration after composition). System partitions remain directional (as in IIT 3.0). In IIT 4.0, the minimum partition (MIP) is identified as the partition with minimal integrated information ( ϕ s ), normalized by the maximal possible value of ϕ s across this partition for an arbitrary TPM of the same dimensions (22). In this way, the MIP is sensitive to the fault lines of a candidate system, rather than defaulting to partitions of individual system units. The IIT 4.0 analysis is state-dependent (as in IIT 2.0 and 3.0) and requires positive cause and effect power for a system to exist (as in IIT 1.0 and 3.0).
## Measuring intrinsic information
Supplanting prior measures such as the Kullback-Leibler divergence (IIT 2.0 [10]), or the (extended) Earth Mover's Distance (EMD, IIT 3.0, [11]), the IIT 4.0 formalism features a newly developed Intrinsic Difference (ID) measure [17], which uniquely complies with the postulates of IIT. Accordingly, the intrinsic information specified by a system (or mechanism) over a cause or effect state is evaluated as a product of informativeness and selectivity, which makes it subadditive if p < 1, that is, if cause-effect power is spread over more than one state. As intrinsic information is evaluated over specific cause or effect states, its maximal value over a state distribution identifies the specific cause or effect state selected by the system (or mechanism), in line with the information postulate.
## Causal distinctions
Distinctions capture how the various system subsets specify system subsets as their cause and effect within the system. In IIT 3.0, distinctions were called 'concepts' (which composed to a 'conceptual' structure), a term that could generate unnecessary misunderstandings [14]. An updated formalism to identify causal distinctions was first presented in [15]. As in IIT 3.0, causes and effects must be specified in a way that is irreducible ( ϕ d > 0). Unlike IIT 3.0, in IIT 4.0 distinctions select a specific state as a cause and an effect. The definition of cause and effect probabilities π ( z | m ) ((31) and (29)) remains unchanged, but is now presented more formally in terms of product probabilities, rather than referring to 'virtual elements' [13,45]). In IIT 4.0, a mechanism selects a specific cause and effect state based on the Intrinsic Difference (ID) measure introduced in [17] (see above). The set of permissible partitions θ ∈ Θ( M,Z ) (36) has also been updated [13,15] to ensure that partitions are always 'disintegrating' the mechanism.
The present formulation includes several updates compared to [15]. First, the cause-effect state z ′ is selected based on the intrinsic information ii e ( m,Z ) (34), before evaluating its integrated information ϕ ( m,Z = z ′ ) (40). This is because the cause/effect of m should be determined by the mechanism as a whole, independent of how it can be partitioned. Second, we evaluate intrinsic information without the absolute value, as in [17], because, to comply with the existence postulate, a mechanism's cause state should be one that would increase the probability of its current state, and its effect state one whose probability would be increased by the mechanism being in its current state. Third, to correctly capture this increase in probability on the cause side, the informativeness term is expressed in terms of effect probabilities also on the cause side for ii c and ϕ c , evaluating the increase in probability of the current state due to the cause state. Fourth, we have updated the resolution of ties at the level of distinctions according to the principle of maximal existence (see A.1). Finally, a candidate
distinction only contributes to the system's cause-effect structure if its maximal cause-effect state z ∗ is congruent with the maximal cause-effect state s ′ of the system.
## Relations
Relations bind together a set of causal distinctions over a congruent overlap between their causes and/or effects. Developing an explicit account of phenomenal relations in terms of causal relations was a main goal of IIT since IIT 1.0. IIT 3.0 employed a distance metric-the Earth Mover's Distance-that was sensitive to whether different distinctions ('concepts') had similar cause-effect repertoires, but relations did not figure explicitly in the formalism despite their central role in characterizing experience. An explicit account was first described in [14]. The IIT 4.0 formalism further distinguishes between relations (which bind a set of distinctions with overlapping causes and/or effects) and the faces of a relation (which specify the maximal overlap of a set of purviews and jointly characterize the type of the relation). Moreover, the amount of information specified by a distinction over the overlap and the way relation partitions are assessed differs from the original account [14]. Because distinctions are irreducible components within the cause-effect structure upon which relations are built, a distinction involved in a relation contributes its entire ϕ d , weighted by the extent of its joint overlap (53). For this reason, we do not recompute the irreducible information of the mechanism m of a distinction d ( m,z ∗ , ϕ d ) over the candidate overlap o . In [14], distinctions contributing to a relation were partitioned by 'noising' the interactions among distinction units. (For the same reason, once the system is 'unfolded' and its candidate distinctions are established, the probabilities of z ∗ ∈ z are no longer referenced).
## Φ -structures
In IIT 4.0, a system is a substrate of consciousness (a complex) if it corresponds to a maximum of system integrated information ϕ s , as determined through information, integration, and exclusion. This is similar to IIT 2.0 [10], although in that case only causes (and not effects) were evaluated. The quality of the experience is identical to the Φ -structure of distinctions and relations unfolded from the complex. The quantity of experience corresponds to the Φ value, the sum of the integrated information of the distinctions and relations that compose the Φ -structure. In IIT 3.0, the determination of the complex through information, integration, and exclusion took into account its compositional structure, although without explicit relations. However, the Φ value corresponding to the quantity of consciousness only captured the distinctions affected by the minimum partition, as opposed to all the distinction (and relations) unfolded from a maximally irreducible substrate.
## A.3 Analytical solution for ∑ ϕ r and the number of causal relations
Here, we show how the sum of the relation integrated information over all the causal relations ( ∑ ϕ r ) and the number of relations can be computed without assessing the relations individually. We only need the set of causal distinctions:
<!-- formula-not-decoded -->
<!-- formula-not-decoded -->
## Analytical computation of ∑ ϕ r
Given a subset of distinctions d ⊆ D ( T S , s ) with | d | ≥ 2, any subset z of purviews that contains either the cause, or effect, or both the cause and effect of each distinction d ∈ d and overlap congruently defines a relation face f with face overlap o ∗ f = ⋂ z ∈ z z . The relation overlap is further defined as the union of the face overlaps ⋃ f ∈ f ( d ) o ∗ f , where f ( d ) represents the set of all the faces over the distinction set d . Here, intersection and union take into account both the units and their states.
First, we can show:
<!-- formula-not-decoded -->
by proving any unit n in ⋃ f ∈ f ( d ) o ∗ f is in ⋂ d ∈ d ( z ∗ c ( d ) ∪ z ∗ e ( d ) ) and vice versa:
<!-- formula-not-decoded -->
<!-- formula-not-decoded -->
This helps us to rewrite the relation integrated information of a set of distinctions d ⊆ D ( T S , s ) with | d | ≥ 2 as:
<!-- formula-not-decoded -->
We further define the set of z ∗ c ( d ) ∪ z ∗ e ( d ) of all distinctions in D and their corresponding distinction integrated information as:
<!-- formula-not-decoded -->
Now, given a single node n in a specific state, we can find all the distinctions that contain n in that state in their cause, or effect, or both purviews as:
<!-- formula-not-decoded -->
Any subset of Z ( n ) of size 2 or larger defines a relation whose overlap contains at least n . Formally, for r ⊆ Z ( n ), | r | ≥ 2, there exists a relation with relation purview ⋂ ( z d ,ϕ d ) ∈ r z d and integrated information value of:
<!-- formula-not-decoded -->
Note that, by definition of Z ( n ) and Z ( T S , s ), z d is the union of cause and effect purviews. Using the definition of Z ( T S , s ) and Z ( n ), we can write the sum of the integrated information of relations, except self-relations, as
<!-- formula-not-decoded -->
By factoring the sum over r ⊆ Z ( T S , s ) into two sums over the nodes n and the relations whose purview contains at least n , r ⊆ Z ( n ) , | r | ≥ 2, we are overcounting each
relation by a factor of its joint purview size ∣ ∣ ∣ ⋂ ( z d ,ϕ d ) ∈ r z d ∣ ∣ ∣ . For example, if a set of distinctions make up a relation r over two units n 1 and n 2 , they all are members of both Z ( n 1 ) and Z ( n 2 ). Therefore, r ⊆ Z ( n 1 ) and r ⊆ Z ( n 2 ). This simplifies the summand to just min ( z d ,ϕ d ) ∈ r ϕ d | z d | . To compute the inner sum, we can sort the distinctions in Z ( n ) by their ϕ | z | value in a non-decreasing order, such that ( z (1) , ϕ (1) ) has the smallest ϕ | z | ratio, ( z (2) , ϕ (2) ) has the second smallest ϕ | z | ratio, and so on. Then, we can compute the sum as:
<!-- formula-not-decoded -->
In words, any subset r ⊆ Z ( n ) , | r | ≥ 2, that contains ( z (1) , ϕ (1) ) will have min ( z d ,ϕ d ) ∈ r ϕ d | z d | = ϕ (1) | z (1) | . There are 2 |Z ( n ) |-1 -1 of such subsets. Similarly, there are 2 |Z ( n ) |-2 -1 subsets that contain ( z (2) , ϕ (2) ), but not ( z (1) , ϕ (1) ), etc. This helps us arrive at our final results:
<!-- formula-not-decoded -->
This gives us the sum of the relation integrated information of all the relations, except the self-relations, i.e. | r | = 1. The self-relations can be assessed individually without combinatorial explosion.
## Analytical count of the number of relations
We can also count all the causal relations among all the distinctions in D ( T S , s ) by generalizing the definition of Z ( n ) in (60) to all the subsets o ⊆ s ′ c ∪ s ′ e :
<!-- formula-not-decoded -->
For each distinction d ∈ D ( T S , s ), there is a corresponding element ( ( z ∗ c ( d ) ∪ z ∗ e ( d ) , ϕ ( d ) ) in Z ( o ) if o ⊆ z ∗ c ( d ) ∪ z ∗ e ( d ). Any subset of Z ( o ) of size 2 or larger defines a relation whose overlap contains at least o . The number of such subsets is:
<!-- formula-not-decoded -->
We can count all the relations by applying the inclusion-exclusion principle as:
<!-- formula-not-decoded -->
This is the number of all the causal relations among the causal distinctions in D ( T S , s ), except the self-relations. Again, the self-relations can be counted individually without combinatorial explosion.
## A.4 IIT Algorithm
Let the physical substrate U be a stochastic system of interacting units {U1, U2,..., Un} Let u be its current state, and u → u an update within state space Su =II u; Let Tu = p(u |do(u) = IIp(u |do(u) be its interventional transition probability function i=1 Let theinterventionaldistributionbeuniform
<details>
<summary>Image 9 Details</summary>

### Visual Description
\n
## Mathematical Document: Integrated Information Theory Equations
### Overview
The image presents a dense block of mathematical equations and text related to Integrated Information Theory (IIT). It outlines a series of computations for determining the integrated information of a system, broken down into sections for candidate systems, cause-effect relationships, and mechanisms/purviews. The document appears to be a formal description of the IIT framework.
### Components/Axes
There are no axes or charts in this image. It consists entirely of text and mathematical formulas. The document is structured into sections, each addressing a specific aspect of the IIT calculation. Key terms include:
* SCU (candidate system)
* w (state of background units)
* Ts (transition state)
* p(s) (probability of state s)
* Ωs (set of states)
* ii (intrinsic information)
* s'(Ts, s) (maximal cause/effect state)
* θ (directional system partition)
* Φs (integrated information)
* M (mechanism)
* Z (purview)
* PSC (Perturbational State Complex)
### Detailed Analysis or Content Details
The document can be broken down into the following sections and equations:
**1. For each candidate system SCU in state s:**
* Fix the state w of background units W = U \ S: Ts = Σ w p(s, w) = p(s | s)
* Compute the unconstrained probability pₒ(s) = [Ωs⁻¹ p(s | s)]
* Compute the probability p(s | s) = p(s | s) [Ωs⁻¹]
**2. For each candidate cause state g:**
* Compute intrinsic cause information: iiₒ(s, g) = p(s | g) * log [p(s | g) / p(s)]
* Find the maximal cause state: s'(Ts, s) = argmax iiₒ(s, g)
**3. For each candidate effect state g:**
* Compute intrinsic effect information: iiₒ(s, g) = p(g | s) * log [p(g | s) / p(g)]
* Find the maximal effect state: s'(Ts, s) = argmax iiₒ(s, g)
**4. For each directional system partition θ:**
* Compute the partitioned transition probability function Tsθ
* Compute the integrated cause information: Φc(Ts, s, θ) = p(s | θ) * log [p(s | θ) / p(s)]
* Compute the integrated effect information: Φe(Ts, s, θ) = p(s | θ) * log [p(s | θ) / p(s)]
* (Candidate) system integrated information: Φs(Ts, s, θ) = min(Φc(Ts, s, θ), Φe(Ts, s, θ))
* Find the minimum partition (MIP) θ' = argmin θ∈Θ max(Φc(Ts, s, θ), Φe(Ts, s, θ))
* Identify system integrated information: Φs(Ts, s) = Φs(Ts, s, θ')
* Find the first complex S' = argmax Φs(Ts, s). This is the PSC
*(In principle, not only sets of units, but also the grain of units, updates, and states should be considered)*
**5. Unfold the cause-effect structure of the complex:**
**6. For each candidate mechanism M ⊆ S* in state m:**
* For each candidate purview Z ⊆ S*:
* Compute the probability over M, marginalizing out external influences y ⊆ Z\
* p(z | m) = [|M|⁻¹ Σ y∈Z\M p(z, y | m)]
* Compute the unconstrained effect probability:
* pₒ(z | m) = [|Z|⁻¹ Σ y∈Z p(z, y | m)]
* Compute the information:
* ii(z | m) = p(z | m) * log [p(z | m) / pₒ(z | m)]
* Find the maximal purview: Z' = argmax ii(z | m)
**7. For each candidate purview Z ⊆ S*:**
* Compute the probability over Z, marginalizing out external influences y ⊆ Z\
* p(z | m) = [|Z|⁻¹ Σ y∈Z p(z, y | m)]
* Compute the unconstrained effect probability:
* pₒ(z | m) = [|Z|⁻¹ Σ y∈Z p(z, y | m)]
* Compute the information:
* ii(z | m) = p(z | m) * log [p(z | m) / pₒ(z | m)]
* Find the maximal purview: Z' = argmax ii(z | m)
### Key Observations
The document presents a highly formalized and mathematical description of a complex theory. The repeated use of probability, logarithms, and maximization/minimization operations suggests a rigorous attempt to quantify information integration. The distinction between cause and effect, and the concept of partitioning the system, are central to the framework. The notation is dense and requires a strong background in probability theory and information theory to fully understand.
### Interpretation
This document outlines the computational steps involved in calculating integrated information, a core concept in Integrated Information Theory (IIT). IIT proposes that consciousness is fundamentally related to the amount of integrated information a system possesses. The equations describe how to determine the intrinsic information of a system, its cause-effect relationships, and ultimately, its level of consciousness. The document is not presenting empirical data, but rather a theoretical framework and its mathematical implementation. The emphasis on probabilities and marginalization suggests that IIT attempts to account for the uncertainty and complexity inherent in real-world systems. The final sections dealing with mechanisms and purviews suggest an attempt to identify the specific components and interactions that contribute most to a system's integrated information. The note about considering "grain of units, updates, and states" indicates the theory is sensitive to the level of abstraction at which the system is analyzed. The document is a foundational piece for anyone seeking to understand or implement IIT.
</details>
<details>
<summary>Image 10 Details</summary>

### Visual Description
\n
## Diagram: Conceptual Representation of Stages
### Overview
The image presents a diagram consisting of four vertically stacked, curved lines, each labeled with a conceptual stage: "exclusion", "integration", "information", "intrinsicality", and "existence". Braces connect the lines, visually grouping them. The diagram appears to represent a progression or hierarchy of concepts. There is no quantitative data present.
### Components/Axes
The diagram consists of the following labeled components:
* **exclusion:** Located at the bottom of the diagram.
* **integration:** Positioned above "exclusion".
* **information:** Positioned above "integration".
* **intrinsicality:** Positioned above "information".
* **existence:** Positioned at the top of the diagram.
Braces are used to visually group "exclusion" and "integration", "integration" and "information", "information" and "intrinsicality", and "intrinsicality" and "existence".
### Detailed Analysis or Content Details
The diagram is purely conceptual. There are no numerical values, axes, or scales. The lines are simple curves, and their lengths are roughly equal, though there is some variation. The lines are black and appear to be of uniform thickness. The braces are also black.
### Key Observations
The diagram suggests a hierarchical relationship between the labeled concepts. The order of the labels implies a progression from "exclusion" to "existence". The grouping via braces suggests that "exclusion" and "integration" are related, as are "integration" and "information", and so on.
### Interpretation
The diagram likely represents a philosophical or theoretical model. The progression from "exclusion" to "existence" could represent a process of development, emergence, or becoming. "Exclusion" might represent the initial state of separation or differentiation, while "integration" represents the bringing together of elements. "Information" could represent the content or structure that arises from integration, "intrinsicality" the inherent qualities, and "existence" the final state of being.
The diagram is abstract and open to interpretation. Without further context, it is difficult to determine the specific meaning of the concepts or the model being represented. It could be related to systems theory, information theory, or a philosophical framework dealing with being and becoming. The diagram's simplicity suggests it is intended as a high-level conceptual overview rather than a detailed technical specification.
</details>
<details>
<summary>Image 11 Details</summary>

### Visual Description
\n
## Diagram: Causal Inference Process
### Overview
The image presents a diagram outlining a process for causal inference, broken down into four main stages: prediction, composition, association, and intervention. Each stage is further divided into several steps, described as mathematical operations and logical flows. The diagram is structured as two parallel columns, likely representing a forward and reverse perspective of the same process. A color-coded vertical bar on the right side visually categorizes the stages.
### Components/Axes
The diagram consists of a series of text blocks arranged in a vertical flow. There are no explicit axes or scales. The stages are labeled as follows (from top to bottom):
1. **Prediction** (light blue)
2. **Composition** (light green)
3. **Association** (light orange)
4. **Intervention** (light red)
Each stage contains several steps, each described with mathematical notation. The diagram is split into two columns, with identical stages mirrored on each side.
### Detailed Analysis or Content Details
**Left Column:**
* **Prediction:**
* Compute the intrinsic cause information: `i(m, z) = -π(z|m) log(π(z|m))`
* Find the maximal cause state: `z*(m, Z) = argmax_{z ∈ Z} i(m, z)`
* **Composition:**
* Compute the partitioned effect probability: `π*(z|m) = Π_{i=1}^d π(z_i|m)`
* Compute the integrated cause information: `φ(m, z) = -π*(z|m) log(π*(z|m))`
* **Association:**
* Find the minimum partition (MIP): `θ* = argmin_{θ ∈ Θ} Σ_{i=1}^d π(z_i|m, θ)`
* Identify integrated information: `φ(m, z, θ*)`
* **Intervention:**
* Find the maximally irreducible cause: `z*(m) = argmax_{z ∈ Z} φ(m, z)`
* Compute the relational information: `Γ(m, z*) = min_{m’ ∈ M} φ(m’, z*)`
* Find the total interventional effect: `δ(m, z*) = φ(m, z*) - Γ(m, z*)`
**Right Column:**
* **Prediction:**
* Compute the intrinsic effect information: `ii(m, z) = -π(z|m) log(π(z|m))`
* Find the maximal effect state: `z*(m, Z) = argmax_{z ∈ Z} ii(m, z)`
* **Composition:**
* Compute the partitioned effect probability: `π*(z|m) = Π_{i=1}^d π(z_i|m)`
* Compute the integrated effect information: `φ(m, z) = -π*(z|m) log(π*(z|m))`
* **Association:**
* Find the minimum partition (MIP): `θ* = argmin_{θ ∈ Θ} Σ_{i=1}^d π(z_i|m, θ)`
* Identify integrated information: `φ(m, z, θ*)`
* **Intervention:**
* Find the maximally irreducible effect: `z*(m) = argmax_{z ∈ Z} φ(m, z)`
* Compute the maximal overlap: `η = max_{z ∈ Z} φ(m, z)`
* Compute the total interventional effect: `δ(m, z*) = φ(m, z*) - η`
**Footer:**
* The diagram includes the text: "The causal model: M = (M, π, Z)"
* The text "For each set of causes and/or effects z such that:" is present.
* The text "Compute the set of congruent distinctions D*(T) = {d : p_i(d) > 0, Σ_{i=1}^d < δ}" is present.
* The text "For each candidate set of distinctions d ∈ D(T)s, for each:" is present.
* The text "Maximize Γ(m, z*) for z* ∈ Z" is present.
### Key Observations
The diagram presents a highly mathematical and abstract process. The mirrored structure suggests a duality or inverse relationship between cause and effect. The use of mathematical notation (log, argmax, summation) indicates a quantitative approach to causal inference. The color-coding helps to visually separate the different stages of the process. The footer text suggests a formal definition of the causal model and a process for identifying congruent distinctions.
### Interpretation
This diagram outlines a formal framework for causal inference, likely rooted in information theory or statistical mechanics. The process appears to aim at identifying the "maximally irreducible" cause or effect, and quantifying the interventional effect of manipulating a cause. The use of partitioned probabilities and integrated information suggests an attempt to decompose complex systems into simpler causal components. The mirrored structure implies that the same mathematical principles apply to both identifying causes and predicting effects. The diagram is not about a specific domain, but rather a general methodology for causal reasoning. The mathematical notation and abstract concepts suggest this is intended for a highly technical audience with a strong background in statistics, information theory, or related fields. The diagram is a theoretical model, and its practical application would likely require significant computational resources and domain-specific knowledge.
</details>
## Acknowledgments
This project was made possible through the support of a grant from Templeton World Charity Foundation (TWCF0216). In addition, this research was supported by the David P White Chair in Sleep Medicine at the University of Wisconsin-Madison, by the Tiny Blue Dot Foundation (UW 133AAG3451; G.T.), and by the Natural Science and Engineering Research Council of Canada (NSERC; RGPIN-2019-05418; W.M.). L.A. also acknowledges the support of a grant from the Templeton World Charity Foundation (TWCF-2020-20526).
## References
1. Ellia F, Hendren J, Grasso M, Kozma C, Mindt G, Lang JP, Haun AM, Albantakis L, Boly M, and Tononi G. Consciousness and the fallacy of misplaced objectivity. Neuroscience of Consciousness. 2021;2021(2):1-12. doi:10.1093/NC/NIAB032.
2. Nagel T. What is it like to be a bat? The philosophical review. 1974;83(4):435-450.
3. Tononi G. Integrated information theory. Scholarpedia. 2015;10(1):4164.
4. Tononi G. On being; forthcoming.
5. A substrate should be understood as a set of units that can be observed and manipulated.
6. As mentioned in the section 'Determining maximal unit grains,' a substrate unit must be maximally irreducible within, which is likely the case for 'real' neurons in the brain, but is not the case for 'virtual,' simulated neurons in a computer program.
7. Tononi G, Boly M, Massimini M, Koch C. Integrated information theory: from consciousness to its physical substrate. Nature Reviews Neuroscience. 2016;17(7):450-461. doi:10.1038/nrn.2016.44.
8. Tononi G, Sporns O. Measuring information integration. BMC neuroscience. 2003;4(31):1-20.
9. Tononi G. An information integration theory of consciousness. BMC neuroscience. 2004;5:42. doi:10.1186/1471-2202-5-42.
10. Balduzzi D, Tononi G. Integrated information in discrete dynamical systems: motivation and theoretical framework. PLoS Comput Biol. 2008;4(6):e1000091. doi:10.1371/journal.pcbi.1000091.
11. Oizumi M, Albantakis L, Tononi G. From the Phenomenology to the Mechanisms of Consciousness: Integrated Information Theory 3.0. PLoS Computational Biology. 2014;10(5):e1003588. doi:10.1371/journal.pcbi.1003588.
12. Balduzzi D, Tononi G. Qualia: the geometry of integrated information. PLoS computational biology. 2009;5(8):e1000462. doi:10.1371/journal.pcbi.1000462.
13. Albantakis L, Marshall W, Hoel E, Tononi G. What caused what? A quantitative account of actual causation using dynamical causal networks. Entropy. 2019;21(5):459. doi:10.3390/e21050459.
14. Haun AM, Tononi G. Why Does Space Feel the Way it Does? Towards a Principled Account of Spatial Experience. Entropy. 2019;21(12):1160. doi:10.3390/e21121160.
15. Barbosa LS, Marshall W, Albantakis L, Tononi G. Mechanism Integrated Information. Entropy. 2021;23(3):362.
16. Marshall W, Grasso M, Mayner WGP, Zaaemzadeh A, Barbosa LS, Chastain E, Findlay G, Sasai S, Albantakis L, and Tononi G. System integrated information; submitted.
17. Barbosa LS, Marshall W, Streipert S, Albantakis L, Tononi G. A measure for intrinsic information. Scientific Reports. 2020;10(1):18803. doi:10.1038/s41598-020-75943-4.
18. Intrinsic Ontology Wiki;. Available from: https://centerforsleepandconsciousness.psychiatry.wisc.edu/ intrinsic-ontology-wiki/ .
19. Albantakis L. Integrated information theory. In: Overgaard M, Mogensen J, Kirkeby-Hinrup A, editors. Beyond Neural Correlates of Consciousness. Routledge; 2020. p. 87-103.
20. Grasso M, Albantakis L, Lang JP, Tononi G. Causal reductionism and causal structures. Nature Neuroscience. 2021;24(10):1348-1355. doi:10.1038/s41593-021-00911-8.
21. Tononi G, Albantakis L, Boly M, Cirelli C, Koch C. Only what exists can cause: An intrinsic view of free will. 2022;doi:10.48550/arxiv.2206.02069.
22. Tononi G, Koch C. Consciousness: here, there and everywhere? Philosophical transactions of the Royal Society of London Series B, Biological sciences. 2015;370:20140167-. doi:10.1098/rstb.2014.0167.
23. Findlay G, Marshall W, Albantakis L, Mayner WGP, Koch C, Tononi G. Dissociating Intelligence from Consciousness in Artificial Systems - Implications of Integrated Information Theory. In: Proceedings of the 2019 Towards Conscious AI Systems Symposium, AAAI SSS19; 2019 and forthcoming.
24. Albantakis L, Prentner R, Durham I. Measuring the integrated information of a quantum mechanism; submitted.
25. Massimini M, Ferrarelli F, Huber R, Esser SK, Singh H, Tononi G. Breakdown of cortical effective connectivity during sleep. Science. 2005;309(5744):2228-2232.
26. Casarotto S, Comanducci A, Rosanova M, Sarasso S, Fecchio M, Napolitani M, et al. Stratification of unresponsive patients by an independently validated index of brain complexity. Annals of Neurology. 2016;80(5):718-729.
27. Strictly speaking, distinctions and relations that can be singled out phenomenally, such as a spot, a book, and so on, correspond, in physical terms, to bundles of distinctions and relations (compound distinctions and relations)-that is, to sub-structures of a Φ -structure ( Φ -folds) [1, 14]. This can be understood in neural terms because attentional mechanisms can only highlight subsets of units, and thereby all the associated distinctions and relations, rather than individual distinctions and relations. In other words, introspection is the starting point for an explanation of experience in physical terms, but it can only go so far. A full
- explanation can only be provided through a back-and-forth between the properties of a substrate, which can be explored in great detail, and the properties of experience, which can only be characterized crudely through introspection.
28. Comolatti, R et al. Why does time feel flowing?; in preparation.
29. Grasso, M et al. How do phenomenal objects bind general concepts with particular features?; in preparation.
30. Janzing D, Balduzzi D, Grosse-Wentrup M, Sch¨ olkopf B. Quantifying causal influences. The Annals of Statistics. 2013;41(5):2324-2358. doi:10.1214/13-AOS1145.
31. Ay N, Polani D. Information Flows in Causal Networks. Advances in Complex Systems. 2008;11(01):17-41. doi:10.1142/S0219525908001465.
32. Pearl J. Causality: models, reasoning and inference. vol. 29. Cambridge Univ Press; 2000.
33. While the IIT formalism can be applied to hypothetical or simulated systems (as we do for the example systems in the Results/Discussion section), for the resulting quantities to capture existence in physical terms they must be applied to substrate units that can actually be observed and manipulated in physical terms.
34. As demonstrated in [24], it is possible to extend IIT's causal framework to finite quantum systems under unitary evolution, where the conditional independence assumption (2) applies to non-entangled subsystems.
35. Note that this notion of irreducibility based on set-partitions differs from typical information-theoretical notions like redundancy or synergy [36,85,86].
36. Albantakis L, Tononi G. Causal Composition: Structural Differences among Dynamically Equivalent Systems. Entropy 2019, Vol 21, Page 989. 2019;21(10):989. doi:10.3390/E21100989.
37. Cooper J. Plato: Complete Works. Hackett; 1997.
38. Tillemans T. Dharmak¯ ırti. In: Zalta EN, editor. The Stanford Encyclopedia of Philosophy. Spring 2021 ed. Metaphysics Research Lab, Stanford University; 2021.
39. A principle of IIT not discussed here is the Principle of becoming, which states that powers become what powers do. That is, conditional probabilities in the TPM update depending on what happens. The principle and some of its implications-examined in [4] and forthcoming work.
40. Barrett AB, Mediano PAM. The phi measure of integrated information is not well-defined for general physical systems. Journal of Consciousness Studies. 2019;26(1-2):11-20.
41. Moon K. Exclusion and Underdetermined Qualia. Entropy. 2019;21(4):405. doi:10.3390/e21040405.
42. Hoel EP, Albantakis L, Marshall W, Tononi G. Can the macro beat the micro? Integrated information across spatiotemporal scales. Neuroscience of Consciousness. 2016;2016(1).
43. Marshall W, Albantakis L, Tononi G. Black-boxing and cause-effect power. PLOS Computational Biology. 2018;14(4):e1006114. doi:10.1371/journal.pcbi.1006114.
44. Units within the candidate system are causally marginalized to discount their causal contribution if they are not part of the mechanism or purview under consideration. By contrast, units outside the candidate system are causally conditioned in their current state throughout the analysis and act as fixed background conditions.
45. Krohn S, Ostwald D. Computing integrated information. Neuroscience of Consciousness. 2017;2017(1). doi:10.1093/nc/nix017.
46. Mayner WGP, Marshall W, Albantakis L, Findlay G, Marchman R, Tononi G. PyPhi: A toolbox for integrated information theory. PLoS Computational Biology. 2018;14(7):e1006343. doi:10.1371/journal.pcbi.1006343.
47. Kanwisher N. Functional specificity in the human brain: a window into the functional architecture of the mind. Proceedings of the National Academy of Sciences. 2010;107(25):11163-11170.
48. Ponce CR, Xiao W, Schade PF, Hartmann TS, Kreiman G, Livingstone MS. Evolving images for visual neurons using a deep generative network reveals coding principles and neuronal preferences. Cell. 2019;177(4):999-1009.
49. Khosla M, Wehbe L. High-level visual areas act like domain-general filters with strong selectivity and functional specialization. bioRxiv. 2022;doi:10.1101/2022.03.16.484578.
50. Mainen ZF, Sejnowski TJ. Reliability of spike timing in neocortical neurons. Science. 1995;268(5216):1503-1506.
51. Hires SA, Gutnisky DA, Yu J, O'Connor DH, Svoboda K. Low-noise encoding of active touch by layer 4 in the somatosensory cortex. eLife. 2015;4:e06619.
52. Nolte M, Reimann MW, King JG, Markram H, Muller EB. Cortical reliability amid noise and chaos. Nature Communications. 2019;10(1):1-15.
53. Lemon R, Edgley S. Life without a cerebellum. Brain. 2010;133(3):652-654.
54. Yu F, Jiang Qj, Sun Xy, Zhang Rw. A new case of complete primary cerebellar agenesis: clinical and imaging findings in a living patient. Brain. 2015;138(6):e353-e353.
55. Steriade M, Nunez A, Amzica F. A novel slow ( < 1 Hz) oscillation of neocortical neurons in vivo: depolarizing and hyperpolarizing components. Journal of Neuroscience. 1993;13(8):3252-3265.
56. Pigorini A, Sarasso S, Proserpio P, Szymanski C, Arnulfo G, Casarotto S, et al. Bistability breaks-off deterministic responses to intracortical stimulation during non-REM sleep. Neuroimage. 2015;112:105-113.
57. Middleton FA, Strick PL. Basal ganglia and cerebellar loops: motor and cognitive circuits. Brain Research Reviews. 2000;31(2-3):236-250.
58. Foster NN, Barry J, Korobkova L, Garcia L, Gao L, Becerra M, et al. The mouse cortico-basal ganglia-thalamic network. Nature. 2021;598(7879):188-194.
59. Fujii, K et al. Some anatomical constraints on integrated information; in preparation.
60. Zaeemzadeh, A et al. Upper Bounds for Integrated Information; in preparation.
61. Boly M, Massimini M, Tsuchiya N, Postle BR, Koch C, Tononi G. Are the neural correlates of consciousness in the front or in the back of the cerebral cortex? Clinical and neuroimaging evidence. Journal of Neuroscience. 2017;37(40):9603-9613.
62. Watakabe A, Skibbe H, Nakae K, Abe H, Ichinohe N, Rachmadi MF, et al. Local and long-distance organization of prefrontal cortex circuits in the marmoset brain. bioRxiv. 2022;doi:10.1101/2021.12.26.474213.
63. Boly, M et al. Neural correlates of pure presence; in preparation.
64. Hanson JR, Walker SI. Formalizing falsification for theories of consciousness across computational hierarchies. Neuroscience of Consciousness. 2021;2021(2). doi:10.1093/NC/NIAB014.
65. Krohn K, Rhodes J. Algebraic Theory of Machines. I. Prime Decomposition Theorem for Finite Semigroups and Machines. Transactions of the American Mathematical Society. 1965;116:450. doi:10.2307/1994127.
66. Albantakis L, Tononi G. The Intrinsic Cause-Effect Power of Discrete Dynamical Systems-From Elementary Cellular Automata to Adapting Animats. Entropy. 2015;17(8):5472-5502. doi:10.3390/e17085472.
67. Moyal R, Fekete T, Edelman S. Dynamical Emergence Theory (DET): A Computational Account of Phenomenal Consciousness. Minds and Machines. 2020;30(1):1-21. doi:10.1007/s11023-020-09516-9.
68. Melloni L, Mudrik L, Pitts M, Koch C. Making the hard problem of consciousness easier. Science. 2021;372(6545):911-912.
69. Sarasso S, Casali AG, Casarotto S, Rosanova M, Sinigaglia C, Massimini M, et al. Consciousness and complexity: a consilience of evidence. Neuroscience of Consciousness. 2021;7(2):1-24.
70. Sarasso S, D'Ambrosio S, Fecchio M, Casarotto S, Vigan` o A, Landi C, et al. Local sleep-like cortical reactivity in the awake brain after focal injury. Brain. 2020;143(12):3672-3684.
71. Song C, Haun AM, Tononi G. Plasticity in the structure of visual space. Eneuro. 2017;4(3).
72. Mayner WGP, Juel BE, Tononi G. Intrinsic meaning, perception, and matching: measuring how intrinsic cause-effect structures resonate with extrinsic inputs; in preparation.
73. Zaeemzadeh, A et al. Shannon Information and Integrated Information; in preparation.
74. Albantakis L, Hintze A, Koch C, Adami C, Tononi G. Evolution of Integrated Causal Structures in Animats Exposed to Environments of Increasing Complexity. PLoS computational biology. 2014;10(12):e1003966. doi:10.1371/journal.pcbi.1003966.
75. Carroll S. Consciousness and the Laws of Physics. Journal of Consciousness Studies. 2021;28(9):16-31. doi:10.53765/20512201.28.9.016.
76. Zanardi P, Tomka M, Venuti LC. Towards Quantum Integrated Information Theory. arXiv. 2018;1806.01421.
77. Kleiner J, Tull S. The Mathematical Structure of Integrated Information Theory. Frontiers in Applied Mathematics and Statistics. 2021;6:74. doi:10.3389/FAMS.2020.602973/BIBTEX.
78. Esteban FJ, Galad´ ı JA, Langa JA, Portillo JR, Soler-Toscano F. Informational structures: A dynamical system approach for integrated information. PLOS Computational Biology. 2018;14(9):e1006154. doi:10.1371/journal.pcbi.1006154.
79. Kalita P, Langa JA, Soler-Toscano F. Informational Structures and Informational Fields as a Prototype for the Description of Postulates of the Integrated Information Theory. Entropy. 2019;21(5):493. doi:10.3390/e21050493.
80. Hanson JR, Walker SI. On the Non-uniqueness Problem in Integrated Information Theory. 2020;doi:10.1101/2021.04.07.438793.
81. Tononi G. Consciousness as integrated information: a provisional manifesto. Biol Bull. 2008;215(3):216-242. doi:215/3/216 [pii].
82. Tononi G. Integrated Information Theory of Consciousness: An Updated Account. Arch Ital Biol. 2012;150:56-90.
83. Bayne T. On the axiomatic foundations of the integrated information theory of consciousness. Neuroscience of Consciousness. 2018;2018(1). doi:10.1093/nc/niy007.
84. Merker B, Williford K, Rudrauf D. The Integrated Information Theory of consciousness: A case of mistaken identity. Behavioral and Brain Sciences. 2021;21:1-72. doi:10.1017/S0140525X21000881.
85. Beer RD, Williams PL. Information processing and dynamics in minimally cognitive agents. Cognitive Science. 2015;39(1):1-38. doi:10.1111/cogs.12142.
86. Mediano PAM, Rosas F, Carhart-Harris RL, Seth AK, Barrett AB. Beyond integrated information: A taxonomy of information dynamics phenomena. arXiv. 2019;1909.02297.