2502.13025
Model: gemini-2.0-flash
# Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks
**Authors**:
- Markus J. Buehler (Laboratory for Atomistic and Molecular Mechanics)
- Cambridge, MA 02139, USA
> Corresponding author.
Abstract
We present an agentic, autonomous graph expansion framework that iteratively structures and refines knowledge in situ. Unlike conventional knowledge graph construction methods relying on static extraction or single-pass learning, our approach couples a reasoning-native large language model with a continually updated graph representation. At each step, the system actively generates new concepts and relationships, merges them into a global graph, and formulates subsequent prompts based on its evolving structure. Through this feedback-driven loop, the model organizes information into a scale-free network characterized by hub formation, stable modularity, and bridging nodes that link disparate knowledge clusters. Over hundreds of iterations, new nodes and edges continue to appear without saturating, while centrality measures and shortest path distributions evolve to yield increasingly distributed connectivity. Our analysis reveals emergent patterns—such as the rise of highly connected “hub” concepts and the shifting influence of “bridge” nodes—indicating that agentic, self-reinforcing graph construction can yield open-ended, coherent knowledge structures. Applied to materials design problems, we present compositional reasoning experiments by extracting node-specific and synergy-level principles to foster genuinely novel knowledge synthesis, yielding cross-domain ideas that transcend rote summarization and strengthen the framework’s potential for open-ended scientific discovery. We discuss other applications in scientific discovery and outline future directions for enhancing scalability and interpretability.
Keywords Artificial Intelligence $·$ Science $·$ Graph Theory $·$ Category Theory $·$ Materials Science $·$ Materiomics $·$ Language Modeling $·$ Reasoning $·$ Isomorphisms $·$ Engineering
1 Introduction
Scientific inquiry often proceeds through an interplay of incremental refinement and transformative leaps, evoking broader questions of how knowledge evolves under continual reflection and questioning. In many accounts of discovery, sustained progress arises not from isolated insights but from an iterative process in which prior conclusions are revisited, expressed as generalizable ideas, refined, or even reorganized as new evidence and perspectives emerge [1]. Foundational work in category theory has formalized aspects of this recursive structuring, showing how hierarchical representations can unify diverse knowledge domains and enable higher-level abstractions in both the natural and social sciences [2, 3, 4]. Across engineering disciplines including materials science, such iterative integration of information has proven essential in synthesizing deeply interlinked concepts.
Recent AI methods, however, often emphasize predictive accuracy and single-step outputs over the layered, self-reflective processes that characterize human problem-solving. Impressive gains in natural language processing, multimodal reasoning [5, 6, 7, 8, 9, 10, 11, 12], and materials science [13, 14, 15, 16, 17], including breakthroughs in molecular biology [18] and protein folding [19, 20, 21], showcase the prowess of large-scale models trained on vast datasets. Yet most of the early systems generate answers in a single pass, sidestepping the symbolic, stepwise reasoning that often underpins scientific exploration. This gap has prompted a line of research into modeling that explicitly incorporates relational modeling, reflection or multi-step inferences [2, 3, 4, 22, 23, 24, 25, 26, 27, 28], hinting at a transition from single-shot pattern recognition to more adaptive synthesis of answers from first principles in ways that more closely resemble compositional mechanisms. Thus, a fundamental challenge now is how can we build scientific AI systems that synthesize information rather than memorizing it.
Graphs offer a natural substrate for this kind of iterative knowledge building. By representing concepts and their relationships as a network, it becomes possible to capture higher-order structure—such as hubs, bridging nodes, or densely interconnected communities—that might otherwise remain implicit. This explicit relational format also facilitates systematic expansion: each newly added node or edge can be linked back to existing concepts, reshaping the network and enabling new paths of inference [29, 23, 27]. Moreover, graph-based abstractions can help large language models move beyond memorizing discrete facts; as nodes accumulate and form clusters, emergent properties may reveal cross-domain synergies or overlooked gaps in the knowledge space.
Recent work suggests that standard Transformer architectures can be viewed as a form of Graph Isomorphism Network (GIN), where attention operates over relational structures rather than raw token sequences [23]. Under this lens, each attention head effectively tests for isomorphisms in local neighborhoods of the graph, offering a principled way to capture both global and local dependencies. A category-theoretic perspective further bolsters this approach by providing a unified framework for compositional abstractions: nodes and edges can be treated as objects and morphisms, respectively, while higher-level concepts emerge from functorial mappings that preserve relational structure [2, 3, 4]. Taken together, these insights hint at the potential for compositional capabilities in AI systems, where simpler building blocks can be combined and reconfigured to form increasingly sophisticated representations, rather than relying on one-pass computations or static ontologies. By using graph-native modeling and viewing nodes and edges as composable abstractions, such a model may be able to recognize and reapply learned configurations in new contexts—akin to rearranging building blocks to form unanticipated solutions. This compositional approach, strengthened by category-theoretic insights, allows the system to not only interpolate among known scenarios but to extrapolate to genuinely novel configurations. In effect, graph-native attention mechanisms treat interconnected concepts as first-class entities, enabling the discovery of new behaviors or interactions that purely sequence-based methods might otherwise overlook.
A fundamental challenge remains: How can we design AI systems that, rather than merely retrieving or matching existing patterns, build and refine their own knowledge structures across iterations. Recent work proposes that graphs can be useful strategies to endow AI models with relational capabilities [29, 23, 27] both within the framework of creating graph-native attention mechanisms and by training models to use graphs as native abstractions during learned reasoning phases. Addressing this challenge requires not only methods for extracting concepts but also mechanisms for dynamically organizing them so that new information reshapes what is already known. By endowing large language models with recursively expanding knowledge graph capabilities, we aim to show how stepwise reasoning can support open-ended discovery and conceptual reorganization. The work presented here explores how such feedback-driven graph construction may lead to emergent, self-organizing behaviors, shedding light on the potential for truly iterative AI approaches that align more closely with the evolving, integrative nature of human scientific inquiry. Earlier work on graph-native reasoning has demonstrated that models explicitly taught how to reason in graphs and abstractions can lead to systems that generalize better and are more interpretable [27].
Here we explore whether we can push this approach toward ever-larger graphs, creating extensive in situ graph reasoning loops where models spend hours or days developing complex relational structures before responding to a task. Within such a vision, several key issues arise: Will repeated expansions naturally preserve the network’s relational cohesion, or risk splintering into disconnected clusters? Does the continuous addition of new concepts and edges maintain meaningful structure, or lead to saturation and redundancy? And to what extent do bridging nodes, which may initially spark interdisciplinary links, remain influential over hundreds of iterations? In the sections ahead, we investigate these questions by analyzing how our recursively expanded knowledge graphs grow and reorganize at scale—quantifying hub formation, modular stability, and the persistence of cross-domain connectors. Our findings suggest that, rather than collapsing under its own complexity, the system retains coherent, open-ended development, pointing to new possibilities for large-scale knowledge formation in AI-driven research for scientific exploration. Iterative Reasoning $i<N$
Define Initial Question (Broad question or specific topic, e.g., "Impact-Resistant Materials")
Generate Graph-native Reasoning Tokens <|thinking|> ... <|/thinking|>
Parse Graph $\mathcal{G}_{\text{local}}^{i}$ (Extract Nodes and Relations)
Merge Extracted Graph with Larger Graph (Append Newly Added Nodes/Edges) $\mathcal{G}←\mathcal{G}\cup\mathcal{G}_{\text{local}}^{i}$
Save and Visualize
Final Integrated Graph $\mathcal{G}$
Generate New Question Based on Last Extracted Added Nodes/Edges as captured in $\mathcal{G}_{\text{local}}^{i}$
Figure 1: Algorithm used for iterative knowledge extraction and graph refinement. At each iteration $i$ , the model generates reasoning tokens (blue). From the response, a local graph $\mathcal{G}_{\text{local}}^{i}$ is extracted (violet) and merged with the global knowledge graph $\mathcal{G}$ (light violet). The evolving graph is stored in multiple formats for visualization and analysis (yellow). Instead of letting the model respond to the task, a follow-up task is generated based on the latest extracted nodes and edges in $\mathcal{G}_{\text{local}}^{i}$ (green), ensuring iterative refinement (orange), so that the model generates yet more reasoning tokens, and as part of that process, new nodes and edges. The process continues until the stopping condition $i<N$ is met, yielding a final structured knowledge graph $\mathcal{G}$ (orange).
1.1 Knowledge Graph Expansion Approaches
Knowledge graphs are one way to organize relational understanding of the world. They have grown from manually curated ontologies decades ago into massive automatically constructed repositories of facts. A variety of methodologies have been developed for expanding knowledge graphs. Early approaches focused on information extraction from text using pattern-based or open-domain extractors. For example, the DIPRE algorithm [30] bootstrapped relational patterns from a few seed examples to extract new facts in a self-reinforcing loop. Similarly, the KnowItAll system [31] introduced an open-ended, autonomous “generate-and-test” paradigm to extract entity facts from the web with minimal supervision. Open Information Extraction methods like TextRunner [32] and ReVerb [33] further enabled unsupervised extraction of subject–predicate–object triples from large text corpora without requiring a predefined schema. These unsupervised techniques expanded knowledge graphs by harvesting new entities and relations from unstructured data, although they often required subsequent mapping of raw extractions to a coherent ontology.
In parallel, research on knowledge graph completion has aimed to expand graphs by inferring missing links and attributes. Statistical relational learning and embedding-based models (e.g., translational embeddings like TransE [34]) predict new relationships by generalizing from known graph structures. Such approaches, while not fully unsupervised (they rely on an existing core of facts for training), can autonomously suggest plausible new edges to add to a knowledge graph. Complementary to embeddings, logical rule-mining systems such as AMIE [35] showed that high-confidence Horn rules can be extracted from an existing knowledge base and applied to infer new facts recursively. Traditional link prediction heuristics from network science – for example, preferential attachment and other graph connectivity measures – have also been used as simple unsupervised methods to propose new connections in knowledge networks. Together, these techniques form a broad toolkit for knowledge graph expansion, combining text-derived new content with graph-internal inference to improve a graph’s coverage and completeness.
1.2 Recursive and Autonomous Expansion Techniques
A notable line of work seeks to make knowledge graphs growth continuous and self-sustaining – essentially achieving never-ending expansion. The NELL project (Never-Ending Language Learner) [36] pioneered this paradigm, with a system that runs 24/7, iteratively extracting new beliefs from the web, integrating them into its knowledge base, and retraining itself to improve extraction competence each day. Over years of operation, NELL has autonomously accumulated millions of facts by coupling multiple learners (for parsing, classification, relation extraction, etc.) in a semi-supervised bootstrapping loop. This recursive approach uses the knowledge learned so far to guide future extractions, gradually expanding coverage while self-correcting errors; notably, NELL can even propose extensions to its ontology as new concepts emerge.
Another milestone in autonomous knowledge graph construction was Knowledge Vault [37], which demonstrated web-scale automatic knowledge base population by fusing facts from diverse extractors with probabilistic inference. Knowledge Vault combined extractions from text, tables, page structure, and human annotations with prior knowledge from existing knowledge graphs, yielding a vast collection of candidate facts (on the order of 300 million) each accompanied by a calibrated probability of correctness. This approach showed that an ensemble of extractors, coupled with statistical fusion, can populate a knowledge graph at scales far beyond what manual curation or single-source extraction can achieve. Both NELL and Knowledge Vault illustrate the power of autonomous or weakly-supervised systems that grow a knowledge graph with minimal human intervention, using recursive learning and data fusion to continually expand and refine the knowledge repository.
More recent research has explored agent-based and reinforcement learning (RL) frameworks for knowledge graph expansion and reasoning. Instead of one-shot predictions, these methods allow an agent to make multi-hop queries or sequential decisions to discover new facts or paths in the graph. For example, some work [38] employ an agent that learns to navigate a knowledge graph and find multi-step relational paths, effectively learning to reason over the graph to answer queries. Such techniques highlight the potential of autonomous reasoning agents that expand knowledge by exploring connections in a guided manner (using a reward signal for finding correct or novel information). This idea of exploratory graph expansion aligns with concepts in network science, where traversing a network can reveal undiscovered links or communities. It also foreshadowed approaches like Graph-PReFLexOR [27] that treat reasoning as a sequential decision process, marked by special tokens, that can iteratively build and refine a task-specific knowledge graph.
Applications of these expansion techniques in science and engineering domains underscore their value for discovery [29]. Automatically constructed knowledge graphs have been used to integrate and navigate scientific literature, enabling hypothesis generation by linking disparate findings. A classic example is Swanson’s manual discovery of a connection between dietary fish oil and Raynaud’s disease, which emerged by linking two disjoint bodies of literature through intermediate concepts [39, 40]. Modern approaches attempt to replicate such cross-domain discovery in an automated way: for instance, mining biomedical literature to propose new drug–disease links, or building materials science knowledge graphs that connect material properties, processes, and applications to suggest novel materials, engineering concepts, or designs [41, 29].
1.3 Relation to Earlier Work and Key Hypothesis
The prior work discussed in Section 1.2 provides a foundation for our approach, which draws on the never-ending learning spirit of NELL [36] and the web-scale automation of Knowledge Vault [37] to dynamically grow a knowledge graph in situ as it reasons. Like those systems, it integrates information from diverse sources and uses iterative self-improvement. However, rather than relying on passive extraction or purely probabilistic link prediction, our method pairs on-the-fly logical reasoning with graph expansion within the construct of a graph-native reasoning LLM. This means each newly added node or edge is both informed by and used for the model’s next step of reasoning. Inspired in part by category theory and hierarchical inference, we move beyond static curation by introducing a principled, recursive reasoning loop that helps maintain transparency in how the knowledge graph evolves. In this sense, the work can be seen as a synthesis of existing ideas—continuous learning, flexible extraction, and structured reasoning—geared toward autonomous problem-solving in scientific domains.
Despite substantial progress in knowledge graph expansion, many existing methods still depend on predefined ontologies, extensive post-processing, or reinforce only a fixed set of relations. NELL and Knowledge Vault, for instance, demonstrated how large-scale extraction and integration of facts can be automated, but they rely on established schemas or require manual oversight to refine extracted knowledge [36, 37]. Reinforcement learning approaches such as DeepPath [38] can efficiently navigate existing graphs but do not grow them by generating new concepts or hypotheses.
By contrast, the work reported here treats reasoning as an active, recursive process that expands a knowledge graph while simultaneously refining its structure. This aligns with scientific and biological discovery processes, where knowledge is not just passively accumulated but also reorganized in light of new insights. Another key distinction is the integration of preference-based objectives, enabling more explicit interpretability of each expansion step. Methods like TransE [34] excel at capturing statistical regularities but lack an internal record of reasoning paths; our approach, in contrast, tracks and justifies each newly added node or relation. This design allows for a transparent, evolving representation that is readily applied to interdisciplinary exploration—such as in biomedicine [39] and materials science [41] —without depending on rigid taxonomies.
Hence, this work goes beyond conventional graph expansion by embedding recursive reasoning directly into the construction process, bridging the gap between passive knowledge extraction and active discovery. As we show in subsequent sections, this self-expanding paradigm yields scale-free knowledge graphs in which emergent hubs and bridge nodes enable continuous reorganization, allowing the system to evolve its understanding without exhaustive supervision and paving the way for scalable hypothesis generation and autonomous reasoning.
Hypothesis.
We hypothesize that recursive graph expansion enables self-organizing knowledge formation, allowing intelligence-like behavior to emerge without predefined ontologies, external supervision, or centralized control. Using a pre-trained model, Graph-PReFLexOR (an autonomous graph-reasoning model trained on a corpus of biological and biologically inspired materials principles) we demonstrate that knowledge graphs can continuously expand in a structured yet open-ended manner, forming scale-free networks with emergent conceptual hubs and interdisciplinary bridge nodes. Our findings suggest that intelligence-like reasoning can arise from recursive self-organization, challenging conventional paradigms and advancing possibilities for autonomous scientific discovery and scalable epistemic reasoning.
2 Results and Discussion
We present the results of experiments in which the graph-native reasoning model engages in a continuous, recursive process of graph-based reasoning, expanding its knowledge graph representation autonomously over 1,000 iterations. Unlike prior approaches that rely on a small number of just a few recursive reasoning steps, the experiments reported in this paper explore how knowledge formation unfolds in an open-ended manner, generating a dynamically evolving graph. As the system iterates, it formulates new tasks, refines reasoning pathways, and integrates emerging concepts, progressively structuring its own knowledge representation following the simple algorithmic paradigm delineated in Figure 1. The resulting graphs from all iterations form a final integrated knowledge graph, which we analyze for structural and conceptual insights. Figure 2 depicts the final state of the graph, referred to as graph $\mathcal{G}_{1}$ , after the full reasoning process.
The recursive graph reasoning process can be conducted in either an open-ended setting or develoepd into a more tailored manner to address a specific domain or flavor in which reasoning steps are carried out (details, see Materials and Methods). In the example explored here, we focus on designing impact-resistant materials. In this specialized scenario, we initiate the model with a concise, topic-specific prompt – e.g., Describe a way to design impact resistant materials, and maintain the iterative process of extracting structured knowledge from the model’s reasoning. We refer to the resulting graph as $\mathcal{G}_{2}$ . Despite the narrower focus, the same core principles apply: each new piece of information from the language model is parsed into nodes and edges, appended to a global graph, and informs the next iteration’s query. In this way, $\mathcal{G}_{2}$ captures a highly directed and domain-specific knowledge space while still exhibiting many of the emergent structural traits—such as hub formation, stable modularity, and growing connectivity—previously seen in the more general graph $\mathcal{G}_{1}$ . Figure 3 shows the final snapshot for $\mathcal{G}_{2}$ . To further examine the emergent structural organization of both graphs, Figures S1 and S2 display the same graphs with nodes and edges colored according to cluster identification, revealing the conceptual groupings that emerge during recursive knowledge expansion.
<details>
<summary>x2.png Details</summary>

### Visual Description
## Network Diagram: Interconnected Nodes
### Overview
The image is a network diagram showing interconnected nodes. The nodes are represented by circles, and the connections between them are represented by curved lines. The color of the nodes and lines varies from light orange to dark red, potentially indicating the strength or frequency of the connection.
### Components/Axes
* **Nodes:** Represented by circles of varying sizes and colors.
* **Connections:** Represented by curved lines of varying thickness and colors, linking the nodes.
* **Color Gradient:** The color of the nodes and lines ranges from light orange to dark red.
### Detailed Analysis
* **Node Distribution:** The nodes are clustered in several distinct regions, with some nodes acting as central hubs connecting multiple clusters.
* **Connection Density:** The density of connections varies across the diagram, with some regions having a high concentration of lines and others being more sparse.
* **Color Variation:** The color of the nodes and lines appears to correlate with the number of connections a node has. Darker red nodes seem to have more connections than lighter orange nodes.
* **Hub Nodes:** There are a few prominent nodes that are significantly larger and darker red than the others. These nodes act as central hubs, with many connections radiating from them. One such hub is located in the bottom-left quadrant of the image, and another is in the upper-center.
### Key Observations
* The network exhibits a clustered structure, with some nodes acting as central hubs.
* The color gradient suggests a correlation between the number of connections and the color intensity of the nodes and lines.
* The diagram lacks explicit labels or a legend, making it difficult to determine the exact meaning of the nodes, connections, and color gradient.
### Interpretation
The network diagram likely represents a complex system of relationships between different entities. The nodes could represent individuals, organizations, concepts, or any other type of entity, while the connections represent the relationships between them. The color gradient could indicate the strength, frequency, or importance of these relationships.
The clustered structure of the network suggests that the entities are grouped into distinct communities or categories. The central hubs represent influential entities that play a key role in connecting these communities.
Without additional information, it is difficult to determine the specific meaning of the diagram. However, the visual representation provides valuable insights into the structure and dynamics of the underlying system.
</details>
Figure 2: Knowledge graph $\mathcal{G_{1}}$ after around 1,000 iterations, under a flexible self-exploration scheme initiated with the prompt Discuss an interesting idea in bio-inspired materials science. We observe the formation of a highly connected graph with multiple hubs and centers.
<details>
<summary>x3.png Details</summary>

### Visual Description
## Network Diagram: Connectivity Map
### Overview
The image is a network diagram depicting connections between various nodes. The nodes are represented by circles, and the connections between them are represented by lines. The color intensity of both the nodes and the lines varies, suggesting a weighting or intensity of connection. The diagram appears to show three main clusters of nodes, with one cluster being significantly more central and densely connected than the other two.
### Components/Axes
* **Nodes:** Represented by circles, varying in size and color intensity (light orange to dark red).
* **Edges:** Represented by lines connecting the nodes, also varying in color intensity (light orange to dark red).
* **Clusters:** Three distinct clusters of interconnected nodes are visible.
### Detailed Analysis
* **Central Cluster:** Located in the center-left of the image. This cluster has the largest and darkest red node, indicating a high degree of connectivity or importance. Many lines converge on this central node, and the lines are generally darker red closer to this node.
* **Upper Cluster:** Located above and slightly to the right of the central cluster. This cluster is less dense than the central cluster, with lighter orange nodes and edges.
* **Lower Cluster:** Located in the bottom-right of the image. This cluster is the least dense, with the faintest orange nodes and edges.
* **Connectivity:** The lines connecting the nodes vary in thickness and color intensity. Thicker, darker lines suggest stronger or more frequent connections. The lines are curved, giving the diagram a more organic feel.
### Key Observations
* The central cluster is the most prominent and highly connected, suggesting it plays a critical role in the network.
* The upper and lower clusters are less densely connected and may represent peripheral or less important components of the network.
* The color gradient from light orange to dark red likely represents the strength or frequency of connections, with darker colors indicating stronger connections.
### Interpretation
The network diagram likely represents a system where different entities (nodes) are interconnected. The central cluster represents a hub or core component of the system, while the other clusters represent peripheral components. The connections between the nodes indicate relationships or interactions between the entities. The color intensity of the nodes and edges likely represents the strength or frequency of these relationships.
The diagram suggests that the central node is a critical point of failure, as its disruption would likely have a significant impact on the entire network. The other clusters may be more resilient to disruption, as they are less densely connected.
The diagram could represent a variety of systems, such as:
* A social network, where nodes represent individuals and edges represent relationships.
* A transportation network, where nodes represent cities and edges represent routes.
* A computer network, where nodes represent devices and edges represent connections.
* A biological network, where nodes represent genes or proteins and edges represent interactions.
</details>
Figure 3: Visualizatrion of the knowledge graph Graph 2 after around 500 iterations, under a topic-specific self-exploration scheme initiated with the prompt Describe a way to design impact resistant materials. The graph structure features a complex interwoven but highly connected network with multiple centers.
Table 1 shows a comparison of network properties for two graphs (graph $\mathcal{G_{1}}$ , see Figure 2 and graph $\mathcal{G_{2}}$ , see Figure 3), each computed at the end of their iterations. The scale-free nature of each graph is determined by fitting the degree distribution to a power-law model using the maximum likelihood estimation method. The analysis involves estimating the power-law exponent ( $\alpha$ ) and the lower bound ( $x_{\min}$ ), followed by a statistical comparison against an alternative exponential distribution. A log-likelihood ratio (LR) greater than zero and a $p$ -value below 0.05 indicate that the power-law distribution better explains the degree distribution than an exponential fit, suggesting that the network exhibits scale-free behavior. In both graphs, these criteria are met, supporting a scale-free classification. We observe that $\mathcal{G_{1}}$ has a power-law exponent of $\alpha=3.0055$ , whereas $\mathcal{G_{2}}$ has a lower $\alpha=2.6455$ , indicating that Graph 2 has a heavier-tailed degree distribution with a greater presence of high-degree nodes (hubs). The lower bound $x_{\min}$ is smaller in $\mathcal{G_{2}}$ ( $x_{\min}=10.0$ ) compared to $\mathcal{G_{1}}$ ( $x_{\min}=24.0$ ), suggesting that the power-law regime starts at a lower degree value, reinforcing its stronger scale-free characteristics.
Other structural properties provide additional insights into the connectivity and organization of these graphs. The average clustering coefficients (0.1363 and 0.1434) indicate moderate levels of local connectivity, with $\mathcal{G_{2}}$ exhibiting slightly higher clustering. The average shortest path lengths (5.1596 and 4.8984) and diameters (17 and 13) suggest that both graphs maintain small-world characteristics, where any node can be reached within a relatively short number of steps. The modularity values (0.6970 and 0.6932) indicate strong community structures in both graphs, implying the presence of well-defined clusters of interconnected nodes. These findings collectively suggest that both graphs exhibit small-world and scale-free properties, with $\mathcal{G_{2}}$ demonstrating a stronger tendency towards scale-free behavior due to its lower exponent and smaller $x_{\min}$ .
Beyond scale-free characteristics, we note that the two graphs exhibit differences in structural properties that influence their connectivity and community organization. We find that $\mathcal{G_{1}}$ , with 3,835 nodes and 11,910 edges, is much larger and more densely connected than $\mathcal{G_{2}}$ , which has 2,180 nodes and 6,290 edges. However, both graphs have similar average degrees (6.2112 and 5.7706), suggesting comparable overall connectivity per node. The number of self-loops is slightly higher in Graph 1 (70 vs. 33), though this does not significantly impact global structure. The clustering coefficients (0.1363 and 0.1434) indicate moderate levels of local connectivity, with Graph 2 exhibiting slightly more pronounced local clustering. The small-world nature of both graphs is evident from their average shortest path lengths (5.1596 and 4.8984) and diameters (17 and 13), implying efficient information flow. Modularity values (0.6970 and 0.6932) suggest both graphs have well-defined community structures, with Graph 1 showing marginally stronger modularity, possibly due to its larger size. Overall, while both graphs display small-world and scale-free properties, $\mathcal{G_{2}}$ appears to have a more cohesive structure with shorter paths and higher clustering, whereas $\mathcal{G_{1}}$ is larger with a slightly stronger community division.
| Number of nodes Number of edges Average degree | 3835 11910 6.2112 | 2180 6290 5.7706 |
| --- | --- | --- |
| Number of self-loops | 70 | 33 |
| Average clustering coefficient | 0.1363 | 0.1434 |
| Average shortest path length (LCC) | 5.1596 | 4.8984 |
| Diameter (LCC) | 17 | 13 |
| Modularity (Louvain) | 0.6970 | 0.6932 |
| Log-likelihood ratio (LR) | 15.6952 | 39.6937 |
| p-value | 0.0250 | 0.0118 |
| Power-law exponent ( $\alpha$ ) | 3.0055 | 2.6455 |
| Lower bound ( $x_{\min}$ ) | 24.0 | 10.0 |
| Scale-free classification | Yes | Yes |
Table 1: Comparison of network properties for two graphs (graph $\mathcal{G_{1}}$ , see Figure 2 and S1 and graph $\mathcal{G_{2}}$ , see Figure 3 and S2), each computed at the end of their iterations. Both graphs exhibit scale-free characteristics, as indicated by the statistically significant preference for a power-law degree distribution over an exponential fit (log-likelihood ratio $LR>0$ and $p<0.05$ ). The power-law exponent ( $\alpha$ ) for $\mathcal{G_{1}}$ is 3.0055, while $\mathcal{G_{2}}$ has a lower exponent of 2.6455, suggesting a heavier-tailed degree distribution. The clustering coefficients (0.1363 and 0.1434) indicate the presence of local connectivity, while the shortest path lengths (5.1596 and 4.8984) and diameters (17 and 13) suggest efficient global reachability. The high modularity values (0.6970 and 0.6932) indicate strong community structure in both graphs. Overall, both networks exhibit hallmark properties of scale-free networks, with $\mathcal{G_{2}}$ showing a more pronounced scale-free behavior due to its lower $\alpha$ and lower $x_{\min}$ .
2.1 Basic Analysis of Recursive Graph Growth
We now move on to a detailed analysis of the evolution of the graph as the reasoning process unfolds over thinking iterations. This sheds light into how the iterative process dynamically changes the nature of the graph. The analysis is largely focused on $\mathcal{G_{1}}$ , albeit a few key results are also included for $\mathcal{G_{2}}$ . Detailed methods about how the various quantities are computed are included in Materials and Methods.
Figure 4 illustrates the evolution of key structural properties of the recursively generated knowledge graph. The number of nodes and edges both exhibit linear growth with iterations, indicating that the reasoning process systematically expands the graph without saturation. The increase in edges is slightly steeper than that of nodes, suggesting that each new concept introduced is integrated into an increasingly dense network of relationships rather than remaining isolated. This continuous expansion supports the hypothesis that the model enables open-ended knowledge discovery through recursive self-organization.
The average degree of the graph steadily increases, stabilizing around six edges per node. This trend signifies that the knowledge graph maintains a balance between exploration and connectivity, ensuring that newly introduced concepts remain well-integrated within the broader structure. Simultaneously, the maximum degree follows a non-linear trajectory, demonstrating that certain nodes become significantly more connected over time. This emergent hub formation is characteristic of scale-free networks and aligns with patterns observed in human knowledge organization, where certain concepts act as central abstractions that facilitate higher-order reasoning.
The size of the largest connected component (LCC) grows proportionally with the total number of nodes, reinforcing the observation that the graph remains a unified, traversable structure rather than fragmenting into disconnected subgraphs. This property is crucial for recursive reasoning, as it ensures that the system retains coherence while expanding. The average clustering coefficient initially fluctuates but stabilizes around 0.16, indicating that while localized connections are formed, the graph does not devolve into tightly clustered sub-networks. Instead, it maintains a relatively open structure that enables adaptive reasoning pathways.
These findings highlight the self-organizing nature of the recursive reasoning process, wherein hierarchical knowledge formation emerges without the need for predefined ontologies or supervised corrections. The presence of conceptual hubs, increasing relational connectivity, and sustained network coherence suggest that the model autonomously structures knowledge in a manner that mirrors epistemic intelligence. This emergent organization enables the system to navigate complex knowledge spaces efficiently, reinforcing the premise that intelligence-like behavior can arise through recursive, feedback-driven information processing. Further analysis of degree distribution and centrality metrics would provide deeper insights into the exact nature of this evolving graph topology.
<details>
<summary>x4.png Details</summary>

### Visual Description
## Line Charts: Network Growth Metrics vs. Iteration
### Overview
The image presents six line charts arranged in a 2x3 grid, each displaying a different network metric plotted against the iteration number. The metrics include the number of nodes, number of edges, average degree, maximum degree, largest component size, and average clustering coefficient. All charts share the same x-axis: "Iteration", ranging from 0 to 1000.
### Components/Axes
* **Overall Layout:** Six line charts labeled (a) through (f), arranged in two rows and three columns.
* **X-Axis (Shared):** "Iteration", with tick marks at 0, 200, 400, 600, 800, and 1000.
* **Chart (a): #Nodes vs. Iteration**
* Y-Axis: "#Nodes", with tick marks at 0, 500, 1000, 1500, 2000, 2500, 3000, 3500, and 4000.
* Data Series: A blue line representing the number of nodes.
* **Chart (b): #Edges vs. Iteration**
* Y-Axis: "#Edges", with tick marks at 0, 2000, 4000, 6000, 8000, 10000, and 12000.
* Data Series: An orange line representing the number of edges.
* **Chart (c): Avg Degree vs. Iteration**
* Y-Axis: "Avg Degree", with tick marks at 4.0, 4.5, 5.0, 5.5, and 6.0.
* Data Series: A green line representing the average degree.
* **Chart (d): Max Degree vs. Iteration**
* Y-Axis: "Max Degree", with tick marks at 0, 50, 100, 150, 200, and 250.
* Data Series: A red line representing the maximum degree.
* **Chart (e): Largest Comp Size vs. Iteration**
* Y-Axis: "Nodes in LCC", with tick marks at 0, 500, 1000, 1500, and 2000.
* Data Series: A purple line representing the number of nodes in the largest connected component (LCC).
* **Chart (f): Avg Clustering vs. Iteration**
* Y-Axis: "Clustering", with tick marks at 0.12, 0.14, 0.16, 0.18, 0.20, 0.22, 0.24, 0.26, and 0.28.
* Data Series: A teal line representing the average clustering coefficient.
### Detailed Analysis
* **#Nodes vs. Iteration (Chart a):** The blue line shows a generally increasing trend. Starting from approximately 0 nodes at iteration 0, the number of nodes increases to approximately 3800 at iteration 1000. The increase is not perfectly linear, showing some fluctuations.
* **#Edges vs. Iteration (Chart b):** The orange line shows a nearly linear increasing trend. Starting from approximately 0 edges at iteration 0, the number of edges increases to approximately 11800 at iteration 1000.
* **Avg Degree vs. Iteration (Chart c):** The green line shows an increasing trend that plateaus. Starting from approximately 4.2 at iteration 0, the average degree increases rapidly until around iteration 200, then continues to increase at a slower rate, reaching approximately 6.2 at iteration 1000.
* **Max Degree vs. Iteration (Chart d):** The red line shows a stepwise increasing trend. Starting from approximately 10 at iteration 0, the maximum degree increases in discrete jumps, reaching approximately 250 at iteration 1000.
* **Largest Comp Size vs. Iteration (Chart e):** The purple line shows an increasing trend. Starting from approximately 0 at iteration 0, the size of the largest connected component increases to approximately 2100 at iteration 1000.
* **Avg Clustering vs. Iteration (Chart f):** The teal line shows a decreasing trend followed by stabilization. Starting from approximately 0.28 at iteration 0, the average clustering coefficient decreases rapidly until around iteration 100, then fluctuates around a value of approximately 0.17 for the remaining iterations.
### Key Observations
* The number of nodes and edges generally increase with iteration.
* The average degree increases and then plateaus, suggesting a limit to how connected nodes become.
* The maximum degree increases in steps, indicating the addition of highly connected nodes at certain iterations.
* The largest component size increases, showing the network becoming more connected.
* The average clustering coefficient decreases initially and then stabilizes, suggesting that while the network grows, the local clustering of nodes does not continue to increase.
### Interpretation
The charts illustrate the evolution of a network as it grows over iterations. The increasing number of nodes and edges indicates network expansion. The average degree reaching a plateau suggests that nodes do not become arbitrarily connected; there might be constraints or preferential attachment mechanisms at play. The stepwise increase in maximum degree could indicate the addition of "hub" nodes at specific points in the network's evolution. The initial decrease and subsequent stabilization of the average clustering coefficient suggest that while the network becomes larger and more connected, the tendency for nodes to form tightly knit clusters does not continue to increase indefinitely. This could be due to the network expanding into new regions or the emergence of a core-periphery structure.
</details>
Figure 4: Evolution of basic graph properties over recursive iterations, highlighting the emergence of hierarchical structure, hub formation, and adaptive connectivity, for $\mathcal{G_{1}}$ .
Figure S5 illustrates the same analysis of the evolution of key structural properties of the recursively generated knowledge graph for graph $\mathcal{G_{2}}$ , as a comparison.
Structural Evolution of the Recursive Knowledge Graph
Figure 5 presents the evolution of three key structural properties, including Louvain modularity, average shortest path length, and graph diameter, over iterations. These metrics provide deeper insights into the self-organizing behavior of the graph as it expands through iterative reasoning. The Louvain modularity, depicted in Figure 5 (a), measures the strength of community structure within the graph. Initially, modularity increases sharply, reaching a peak around 0.75 within the first few iterations. This indicates that the early phases of reasoning lead to the rapid formation of well-defined conceptual clusters. As the graph expands, modularity stabilizes at approximately 0.70, suggesting that the system maintains distinct knowledge domains while allowing new interconnections to form. This behavior implies that the model preserves structural coherence, ensuring that the recursive expansion does not collapse existing conceptual groupings.
The evolution of the average shortest path length (SPL), shown in Figure 5 (b), provides further evidence of structured self-organization. Initially, the SPL increases sharply before stabilizing around 4.5–5.0. The initial rise reflects the introduction of new nodes that temporarily extend shortest paths before they are effectively integrated into the existing structure. The subsequent stabilization suggests that the recursive process maintains an efficient knowledge representation, ensuring that information remains accessible despite continuous expansion. This property is crucial for reasoning, as it implies that the system does not suffer from runaway growth in path lengths, preserving navigability.
The graph diameter, illustrated in Figure 5 (c), exhibits a stepwise increase, eventually stabilizing around 16–18. The staircase-like behavior suggests that the recursive expansion occurs in structured phases, where certain iterations introduce concepts that temporarily extend the longest shortest path before subsequent refinements integrate them more effectively. This bounded expansion indicates that the system autonomously regulates its hierarchical growth, maintaining a balance between depth and connectivity.
These findings reveal several emergent properties of the recursive reasoning model. The stabilization of modularity demonstrates the ability to autonomously maintain structured conceptual groupings, resembling human-like hierarchical knowledge formation. The controlled growth of the shortest path length highlights the system’s capacity for efficient information propagation, preventing fragmentation. We note that the bounded expansion of graph diameter suggests that reasoning-driven recursive self-organization is capable of structuring knowledge in a way that mirrors epistemic intelligence, reinforcing the hypothesis that certain forms of intelligent-like behavior can emerge without predefined ontologies.
<details>
<summary>x5.png Details</summary>

### Visual Description
## Chart: Louvain Modularity, Avg Shortest Path, and Diameter vs. Iteration
### Overview
The image presents three line charts comparing the Louvain Modularity, Average Shortest Path (Avg SPL), and Diameter against the number of iterations. Each chart displays how these metrics change over 1000 iterations.
### Components/Axes
* **Chart (a): Louvain Modularity vs. Iteration**
* X-axis: Iteration, ranging from 0 to 1000.
* Y-axis: Modularity, ranging from 0.45 to 0.80.
* Data Series: Magenta line representing Louvain Modularity.
* **Chart (b): Avg Shortest Path vs. Iteration**
* X-axis: Iteration, ranging from 0 to 1000.
* Y-axis: Avg SPL, ranging from 2 to 6.
* Data Series: Blue line representing Average Shortest Path.
* **Chart (c): Diameter vs. Iteration**
* X-axis: Iteration, ranging from 0 to 1000.
* Y-axis: Diameter, ranging from 4 to 18.
* Data Series: Red line representing Diameter.
### Detailed Analysis
* **Chart (a): Louvain Modularity vs. Iteration**
* Trend: The magenta line starts at approximately 0.45, rapidly increases to around 0.77 within the first 100 iterations, and then gradually decreases and stabilizes around 0.71 for the remaining iterations.
* Data Points:
* Iteration 0: Modularity ~0.45
* Iteration 100: Modularity ~0.77
* Iteration 1000: Modularity ~0.71
* **Chart (b): Avg Shortest Path vs. Iteration**
* Trend: The blue line starts at approximately 2, rapidly increases to around 6.2 within the first 100 iterations, and then gradually decreases and stabilizes around 5.2 for the remaining iterations.
* Data Points:
* Iteration 0: Avg SPL ~2
* Iteration 100: Avg SPL ~6.2
* Iteration 1000: Avg SPL ~5.2
* **Chart (c): Diameter vs. Iteration**
* Trend: The red line starts at approximately 4, increases stepwise to around 17 by iteration 400, and then remains relatively stable around 17 for the remaining iterations.
* Data Points:
* Iteration 0: Diameter ~4
* Iteration 400: Diameter ~17
* Iteration 1000: Diameter ~17
### Key Observations
* Louvain Modularity initially increases sharply and then stabilizes with a slight decrease.
* Average Shortest Path also increases sharply initially and then stabilizes with a slight decrease.
* Diameter increases in discrete steps and then stabilizes.
### Interpretation
The charts illustrate the convergence behavior of a network analysis algorithm over iterations. The Louvain Modularity and Average Shortest Path metrics show an initial rapid change, indicating the algorithm is quickly optimizing the network structure. The subsequent stabilization suggests the algorithm has reached a point of diminishing returns. The Diameter, representing the longest shortest path between any two nodes in the network, increases in steps, likely reflecting significant structural changes at specific iterations before plateauing. The data suggests that the algorithm finds a relatively stable network configuration after a certain number of iterations, with only minor adjustments occurring thereafter.
</details>
Figure 5: Evolution of key structural properties in the recursively generated knowledge graph $\mathcal{G_{1}}$ : (a) Louvain modularity, showing stable community formation; (b) average shortest path length, highlighting efficient information propagation; and (c) graph diameter, demonstrating bounded hierarchical expansion.
For comparison, Figure S4 presents the evolution of three key structural properties—Louvain modularity, average shortest path length, and graph diameter—over recursive iterations for graph $\mathcal{G_{2}}$ .
2.2 Analysis of Advanced Graph Evolution Metrics
Figure 6 presents the evolution of six advanced structural metrics over recursive iterations, capturing higher-order properties of the self-expanding knowledge graph. These measures provide insights into network organization, resilience, and connectivity patterns emerging during recursive reasoning.
Degree assortativity coefficient is a measure of the tendency of nodes to connect to others with similar degrees. A negative value indicates disassortativity (high-degree nodes connect to low-degree nodes), while a positive value suggests assortativity (nodes prefer connections to similarly connected nodes). The degree assortativity coefficient (Figure 6 (a)) begins with a strongly negative value near $-0.25$ , indicating a disassortative structure where high-degree nodes preferentially connect to low-degree nodes. Over time, assortativity increases and stabilizes around $-0.05$ , suggesting a gradual shift toward a more balanced connectivity structure without fully transitioning to an assortative regime. This trend is consistent with the emergence of hub-like structures, characteristic of scale-free networks, where a few nodes accumulate a disproportionately high number of connections.
The global transitivity (Figure 6 (b)), measuring the fraction of closed triplets in the network, exhibits an initial peak near 0.35 before rapidly declining and stabilizing towards 0.10, albeit still decreasing. This suggests that early in the recursive reasoning process, the graph forms tightly clustered regions, likely due to localized conceptual groupings. As iterations progress, interconnections between distant parts of the graph increase, reducing local clustering and favoring long-range connectivity, a hallmark of expanding knowledge networks.
The $k$ -core Index defines the largest integer $k$ for which a subgraph exists where all nodes have at least $k$ connections. A higher maximum $k$ -core index suggests a more densely interconnected core. The maximum $k$ -core index (Figure 6 (c)), representing the deepest level of connectivity, increases in discrete steps, reaching a maximum value of 11. This indicates that as the graph expands, an increasingly dense core emerges, reinforcing the formation of highly interconnected substructures. The stepwise progression suggests that specific iterations introduce structural reorganizations that significantly enhance connectivity rather than continuous incremental growth.
We observe that the size of the largest $k$ -core (Figure 6 (d)) follows a similar pattern, growing in discrete steps and experiencing a sudden drop around iteration 700 before stabilizing again. This behavior suggests that the graph undergoes structural realignments, possibly due to the introduction of new reasoning pathways that temporarily reduce the dominance of the most connected core before further stabilization.
Betweenness Centrality is a measure of how often a node appears on the shortest paths between other nodes. High betweenness suggests a critical role in information flow, while a decrease indicates decentralization and redundancy in pathways. The average betweenness centrality (Figure 6 (e)) initially exhibits high values, indicating that early reasoning iterations rely heavily on specific nodes to mediate information flow. Over time, betweenness declines and stabilizes a bit below 0.01, suggesting that the graph becomes more navigable and distributed, reducing reliance on key bottleneck nodes over more iterations. This trend aligns with the emergence of redundant reasoning pathways, making the system more robust to localized disruptions.
Articulation points are nodes whose removal would increase the number of disconnected components in the graph, meaning they serve as key bridges between different knowledge clusters. The number of articulation points (Figure 6 (f)) steadily increases throughout iterations, reaching over 800. This suggests that as the knowledge graph expands, an increasing number of bridging nodes emerge, reflecting a hierarchical structure where key nodes maintain connectivity between distinct regions. Despite this increase, the network remains well connected, indicating that redundant pathways mitigate the risk of fragmentation.
A network where the degree distribution follows a power-law, meaning most nodes have few connections, but a small number (hubs) have many (supporting the notion of a scale-free network). Our findings provide evidence that the recursive graph reasoning process spontaneously organizes into a hierarchical, scale-free structure, balancing local clustering, global connectivity, and efficient navigability. The noted trends in assortativity, core connectivity, and betweenness centrality confirm that the system optimally structures its knowledge representation over iterations, reinforcing the hypothesis that self-organized reasoning processes naturally form efficient and resilient knowledge networks.
<details>
<summary>x6.png Details</summary>

### Visual Description
## Multiple Line Charts: Network Analysis Metrics vs. Iteration
### Overview
The image presents six line charts arranged in a 2x3 grid, each displaying a different network analysis metric's evolution over iterations. The metrics include Degree Assortativity, Global Transitivity, Max k-Core Index, Size of Largest k-Core, Average Betweenness, and the Number of Articulation Points. All charts share a common x-axis representing "Iteration," ranging from 0 to 1000.
### Components/Axes
* **General:**
* X-axis (all charts): "Iteration," ranging from 0 to 1000.
* **(a) Degree Assortativity vs. Iteration:**
* Y-axis: "Assortativity," ranging from -0.25 to 0.00.
* Data Series: Blue line representing Degree Assortativity.
* **(b) Global Transitivity vs. Iteration:**
* Y-axis: "Transitivity," ranging from 0.10 to 0.35.
* Data Series: Green line representing Global Transitivity.
* **(c) Max k-Core Index vs. Iteration:**
* Y-axis: "Max k," ranging from 5 to 11.
* Data Series: Red line representing Max k-Core Index.
* **(d) Size of Largest k-Core vs. Iteration:**
* Y-axis: "#Nodes in max k-core," ranging from 0 to 70.
* Data Series: Purple line representing the Size of Largest k-Core.
* **(e) Avg Betweenness vs. Iteration:**
* Y-axis: "Betweenness," ranging from 0.00 to 0.14.
* Data Series: Orange line representing Average Betweenness.
* **(f) #Articulation Points vs. Iteration:**
* Y-axis: "Count," ranging from 0 to 800.
* Data Series: Teal line representing the Number of Articulation Points.
### Detailed Analysis
* **(a) Degree Assortativity vs. Iteration:**
* Trend: The blue line starts at approximately -0.25 at iteration 0, rapidly increases to around -0.05 by iteration 100, and then gradually increases to stabilize around -0.04 for iterations 400-1000.
* Data Points:
* Iteration 0: Assortativity ≈ -0.25
* Iteration 100: Assortativity ≈ -0.05
* Iteration 400-1000: Assortativity ≈ -0.04
* **(b) Global Transitivity vs. Iteration:**
* Trend: The green line starts at approximately 0.37 at iteration 0, rapidly decreases to around 0.15 by iteration 200, and then gradually decreases to stabilize around 0.09 for iterations 600-1000.
* Data Points:
* Iteration 0: Transitivity ≈ 0.37
* Iteration 200: Transitivity ≈ 0.15
* Iteration 600-1000: Transitivity ≈ 0.09
* **(c) Max k-Core Index vs. Iteration:**
* Trend: The red line shows a step-wise increase. It starts at 5, jumps to 7 around iteration 50, then to 8 around iteration 150, then to 9 around iteration 250, then to 10 around iteration 350, and finally to 11 around iteration 750, where it remains stable.
* Data Points:
* Iteration 0-50: Max k = 5
* Iteration 50-150: Max k = 7
* Iteration 150-250: Max k = 8
* Iteration 250-350: Max k = 9
* Iteration 350-750: Max k = 10
* Iteration 750-1000: Max k = 11
* **(d) Size of Largest k-Core vs. Iteration:**
* Trend: The purple line shows a step-wise increase, similar to the Max k-Core Index. It starts around 5, jumps to 10 around iteration 50, then to 15 around iteration 100, then to 20 around iteration 150, then to 30 around iteration 200, then to 43 around iteration 350, then to 45 around iteration 400, then to 50 around iteration 550, then to 72 around iteration 650, and finally drops to 30 around iteration 750, where it remains stable.
* Data Points:
* Iteration 0-50: #Nodes ≈ 5
* Iteration 50-100: #Nodes ≈ 10
* Iteration 100-150: #Nodes ≈ 15
* Iteration 150-200: #Nodes ≈ 20
* Iteration 200-350: #Nodes ≈ 30
* Iteration 350-400: #Nodes ≈ 43
* Iteration 400-550: #Nodes ≈ 45
* Iteration 550-650: #Nodes ≈ 50
* Iteration 650-750: #Nodes ≈ 72
* Iteration 750-1000: #Nodes ≈ 30
* **(e) Avg Betweenness vs. Iteration:**
* Trend: The orange line starts at approximately 0.14 at iteration 0, rapidly decreases to around 0.01 by iteration 200, and then gradually decreases to stabilize around 0.003 for iterations 600-1000.
* Data Points:
* Iteration 0: Betweenness ≈ 0.14
* Iteration 200: Betweenness ≈ 0.01
* Iteration 600-1000: Betweenness ≈ 0.003
* **(f) #Articulation Points vs. Iteration:**
* Trend: The teal line shows a nearly linear increase from approximately 0 at iteration 0 to approximately 850 at iteration 1000.
* Data Points:
* Iteration 0: Count ≈ 0
* Iteration 1000: Count ≈ 850
### Key Observations
* **Convergence:** Several metrics (Degree Assortativity, Global Transitivity, Avg Betweenness) show a trend of converging to a stable value after an initial period of rapid change.
* **Step-wise Changes:** The Max k-Core Index and Size of Largest k-Core exhibit step-wise increases, suggesting discrete phases of network evolution.
* **Linear Growth:** The Number of Articulation Points increases almost linearly with the number of iterations.
### Interpretation
The charts illustrate how various network properties evolve over iterations, likely during a network growth or optimization process. The convergence of metrics like Assortativity, Transitivity, and Betweenness suggests that the network reaches a relatively stable state after a certain number of iterations. The step-wise changes in k-Core related metrics indicate that the network's core structure is evolving in discrete stages. The linear increase in articulation points suggests a continuous addition of critical nodes that connect different parts of the network. These observations provide insights into the dynamics and structural changes occurring within the network as it evolves.
</details>
Figure 6: Evolution of advanced structural properties in the recursively generated knowledge graph $\mathcal{G_{1}}$ : (a) degree assortativity, (b) global transitivity, (c) maximum k-core index, (d) size of the largest k-core, (e) average betweenness centrality, and (f) number of articulation points. These metrics reveal the emergence of hierarchical organization, hub formation, and increased navigability over recursive iterations.
2.3 Evolution of Newly Connected Pairs
Figure 7 presents the evolution of newly connected node pairs as a function of iteration, illustrating how the recursive reasoning process expands the knowledge graph over time. This metric captures the rate at which new relationships are established between nodes, providing insights into the self-organizing nature of the network.
In the early iterations (0–100), the number of newly connected pairs exhibits high variance, fluctuating between 0 and 400 connections per iteration. This suggests that the initial phase of recursive reasoning leads to significant structural reorganization, where large bursts of new edges are formed as the network establishes its fundamental connectivity patterns. The high variability in this region indicates an exploratory phase, where the graph undergoes rapid adjustments to define its core structure.
Beyond approximately 200 iterations, the number of newly connected pairs stabilizes around 500–600 per iteration, with only minor fluctuations. This plateau suggests that the knowledge graph has transitioned into a steady-state expansion phase, where new nodes and edges are integrated into an increasingly structured and predictable manner. Unlike random growth, this behavior indicates that the system follows a self-organized expansion process, reinforcing existing structures rather than disrupting them.
The stabilization at a high connection rate suggests the emergence of hierarchical organization, where newly introduced nodes preferentially attach to well-established structures. This pattern aligns with the scale-free properties observed in other experimentally acquired knowledge networks, where central concepts continuously accumulate new links, strengthening core reasoning pathways. The overall trend highlights how recursive self-organization leads to sustained, structured knowledge expansion, rather than arbitrary or saturation-driven growth.
<details>
<summary>x7.png Details</summary>

### Visual Description
## Line Chart: Newly Connected Pairs (sampled) vs. Iteration
### Overview
The image is a line chart showing the relationship between the number of newly connected pairs (sampled) and the iteration number. The chart displays a single data series, represented by a blue line, which initially increases rapidly and then plateaus with some fluctuations.
### Components/Axes
* **Title:** Newly Connected Pairs (sampled) vs. Iteration
* **X-axis:** Iteration
* Scale: 0 to 1000, with tick marks at 0, 200, 400, 600, 800, and 1000.
* **Y-axis:** Count of newly connected pairs
* Scale: 0 to 600, with tick marks at 0, 100, 200, 300, 400, 500, and 600.
* **Data Series:**
* Newly Connected Pairs (sampled) - represented by a blue line.
### Detailed Analysis
The blue line represents the count of newly connected pairs at each iteration.
* **Initial Phase (Iteration 0-200):** The line starts at approximately 0 at iteration 0. It rapidly increases to approximately 470 around iteration 50, then fluctuates between 450 and 550 until iteration 200.
* **Plateau Phase (Iteration 200-1000):** From iteration 200 onwards, the line generally plateaus, fluctuating between approximately 470 and 600. There are some dips and peaks, but the overall trend is relatively stable. The line ends at approximately 580 at iteration 1000.
Here are some approximate data points:
* Iteration 0: ~0
* Iteration 25: ~250
* Iteration 50: ~470
* Iteration 100: ~320
* Iteration 200: ~520
* Iteration 400: ~550
* Iteration 600: ~530
* Iteration 800: ~550
* Iteration 1000: ~580
### Key Observations
* The number of newly connected pairs increases rapidly in the initial iterations.
* The number of newly connected pairs plateaus after approximately 200 iterations.
* There are fluctuations in the number of newly connected pairs even after the plateau phase.
### Interpretation
The chart suggests that the process of forming new connections is most active in the early iterations. After a certain point (around 200 iterations), the rate of forming new connections slows down significantly, and the system reaches a state of relative equilibrium. The fluctuations observed after the plateau phase could be due to random variations or other factors influencing the connection process. The data demonstrates a diminishing return in new connections as the number of iterations increases.
</details>
Figure 7: Evolution of newly connected node pairs over recursive iterations, $\mathcal{G_{1}}$ . Early iterations exhibit high variability, reflecting an exploratory phase of rapid structural reorganization. Beyond 200 iterations, the process stabilizes, suggesting a steady-state expansion phase with sustained connectivity formation.
The observed transition from high-variance, exploratory graph expansion to a stable, structured growth phase suggests that recursive self-organization follows a process similar to human cognitive learning and scientific discovery. We believe that this indicates that in early iterations, the system explores diverse reasoning pathways, mirroring how scientific fields establish foundational concepts through broad exploration before refining them into structured disciplines [1]. The stabilization of connectivity beyond 200 iterations reflects preferential attachment dynamics, a hallmark of scale-free networks where highly connected nodes continue to accumulate new links, much like citation networks in academia [42]. This mechanism ensures that core concepts serve as attractors for further knowledge integration, reinforcing structured reasoning while maintaining adaptability. Importantly, the system does not exhibit saturation or stagnation, suggesting that open-ended knowledge discovery is possible through recursive reasoning alone, without requiring predefined ontologies or externally imposed constraints. This aligns with findings in AI-driven scientific hypothesis generation, where graph-based models dynamically infer new connections by iterating over expanding knowledge structures [39, 41]. The ability of the system to self-organize, expand, and refine its knowledge base autonomously underscores its potential as a scalable framework for automated scientific discovery and epistemic reasoning.
2.4 Analysis of Node Centrality Distributions at Final Stage of Reasoning
Next, Figure 8 presents histograms for three key centrality measures—betweenness centrality, closeness centrality, and eigenvector centrality—computed for the recursively generated knowledge graph, at the final iteration. These metrics provide insights into the role of different nodes in maintaining connectivity, network efficiency, and global influence.
Figure 8 (a) shows the distribution of betweenness centrality. We find the distribution of betweenness centrality to be highly skewed, with the majority of nodes exhibiting values close to zero. Only a small fraction of nodes attain significantly higher centrality values, indicating that very few nodes serve as critical intermediaries for shortest paths. This pattern is characteristic of hierarchical or scale-free networks, where a small number of hub nodes facilitate global connectivity, while most nodes remain peripheral. The presence of a few high-betweenness outliers suggests that key nodes emerge as crucial mediators of information flow, reinforcing the hypothesis that self-organizing structures lead to the formation of highly connected bridging nodes.
Figure 8 (b) depicts the closeness centrality distribution. It follows an approximately normal distribution centered around 0.20, suggesting that most nodes remain well-connected within the network. This result implies that the network maintains a compact structure, allowing for efficient navigation between nodes despite continuous expansion. The relatively low spread indicates that the recursive reasoning process prevents excessive distance growth, ensuring that newly introduced nodes do not become isolated. This reinforces the observation that the graph remains navigable as it evolves, an essential property for maintaining coherent reasoning pathways.
Next, Figure 8 (c) shows the eigenvector centrality distribution, identified to be also highly skewed, with most nodes having values close to zero. However, a few nodes attain substantially higher eigenvector centrality scores, indicating that only a select few nodes dominate the network in terms of global influence. This suggests that the network naturally organizes into a hierarchical structure, where dominant hubs accumulate influence over time, while the majority of nodes play a more peripheral role. The emergence of high-eigenvector hubs aligns with scale-free network behavior, further supporting the idea that reasoning-driven recursive self-organization leads to structured knowledge representation.
These findings indicate that the recursive knowledge graph balances global connectivity and local modularity, self-organizing into a structured yet efficient system. The few high-betweenness nodes act as key mediators, while the closeness centrality distribution suggests that the network remains efficiently connected. The eigenvector centrality pattern highlights the formation of dominant conceptual hubs, reinforcing the presence of hierarchical knowledge organization within the evolving reasoning framework.
<details>
<summary>x8.png Details</summary>

### Visual Description
## Histogram: Centrality Measures Distribution
### Overview
The image presents three histograms, each displaying the distribution of a different centrality measure: Betweenness Centrality, Closeness Centrality, and Eigenvector Centrality. The histograms are arranged horizontally, labeled (a), (b), and (c) respectively. Each histogram plots the count of nodes (y-axis) against the centrality value (x-axis).
### Components/Axes
* **Titles:**
* (a) Betweenness Centrality
* (b) Closeness Centrality
* (c) Eigenvector Centrality
* **X-axis (Value):**
* All three histograms share the same x-axis scale, ranging from 0.00 to 0.30, with tick marks at intervals of 0.05.
* **Y-axis (Count):**
* Histogram (a) ranges from 0 to 2000, with tick marks at intervals of 250.
* Histogram (b) ranges from 0 to 200, with tick marks at intervals of 25.
* Histogram (c) ranges from 0 to 1400, with tick marks at intervals of 200.
* **Colors:**
* Betweenness Centrality (a) is represented in blue.
* Closeness Centrality (b) is represented in red.
* Eigenvector Centrality (c) is represented in green.
### Detailed Analysis
* **Betweenness Centrality (a):**
* The distribution is heavily skewed to the left.
* A large number of nodes have a betweenness centrality value close to 0.
* The count at value 0.00 is approximately 2050.
* The count at value 0.05 is approximately 100.
* The count at value 0.10 is approximately 25.
* The count at value 0.15 is approximately 10.
* The count at value 0.20 is approximately 5.
* The count at value 0.25 is approximately 2.
* **Closeness Centrality (b):**
* The distribution is approximately normal, centered around 0.20.
* The count at value 0.10 is approximately 5.
* The count at value 0.15 is approximately 75.
* The count at value 0.20 is approximately 200.
* The count at value 0.25 is approximately 50.
* The count at value 0.30 is approximately 5.
* **Eigenvector Centrality (c):**
* The distribution is heavily skewed to the left, similar to Betweenness Centrality.
* A large number of nodes have an eigenvector centrality value close to 0.
* The count at value 0.00 is approximately 1450.
* The count at value 0.05 is approximately 350.
* The count at value 0.10 is approximately 50.
* The count at value 0.15 is approximately 10.
* The count at value 0.20 is approximately 5.
* The count at value 0.25 is approximately 2.
* The count at value 0.30 is approximately 1.
### Key Observations
* Betweenness and Eigenvector Centrality distributions are highly skewed, indicating that most nodes have very low centrality values for these measures.
* Closeness Centrality exhibits a more normal distribution, suggesting a more even distribution of closeness centrality values among the nodes.
* The scales of the y-axes differ significantly, reflecting the different ranges of counts for each centrality measure.
### Interpretation
The histograms illustrate the distribution of three different centrality measures within a network. The skewness observed in the Betweenness and Eigenvector Centrality distributions suggests that only a small fraction of nodes have high influence or control over the network's communication pathways (Betweenness) or are highly connected to other influential nodes (Eigenvector). In contrast, the more normal distribution of Closeness Centrality indicates that nodes are, on average, relatively close to all other nodes in the network.
These distributions can provide insights into the network's structure and dynamics. For example, a network with a highly skewed Betweenness Centrality distribution might be vulnerable to disruptions if the few nodes with high betweenness are removed. Similarly, a network with a more evenly distributed Closeness Centrality might be more resilient to such disruptions.
</details>
Figure 8: Distribution of node centrality measures in the recursively generated knowledge graph, for $\mathcal{G_{1}}$ : (a) Betweenness centrality, showing that only a few nodes serve as major intermediaries; (b) Closeness centrality, indicating that the majority of nodes remain well-connected; (c) Eigenvector centrality, revealing the emergence of dominant hub nodes. These distributions highlight the hierarchical and scale-free nature of the evolving knowledge graph.
Figure 9 presents the distribution of sampled shortest path lengths. This distribution provides insights into the overall compactness, navigability, and structural efficiency of the network.
The histogram reveals that the most frequent shortest path length is centered around 5–6 steps, indicating that the majority of node pairs are relatively close in the network. The distribution follows a bell-shaped pattern, suggesting a typical range of distances between nodes, with a slight right skew where some paths extend beyond 10 steps. The presence of longer paths implies that certain nodes remain in the periphery or are indirectly connected to the core reasoning structure.
The relatively narrow range of shortest path lengths affirms that the network remains well-integrated, ensuring efficient knowledge propagation and retrieval. The absence of extreme outliers suggests that the recursive expansion process does not lead to fragmented or sparsely connected regions. This structure contrasts with purely random graphs, where shortest path lengths typically exhibit a narrower peak at lower values. The broader peak observed here suggests that the model does not generate arbitrary connections but instead organizes knowledge in a structured manner, balancing global integration with local modularity.
The observed path length distribution supports the hypothesis that recursive graph reasoning constructs an efficiently connected knowledge framework, where most concepts can be accessed within a small number of steps. The presence of some longer paths further suggests that the network exhibits hierarchical expansion, with certain areas developing as specialized subdomains that extend outward from the core structure.
<details>
<summary>x9.png Details</summary>

### Visual Description
## Histogram: Distribution of Shortest Path Lengths (sampled)
### Overview
The image contains two histograms, labeled (a) and (b), displaying the distribution of shortest path lengths. Both histograms show the frequency of different shortest path lengths, with the x-axis representing the shortest path length and the y-axis representing the frequency. The bars are colored purple.
### Components/Axes
* **Title (a):** Distribution of Shortest Path Lengths (sampled)
* **Title (b):** Distribution of Shortest Path Lengths (sampled)
* **X-axis Label:** Shortest Path Length
* **X-axis Scale:** 2, 4, 6, 8, 10, 12 (for chart a)
* **X-axis Scale:** 2, 4, 6, 8, 10 (for chart b)
* **Y-axis Label:** Frequency
* **Y-axis Scale:** 0, 100, 200, 300, 400, 500
### Detailed Analysis
**Histogram (a):**
* **Shortest Path Length 2:** Frequency approximately 50
* **Shortest Path Length 3:** Frequency approximately 200
* **Shortest Path Length 4:** Frequency approximately 420
* **Shortest Path Length 5:** Frequency approximately 540
* **Shortest Path Length 6:** Frequency approximately 450
* **Shortest Path Length 7:** Frequency approximately 250
* **Shortest Path Length 8:** Frequency approximately 100
* **Shortest Path Length 9:** Frequency approximately 50
* **Shortest Path Length 10:** Frequency approximately 20
* **Shortest Path Length 11:** Frequency approximately 10
* **Shortest Path Length 12:** Frequency approximately 5
**Histogram (b):**
* **Shortest Path Length 2:** Frequency approximately 20
* **Shortest Path Length 3:** Frequency approximately 250
* **Shortest Path Length 4:** Frequency approximately 420
* **Shortest Path Length 5:** Frequency approximately 490
* **Shortest Path Length 6:** Frequency approximately 520
* **Shortest Path Length 7:** Frequency approximately 350
* **Shortest Path Length 8:** Frequency approximately 70
* **Shortest Path Length 9:** Frequency approximately 60
* **Shortest Path Length 10:** Frequency approximately 10
### Key Observations
* Both histograms show a similar distribution pattern, with the frequency increasing to a peak around shortest path lengths of 5 or 6, and then decreasing as the shortest path length increases.
* Histogram (a) has a longer tail, extending to a shortest path length of 12, while histogram (b) only extends to 10.
* The peak frequency is slightly higher in histogram (b) compared to histogram (a).
### Interpretation
The histograms illustrate the distribution of shortest path lengths in a sampled network or graph. The data suggests that the most common shortest path length is around 5 or 6. The shape of the distribution indicates that shorter paths are more frequent than longer paths. The difference between the two histograms (a) and (b) might be due to different sampling methods or different network structures. The longer tail in histogram (a) suggests that there are some longer shortest paths present in the sample represented by (a) that are less common or absent in the sample represented by (b).
</details>
Figure 9: Distribution of sampled shortest path lengths in the recursively generated knowledge graphs (panel (a), for graph $\mathcal{G_{2}}$ , panel (b), graph $\mathcal{G_{2}}$ ). The peak around 5–6 steps suggests that the network remains compact and navigable, while the slight right skew especially in panel (a) indicates the presence of peripheral nodes or specialized subdomains.
2.5 Knowledge Graph Evolution and Conceptual Breakthroughs
The evolution of the knowledge graph over iterative expansions discussed so far reveals distinct patterns in knowledge accumulation, conceptual breakthroughs, and interdisciplinary integration. To analyze these processes, we now examine (i) the growth trajectories of major conceptual hubs, (ii) the emergence of new highly connected nodes, and (iii) overall network connectivity trends across iterations. The results of these analyses are presented in Figure 11, which consists of three sub-components.
<details>
<summary>x10.png Details</summary>

### Visual Description
## Chart Analysis: Network Growth and Connectivity Over Iterations
### Overview
The image presents three charts analyzing the growth and connectivity of a network over iterations. Chart (a) shows the growth of the absolute degree of top hubs over iterations for various nodes. Chart (b) displays the emergence of new hubs over iterations. Chart (c) illustrates the overall network connectivity over time, measured by the average node degree.
### Components/Axes
**Chart (a): Growth of Top Hubs Over Iterations**
* **Title:** Growth of Top Hubs Over Iterations
* **X-axis:** Iteration (0 to 1000)
* **Y-axis:** Absolute Degree (0 to 200)
* **Legend (Top-Right):**
* Node Artificial Intelligence (AI) - Blue
* Node Knowledge Graph - Orange
* Node Urban Ecosystems - Green
* Node Bioluminescent Technology - Red
* Node Learning Outcomes - Purple
* Node Climate Change - Brown
* Node Resilience - Pink
* Node Human Well-being - Gray
* Node Bioluminescent Organisms - Yellow
* Node Symbiotic Relationships - Teal
**Chart (b): Emergence of New Hubs Over Iterations**
* **Title:** Emergence of New Hubs Over Iterations
* **X-axis:** Iteration Number (0 to 200)
* **Y-axis:** Number of New Hubs (0.00 to 2.00)
**Chart (c): Overall Network Connectivity Over Time**
* **Title:** Overall Network Connectivity Over Time
* **X-axis:** Iteration (0 to 1000)
* **Y-axis:** Average Node Degree (0 to 4)
### Detailed Analysis
**Chart (a): Growth of Top Hubs Over Iterations**
* **Node Artificial Intelligence (AI) (Blue):** Shows the most significant growth, starting at approximately 0 and reaching approximately 195 by iteration 1000. The line has several steep upward jumps, indicating rapid growth spurts.
* **Node Knowledge Graph (Orange):** Starts at approximately 0, increases to approximately 130 by iteration 1000. The growth is less steep than AI.
* **Node Urban Ecosystems (Green):** Starts at approximately 0, increases to approximately 65 by iteration 1000.
* **Node Bioluminescent Technology (Red):** Starts at approximately 0, increases to approximately 105 by iteration 1000.
* **Node Learning Outcomes (Purple):** Starts at approximately 0, increases to approximately 85 by iteration 1000.
* **Node Climate Change (Brown):** Starts at approximately 0, increases to approximately 80 by iteration 1000.
* **Node Resilience (Pink):** Starts at approximately 0, increases to approximately 100 by iteration 1000.
* **Node Human Well-being (Gray):** Starts at approximately 0, increases to approximately 70 by iteration 1000.
* **Node Bioluminescent Organisms (Yellow):** Starts at approximately 0, increases to approximately 60 by iteration 1000.
* **Node Symbiotic Relationships (Teal):** Starts at approximately 0, increases to approximately 65 by iteration 1000.
**Chart (b): Emergence of New Hubs Over Iterations**
* The chart shows the number of new hubs emerging at specific iteration numbers.
* At iteration 10, the number of new hubs is approximately 1.
* At iteration 30, the number of new hubs is approximately 1.
* At iteration 40, the number of new hubs is approximately 1.
* At iteration 90, the number of new hubs peaks at approximately 2.
* At iteration 100, the number of new hubs is approximately 1.
* At iteration 220, the number of new hubs is approximately 1.
**Chart (c): Overall Network Connectivity Over Time**
* The average node degree increases almost linearly from approximately 0.2 at iteration 0 to approximately 4.5 at iteration 1000.
* The trend is consistently upward, indicating increasing network connectivity over time.
### Key Observations
* **Chart (a):** Node Artificial Intelligence (AI) exhibits the most significant growth in absolute degree compared to other nodes.
* **Chart (b):** The emergence of new hubs is concentrated in the early iterations, with a peak around iteration 90.
* **Chart (c):** The overall network connectivity, as measured by average node degree, increases steadily over time.
### Interpretation
The charts collectively suggest that the network experiences significant growth in both individual hub degrees and overall connectivity. The "Node Artificial Intelligence (AI)" becomes a dominant hub, while other nodes grow at varying rates. The emergence of new hubs is more prominent in the early stages of the network's evolution. The consistent increase in average node degree indicates that the network becomes more interconnected as iterations progress. The data demonstrates the dynamic nature of network growth, with some nodes becoming more central and the overall network becoming more densely connected over time.
</details>
Figure 10: Evolution of knowledge graph structure across iterations, for $\mathcal{G_{1}}$ . (a) Degree growth of the top conceptual hubs, showing both steady accumulation and sudden breakthroughs. (b) Histogram of newly emerging high-degree nodes across iterations, indicating phases of conceptual expansion. (c) Plot of the mean node degree over time, illustrating the system’s progressive integration of new knowledge.
The trajectory of hub development (Figure 10 (a)) suggests two primary modes of knowledge accumulation: steady growth and conceptual breakthroughs. Certain concepts, such as Artificial Intelligence (AI) and Knowledge Graphs, exhibit continuous incremental expansion, reflecting their persistent relevance in structuring knowledge. In contrast, hubs like Bioluminescent Technology and Urban Ecosystems experience extended periods of low connectivity followed by sudden increases in node degree, suggesting moments when these concepts became structurally significant in the knowledge graph. These results indicate that the system does not expand knowledge in a purely linear fashion but undergoes phases of conceptual restructuring, akin to punctuated equilibrium in scientific development.
The emergence of new hubs (Figure 10 (b)) further supports this interpretation. Instead of a continuous influx of new central concepts, we observe discrete bursts of hub formation occurring at specific iteration milestones. These bursts likely correspond to the accumulation of contextual knowledge reaching a critical threshold, after which the system autonomously generates new organizing principles to structure its expanding knowledge base. This finding suggests that the system’s reasoning process undergoes alternating cycles of consolidation and discovery, where previously formed knowledge stabilizes before new abstract concepts emerge.
The overall network connectivity trends (Figure 10 (c)) demonstrate a steady increase in average node degree, indicating that the graph maintains a structurally stable expansion while integrating additional knowledge. The absence of abrupt drops in connectivity suggests that previously introduced concepts remain relevant and continue to influence reasoning rather than become obsolete. This trend supports the hypothesis that the system exhibits self-organizing knowledge structures, continuously refining its conceptual hierarchy as it expands.
These observations lead to several overarching conclusions. First, the results indicate that the system follows a hybrid knowledge expansion model, combining gradual accumulation with disruptive conceptual breakthroughs. This behavior closely mirrors the dynamics of human knowledge formation, where foundational ideas develop progressively, but major paradigm shifts occur when conceptual thresholds are crossed. Second, the persistence of high-degree hubs suggests that knowledge graphs generated in this manner do not suffer from catastrophic forgetting; instead, they maintain and reinforce previously established structures while integrating new insights. Third, the formation of new hubs in discrete bursts implies that knowledge expansion is not driven by uniform growth but by self-reinforcing epistemic structures, where accumulated reasoning reaches a tipping point that necessitates new abstract representations.
Additionally, the system demonstrates a structured directionality in knowledge formation, as evidenced by the smooth increase in average node degree without fragmentation. This suggests that new concepts do not disrupt existing structures but are incrementally woven into the broader network. Such behavior is characteristic of self-organizing knowledge systems, where conceptual evolution follows a dynamic yet cohesive trajectory. The model also exhibits potential for cross-domain knowledge synthesis, as indicated by the presence of nodes that transition into highly connected hubs later in the process. These nodes likely act as bridges between previously distinct knowledge clusters, fostering interdisciplinary connections.
These analyses provide strong evidence that the recursive graph expansion model is capable of simulating key characteristics of scientific knowledge formation. The presence of alternating stability and breakthrough phases, the hierarchical organization of concepts, and the increasing connectivity across knowledge domains all highlight the potential for autonomous reasoning systems to construct, refine, and reorganize knowledge representations dynamically. Future research could potentially focus on exploring the role of interdisciplinary bridge nodes, analyzing the hierarchical depth of reasoning paths, and examining whether the system can autonomously infer meta-theoretical insights from its evolving knowledge graph.
2.6 Structural Evolution of the Knowledge Graph
The expansion of the knowledge graph over iterative refinements reveals emergent structural patterns that highlight how knowledge communities form, how interdisciplinary connections evolve, and how reasoning complexity changes over time. These dynamics provide insight into how autonomous knowledge expansion follows systematic self-organization rather than random accumulation. Figure 11 presents three key trends: (a) the formation and growth of knowledge sub-networks, (b) the number of bridge nodes that connect different knowledge domains, and (c) the depth of multi-hop reasoning over iterations.
<details>
<summary>x11.png Details</summary>

### Visual Description
## Line Charts: Evolution of Knowledge and Reasoning
### Overview
The image presents three line charts illustrating the evolution of knowledge communities, the number of concepts connecting different domains, and the depth of multi-hop reasoning over time. Each chart plots data against "Iteration" on the x-axis.
### Components/Axes
**Chart (a): Evolution of Knowledge Communities Over Time**
* **Title:** Evolution of Knowledge Communities Over Time
* **X-axis:** Iteration (ranging from 0 to 1000)
* **Y-axis:** Number of Sub-Networks (ranging from 5 to 30)
* **Data Series:** A purple line representing the number of sub-networks.
**Chart (b): Number of Concepts Connecting Different Domains Over Time**
* **Title:** Number of Concepts Connecting Different Domains Over Time
* **X-axis:** Iteration (ranging from 0 to 1050)
* **Y-axis:** Number of Bridge Nodes (ranging from 0 to 1200)
* **Data Series:** A red line representing the number of bridge nodes.
**Chart (c): Depth of Multi-Hop Reasoning Over Time**
* **Title:** Depth of Multi-Hop Reasoning Over Time
* **X-axis:** Iteration (ranging from 0 to 1000)
* **Y-axis:** Avg Shortest Path Length (ranging from 3 to 7)
* **Data Series:** A blue line representing the average shortest path length.
### Detailed Analysis
**Chart (a): Evolution of Knowledge Communities Over Time**
* **Trend:** The purple line shows an initial rapid increase in the number of sub-networks, followed by a period of fluctuation and eventual stabilization.
* **Data Points:**
* At Iteration 0, the number of sub-networks is approximately 6.
* By Iteration 200, the number of sub-networks reaches approximately 17.
* From Iteration 600 onwards, the number of sub-networks fluctuates between approximately 22 and 28.
**Chart (b): Number of Concepts Connecting Different Domains Over Time**
* **Trend:** The red line shows a generally linear, positive trend, indicating a steady increase in the number of bridge nodes over time.
* **Data Points:**
* At Iteration 0, the number of bridge nodes is approximately 0.
* At Iteration 300, the number of bridge nodes is approximately 300.
* At Iteration 600, the number of bridge nodes is approximately 700.
* At Iteration 900, the number of bridge nodes is approximately 1050.
**Chart (c): Depth of Multi-Hop Reasoning Over Time**
* **Trend:** The blue line shows an initial spike in the average shortest path length, followed by a sharp decline and then stabilization.
* **Data Points:**
* At Iteration 0, the average shortest path length is approximately 2.5.
* Around Iteration 50, the average shortest path length peaks at approximately 7.
* By Iteration 200, the average shortest path length drops to approximately 5.2.
* From Iteration 400 onwards, the average shortest path length stabilizes around 5.
### Key Observations
* The number of sub-networks (Chart a) initially grows rapidly but plateaus over time.
* The number of bridge nodes (Chart b) increases linearly with iteration.
* The average shortest path length (Chart c) initially spikes and then decreases to a stable level.
### Interpretation
The charts suggest that as the number of iterations increases:
* The knowledge communities initially evolve rapidly, forming new sub-networks, but this growth slows down as the system matures.
* The number of concepts connecting different domains increases steadily, indicating a continuous integration of knowledge.
* The depth of multi-hop reasoning initially increases, possibly due to exploration of new connections, but then decreases and stabilizes, suggesting that the system finds more efficient reasoning paths over time.
The relationship between these elements suggests a dynamic system where knowledge is initially fragmented but becomes more integrated and efficient over time. The initial spike in reasoning depth might represent a period of exploration, while the subsequent stabilization indicates optimization and refinement of reasoning processes.
</details>
Figure 11: Structural evolution of the knowledge graph across iterations. (a) The number of distinct knowledge communities over time, showing an increasing trend with some fluctuations, for graph $\mathcal{G_{1}}$ . (b) The growth of bridge nodes that connect multiple knowledge domains, following a steady linear increase. (c) The average shortest path length over iterations, indicating shifts in reasoning complexity as the graph expands.
Figure 11 (a) illustrates the formation of knowledge sub-networks over time. The number of distinct communities increases as iterations progress, reflecting the system’s ability to differentiate between specialized fields of knowledge. The trend suggests two key observations: (i) an early rapid formation of new communities as novel knowledge domains emerge and (ii) a later stage where the number of communities stabilizes with occasional fluctuations. The latter behavior indicates that rather than indefinitely forming new disconnected knowledge clusters, the system reaches a regime where previously distinct domains remain relatively stable while undergoing minor structural reorganizations. The fluctuations in the later stages may correspond to moments where knowledge clusters merge or when new abstractions cause domain shifts.
Figure 11 (b) tracks the number of bridge nodes (concepts that serve as interdisciplinary connectors) over iterative expansions. The steady, almost linear increase in bridge nodes suggests that as knowledge expands, more concepts naturally emerge as crucial links between different domains. This behavior reflects the self-reinforcing nature of knowledge integration, where new ideas not only expand within their respective fields but also introduce new ways to connect previously unrelated disciplines. Interestingly, there is no evidence of saturation in the number of bridge nodes, implying that the graph remains highly adaptive, continuously uncovering interdisciplinary relationships without premature convergence. This property is reminiscent of human knowledge structures, where interdisciplinary connections become more prevalent as scientific inquiry deepens.
Figure 11 (c) examines the depth of multi-hop reasoning over iterations by measuring the average shortest path length in the graph. Initially, reasoning depth fluctuates significantly, which corresponds to the early phase of knowledge graph formation when structural organization is still emergent. As iterations progress, the average path length stabilizes, indicating that the system achieves a balance between hierarchical depth and accessibility of information. The early fluctuations may be attributed to the rapid reorganization of knowledge, where some paths temporarily become longer as new concepts emerge before stabilizing into more efficient reasoning structures. The eventual stabilization suggests that the graph reaches an equilibrium in how information propagates through interconnected domains, maintaining reasoning efficiency while still allowing for complex inferential pathways.
Taken together, these findings suggest that the autonomous knowledge expansion model exhibits structured self-organization, balancing specialization and integration. The interplay between distinct community formation, interdisciplinary connectivity, and reasoning depth highlights the emergence of a dynamically evolving but structurally coherent knowledge network. The continuous increase in bridge nodes reinforces the idea that interdisciplinary reasoning remains a central feature throughout the system’s expansion, which may have significant implications for autonomous discovery processes. Future analyses will explore whether certain bridge nodes exhibit long-term persistence as central knowledge connectors or if interdisciplinary pathways evolve dynamically based on newly introduced concepts.
2.7 Persistence of Bridge Nodes in Knowledge Evolution
To understand the structural stability of interdisciplinary connections, we further analyze the persistence of bridge nodes—concepts that act as connectors between distinct knowledge domains, over multiple iterations. Figure 12 presents a histogram of bridge node lifespans, showing how long each node remained an active bridge in the knowledge graph.
<details>
<summary>x12.png Details</summary>

### Visual Description
## Histogram: Bridge Node Persistence Over Time
### Overview
The image is a histogram showing the persistence of bridge nodes over time. The x-axis represents the number of iterations a node acts as a bridge, and the y-axis represents the number of nodes. The histogram bars are light blue with dark blue outlines. A dark blue curve is overlaid on the histogram, representing a smoothed distribution of the data.
### Components/Axes
* **Title:** Bridge Node Persistence Over Time
* **X-axis:** Number of Iterations as a Bridge Node
* Scale: 0 to 800, with tick marks at intervals of 100 (0, 100, 200, 300, 400, 500, 600, 700, 800)
* **Y-axis:** Number of Nodes
* Scale: 0 to 350, with tick marks at intervals of 50 (0, 50, 100, 150, 200, 250, 300, 350)
* **Bars:** Light blue with dark blue outlines.
* **Curve:** Dark blue, overlaid on the histogram.
### Detailed Analysis
The histogram shows a right-skewed distribution. The highest bar is at the beginning of the x-axis, indicating that most bridge nodes have a low number of iterations. As the number of iterations increases, the number of nodes decreases.
* **Bar 1 (0-25 iterations):** Approximately 350 nodes.
* **Bar 2 (25-50 iterations):** Approximately 130 nodes.
* **Bar 3 (50-75 iterations):** Approximately 120 nodes.
* **Bar 4 (75-100 iterations):** Approximately 90 nodes.
* **Bar at 200 iterations:** Approximately 80 nodes.
* **Bar at 300 iterations:** Approximately 60 nodes.
* **Bar at 400 iterations:** Approximately 65 nodes.
* **Bar at 500 iterations:** Approximately 50 nodes.
* **Bar at 600 iterations:** Approximately 30 nodes.
* **Bar at 700 iterations:** Approximately 25 nodes.
* **Bar at 800 iterations:** Approximately 25 nodes.
The dark blue curve follows the general trend of the histogram, showing a rapid decrease in the number of nodes as the number of iterations increases, followed by a more gradual decline.
### Key Observations
* The majority of bridge nodes have a low number of iterations.
* The distribution is right-skewed, indicating that there are a few bridge nodes with a high number of iterations.
* The dark blue curve provides a smoothed representation of the distribution.
### Interpretation
The data suggests that most bridge nodes are not persistent and only act as bridges for a short period. A smaller number of bridge nodes are more persistent and act as bridges for a longer time. The right-skewed distribution indicates that the persistence of bridge nodes follows a power-law distribution, where a small number of nodes are responsible for a large fraction of the bridge functionality. This could be due to various factors, such as node stability, network topology, or routing algorithms. The histogram provides a visual representation of the distribution of bridge node persistence, which can be used to analyze the performance and reliability of the network.
</details>
Figure 12: Histogram of bridge node persistence over iterations, for $\mathcal{G_{1}}$ . The distribution follows a long-tail pattern, indicating that while most bridge nodes exist only briefly, a subset remains active across hundreds of iterations.
The distribution in Figure 12 suggests that knowledge graph connectivity follows a hybrid model of structural evolution. The majority of bridge nodes appear only for a limited number of iterations, reinforcing the hypothesis that interdisciplinary pathways frequently evolve as new concepts emerge and replace older ones. This aligns with earlier observations that the knowledge system exhibits a high degree of conceptual dynamism.
However, a subset of bridge nodes remains persistent for hundreds of iterations. These nodes likely correspond to fundamental concepts that sustain long-term interdisciplinary connectivity. Their extended presence suggests that the system does not solely undergo continuous restructuring; rather, it maintains a set of core concepts that act as stable anchors in the evolving knowledge landscape.
These results refine our earlier observations by distinguishing between transient interdisciplinary connections and long-term structural stability. While knowledge graph expansion is dynamic, certain foundational concepts maintain their bridging role, structuring the broader knowledge network over extended periods. This hybrid model suggests that autonomous knowledge expansion does not operate under complete conceptual turnover but instead converges toward the emergence of stable, high-impact concepts that persist across iterations.
Related questions that could be explored in future research is whether these persistent bridge nodes correspond to widely used theoretical frameworks, methodological paradigms, or cross-domain knowledge principles. Additionally, further analysis is needed to examine whether long-term bridge nodes exhibit distinct topological properties, such as higher degree centrality or clustering coefficients, compared to short-lived connectors.
2.8 Early Evolution of Bridge Nodes in Knowledge Expansion
To examine the mechanics of the formation of interdisciplinary connections in the early stages of knowledge graph evolution, we pay close attention to the process. In the analysis discussed here, we identify the first occurrences of bridge nodes over the initial 200 iterations. Figure 13 presents a binary heatmap, where each row represents a bridge node, and each column corresponds to an iteration. The bridge nodes are sorted by the iteration in which they first appeared, providing a clearer view of how interdisciplinary connectors emerge over time.
<details>
<summary>x13.png Details</summary>

### Visual Description
## Heatmap: Bridge Node Appearance Over Early Iterations (Sorted by First Appearance)
### Overview
The image is a heatmap visualizing the appearance of bridge nodes over the first 200 iterations. The y-axis lists the top 100 earliest appearing bridge nodes, and the x-axis represents the iteration number. Dark blue cells indicate the presence of a specific bridge node at a given iteration. The nodes are sorted by their first appearance.
### Components/Axes
* **Title:** Bridge Node Appearance Over Early Iterations (Sorted by First Appearance)
* **X-axis:** Iteration (First 200). The x-axis is labeled with iteration numbers from 2 to 199, with increments varying. Specific values include: 2, 5, 8, 11, 17, 23, 26, 29, 35, 40, 44, 50, 53, 56, 60, 63, 66, 69, 73, 76, 82, 87, 90, 94, 100, 105, 108, 112, 116, 120, 125, 130, 133, 137, 142, 146, 149, 152, 156, 159, 164, 169, 176, 179, 184, 188, 192, 195, 199.
* **Y-axis:** Top 100 Earliest Appearing Bridge Nodes. The y-axis lists the following bridge nodes:
* Closed-Loop Life Cycle Design
* Environmental Sustainability
* Human Well-being
* Material Utilization
* Material Waste
* Recycling
* Bio-inspired Materials
* Bio-inspired Materials Science
* Closed-loop Life Cycle Design
* Design Approach
* Development of Novel, Adaptive Urban Ecosystems
* Materials Production
* Materials Science
* Nature
* Novel, Adaptive Urban Ecosystems
* Social Impact
* Sustainable Materials Development
* Self-healing Infrastructure
* Environmental Impact
* More Resilient Urban Ecosystems
* Urban Planning and Development
* Adaptability of cities to climate change
* Enhancement of Adaptability and Resilience in cities
* Integration
* Key Design Considerations
* Sustainability
* Adaptability and Resilience of Cities
* Climate Change
* Smart Ecosystems
* Biological and Bio-Inspired Materials
* Materials
* Urban Infrastructure Design
* Environmental Health
* Flood Resilience
* Infrastructure
* Adaptive
* Advanced Material
* Economic Growth
* Economic Outcome
* Floodwall System
* Smart Material
* Urban Flood Defenses
* Urban Infrastructure
* Adaptive, Modular Design
* Economic Growth of Affected Communities
* Self-healing Concrete
* Artificial Intelligence (AI)
* Feedback Mechanism
* Inclusive Learning Ecosystem (ILE)
* Personalized Learning Experience
* Personalized Learning Experiences
* Adaptive Learning
* Learning Effectiveness
* Learning Pathways
* Adaptive Learning Systems
* Cognitive Profiling
* Learning Environment
* Learning Motivation
* Learning Outcomes
* AI-driven Knowledge Graph (KG)
* Adaptive Assessments
* Knowledge Graph-based Adaptation (KG-BA)
* Personalized Learning Pathways
* Personalized Learning Pathways (PLP)
* Adaptive Learning System (ALS)
* Flow
* Individual Differences
* Knowledge Graph
* Knowledge Graph Construction
* Knowledge Representation
* Learning Platform
* Learning Preferences
* Neuroplasticity
* Neuroplasticity-Based Learning
* Personalized Education Strategies
* Learning Approach
* Learning Outcome
* Learning Process
* Student Success
* VR Therapy
* AI-Driven Narrative Generation
* Personalized Adaptive Narratives
* Anxiety Disorders
* Immersive Storytelling
* Virtual Reality (VR) Therapy
* BCIs
* Long-term Outcomes
* Personalized VR Therapy
* Therapeutic Approach
* User Engagement
* Treatment Plans
* Brain-Computer Interfaces (BCIs)
* Neurological Disorders
* Outcome
* Recovery
* Technology
* Treatment Longevity
* Personalization and Adaptation Therapy
* **Color Scheme:** Dark blue indicates the presence of a bridge node at a specific iteration; white indicates absence.
### Detailed Analysis
The heatmap shows a clear trend:
* Nodes at the top of the list (e.g., "Closed-Loop Life Cycle Design," "Environmental Sustainability") appear consistently throughout the early iterations.
* As you move down the list, nodes appear later and less frequently.
* Some nodes, like "Personalization and Adaptation Therapy," only appear sporadically in the later iterations.
### Key Observations
* The heatmap visually confirms that the nodes are sorted by their first appearance.
* There is a wide variation in the persistence of different bridge nodes across iterations. Some nodes are consistently present, while others are only briefly active.
* The density of dark blue cells decreases as you move from left to right and from top to bottom, indicating that fewer nodes are active in later iterations and that the later-appearing nodes are less consistently present.
### Interpretation
The heatmap provides insights into the evolution of bridge node activity over the first 200 iterations. The nodes at the top, which appear early and consistently, likely represent fundamental or foundational concepts. The nodes that appear later and less frequently may represent more specialized or derivative concepts that build upon the earlier ones. The visualization suggests a hierarchical structure where some concepts are consistently relevant, while others become important only in specific contexts or later stages of development. The data suggests that the initial iterations are dominated by a core set of concepts, and as the process evolves, new and more specialized concepts emerge.
</details>
Figure 13: Emergence of bridge nodes over the first 200 iterations, sorted by first appearance, for $\mathcal{G_{1}}$ . White regions indicate the absence of a node as a bridge, while dark blue regions denote its presence. Nodes that appear earlier in the graph evolution are positioned at the top. The structured emergence pattern suggests phases of knowledge expansion and stabilization.
The heatmap in Figure 13 reveals several key trends in the evolution of bridge nodes. Notably, the earliest iterations feature a rapid influx of bridge nodes, reflecting the initial structuring phase of the knowledge graph. Many nodes appear and remain active for extended periods, suggesting that certain concepts establish themselves as core interdisciplinary connectors early in the process. These nodes likely play a foundational role in structuring knowledge integration across domains.
A second notable pattern is the episodic emergence of new bridge nodes, rather than a continuous accumulation. The visualization shows distinct clusters of newly appearing bridge nodes, interspersed with periods of relative stability. These bursts suggest that knowledge integration occurs in structured phases rather than through gradual accumulation. Such phases may represent moments when the system reaches a threshold where newly integrated concepts allow for the creation of previously infeasible interdisciplinary links.
In contrast to the early-established bridge nodes, a subset of nodes appears only in later iterations. These late-emerging bridge nodes indicate that interdisciplinary roles are notably not static; rather, the system continuously restructures itself, incorporating new ideas as they gain relevance. This supports the hypothesis that certain bridge nodes emerge not from initial structuring but from later stages of conceptual refinement, possibly as higher-order abstractions connecting previously developed knowledge clusters.
The distribution of bridge node activity also suggests a mix of persistent and transient connectors. While some nodes appear briefly and disappear, others remain active over long stretches. This behavior reinforces the idea that knowledge expansion is both dynamic and structured, balancing exploration (where new connections are tested) and stabilization (where key interdisciplinary links persist).
We note that the structured emergence of bridge nodes may indicate that interdisciplinary pathways do not form randomly but are shaped by systematic phases of knowledge integration and refinement. Future analyses could explore the long-term impact of early bridge nodes, assessing whether they remain influential throughout the knowledge graph’s evolution, and whether the structure of interdisciplinary connectivity stabilizes or continues to reorganize over extended iterations.
2.9 Evolution of Key Bridge Nodes Over Iterations
To investigate how interdisciplinary pathways evolve in the knowledge graph, we analyzed the betweenness centrality of the most influential bridge nodes across 1,000 iterations. Figure 14 presents the trajectory of the top 10 bridge nodes, highlighting their shifting roles in facilitating interdisciplinary connections.
<details>
<summary>x14.png Details</summary>

### Visual Description
## Line Chart: Evolution of Top 10 Bridge Nodes Over Iterations
### Overview
The image is a line chart that illustrates the evolution of betweenness centrality for the top 10 bridge nodes over a series of iterations. The chart displays how the importance or influence of different nodes (represented by different categories) changes as the iterations progress. The x-axis represents the iteration number, and the y-axis represents the betweenness centrality.
### Components/Axes
* **Title:** Evolution of Top 10 Bridge Nodes Over Iterations
* **X-axis:**
* Label: Iteration
* Scale: 0 to 1000, with major ticks at 0, 200, 400, 600, 800, and 1000.
* **Y-axis:**
* Label: Betweenness Centrality
* Scale: 0.0 to 0.7, with major ticks at 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, and 0.7.
* **Legend:** Located on the top-right of the chart, the legend maps the line colors to the following categories:
* Blue: Closed-Loop Life Cycle Design
* Orange: Human Well-being
* Green: Feedback Mechanism
* Red: Adaptive
* Purple: Adaptability and Resilience of Cities
* Brown: Artificial Intelligence (AI)
* Pink: Environmental Sustainability
* Gray: Outcome
* Yellow-Green: Closed-loop Life Cycle Design
* Teal: Bio-inspired Materials
### Detailed Analysis
Here's a breakdown of each data series and its trend:
1. **Closed-Loop Life Cycle Design (Blue):** Starts at approximately 0.7 at iteration 0, then rapidly decreases to near 0.0 by iteration 100, and remains relatively stable near 0.0 for the rest of the iterations.
2. **Human Well-being (Orange):** Starts at approximately 0.57 at iteration 0, decreases to approximately 0.1 at iteration 200, and then gradually decreases to approximately 0.05 by iteration 1000.
3. **Feedback Mechanism (Green):** Starts at approximately 0.52 at iteration 0, decreases rapidly to near 0.0 by iteration 100, and remains relatively stable near 0.0 for the rest of the iterations.
4. **Adaptive (Red):** Starts at approximately 0.47 at iteration 0, decreases rapidly to near 0.0 by iteration 100, and remains relatively stable near 0.0 for the rest of the iterations.
5. **Adaptability and Resilience of Cities (Purple):** Starts at approximately 0.45 at iteration 0, decreases rapidly to near 0.0 by iteration 100, and remains relatively stable near 0.0 for the rest of the iterations.
6. **Artificial Intelligence (AI) (Brown):** Starts at approximately 0.44 at iteration 0, fluctuates between 0.45 and 0.30 until iteration 800, and then stabilizes around 0.27 by iteration 1000.
7. **Environmental Sustainability (Pink):** Starts at approximately 0.42 at iteration 0, decreases rapidly to near 0.0 by iteration 100, and remains relatively stable near 0.0 for the rest of the iterations.
8. **Outcome (Gray):** Starts at approximately 0.22 at iteration 0, decreases rapidly to near 0.0 by iteration 200, and remains relatively stable near 0.0 for the rest of the iterations.
9. **Closed-loop Life Cycle Design (Yellow-Green):** Starts at approximately 0.55 at iteration 0, decreases rapidly to near 0.0 by iteration 100, and remains relatively stable near 0.0 for the rest of the iterations.
10. **Bio-inspired Materials (Teal):** Starts at approximately 0.02 at iteration 0, remains relatively stable near 0.0 for the rest of the iterations.
### Key Observations
* The "Closed-Loop Life Cycle Design" node (blue line) has the highest initial betweenness centrality, peaking at approximately 0.7, but quickly drops to near zero.
* The "Artificial Intelligence (AI)" node (brown line) maintains a relatively high betweenness centrality compared to other nodes throughout the iterations, although it decreases over time.
* Most of the nodes experience a rapid decrease in betweenness centrality within the first 200 iterations.
* The "Bio-inspired Materials" node (teal line) consistently has a very low betweenness centrality throughout the iterations.
### Interpretation
The chart suggests that the "Closed-Loop Life Cycle Design" node is initially the most influential or important bridge node, but its influence diminishes rapidly. The "Artificial Intelligence (AI)" node maintains a more sustained level of influence compared to the other nodes, indicating its continued importance as a bridge node throughout the iterations. The rapid decline in betweenness centrality for most nodes suggests that their importance as bridge nodes decreases significantly as the iterations progress. The consistently low betweenness centrality of the "Bio-inspired Materials" node indicates that it is not a significant bridge node in this context. The data demonstrates the dynamic nature of node importance in a network, where some nodes initially have high influence but lose it over time, while others maintain a more consistent level of influence.
</details>
Figure 14: Evolution of the top 10 bridge nodes over iterations, for $\mathcal{G_{1}}$ . Each curve represents the betweenness centrality of a bridge node, indicating its role in facilitating knowledge integration. Nodes that initially had high centrality later declined, while some concepts maintained their influence throughout the graph’s evolution.
The trends in Figure 14 reveal distinct patterns in how bridge nodes emerge, peak in influence, and decline over time. Notably, nodes such as Closed-Loop Life Cycle Design and Human Well-being exhibit high betweenness centrality in the early iterations, suggesting that they played a fundamental role in structuring the initial interdisciplinary landscape. However, as the knowledge graph expanded, these nodes saw a gradual decline in their centrality, indicating that their role as primary connectors was replaced by alternative pathways.
A second class of bridge nodes, including Adaptability and Resilience of Cities and Artificial Intelligence (AI), maintained high centrality values for a longer duration, suggesting that certain concepts remain essential to interdisciplinary knowledge integration even as the graph evolves. These nodes acted as long-term knowledge stabilizers, facilitating interactions between different research domains throughout a significant portion of the knowledge expansion process.
Interestingly, a subset of nodes, such as Feedback Mechanism and Outcome, gradually gained importance over time. Unlike early bridge nodes that peaked and declined, these nodes started with lower centrality but increased in influence in later iterations. This suggests that some interdisciplinary pathways only become critical after sufficient knowledge accumulation, reinforcing the idea that interdisciplinary roles are not static but continuously reorganize as the knowledge graph matures.
Furthermore, we observe that by approximately iteration 400-600, most bridge nodes’ betweenness centrality values begin converging toward lower values, indicating that knowledge transfer is no longer reliant on a small set of nodes. This suggests that, as the graph expands, alternative pathways develop, leading to a more distributed and decentralized knowledge structure where connectivity is no longer dominated by a few highly influential nodes.
These findings support the hypothesis that interdisciplinary pathways evolve dynamically, with early-stage knowledge formation relying on a few key concepts, followed by a transition to a more robust and distributed network where multiple redundant pathways exist. Future analyses will focus on:
- Identifying which nodes replaced early bridge nodes as major interdisciplinary connectors in later iterations.
- Comparing early vs. late-stage bridge nodes to assess whether earlier nodes tend to be general concepts, while later bridge nodes represent more specialized interdisciplinary knowledge.
- Analyzing the resilience of the knowledge graph by simulating the removal of early bridge nodes to determine their structural significance.
These results provide a perspective on how interdisciplinary linkages emerge, stabilize, and reorganize over time, offering insights into the self-organizing properties of large-scale knowledge systems.
2.10 Evolution of Betweenness Centrality Distribution
To analyze the structural evolution of the knowledge graph, we next examine the distribution of betweenness centrality at different iterations. Betweenness centrality is a measure of a node’s importance in facilitating knowledge transfer between different parts of the network. Formally, the betweenness centrality of a node $v$ is given by:
$$
C_{B}(v)=\sum_{s\neq v\neq t}\frac{\sigma_{st}(v)}{\sigma_{st}}, \tag{1}
$$
where $\sigma_{st}$ is the total number of shortest paths between nodes $s$ and $t$ , and $\sigma_{st}(v)$ is the number of those paths that pass through $v$ . A higher betweenness centrality indicates that a node serves as a critical intermediary in connecting disparate knowledge domains.
Figure S3 presents histograms of betweenness centrality distribution at four key iterations (2, 100, 510, and 1024), illustrating the shifting role of bridge nodes over time.
Initially, at Iteration 2, the network is highly centralized, with a small number of nodes exhibiting extremely high betweenness centrality (above 0.6), while the majority of nodes have near-zero values. This indicates that only a few nodes act as critical interdisciplinary connectors, facilitating nearly all knowledge transfer.
By Iteration 100, the distribution has broadened, meaning that more nodes participate in knowledge transfer. The highest betweenness values have decreased compared to Iteration 2, and more nodes exhibit low but nonzero centrality, suggesting an increase in redundant pathways and reduced dependency on a few dominant bridge nodes.
At Iteration 510, the distribution becomes more skewed again, with fewer nodes having high betweenness centrality and a stronger concentration at low values. This suggests that the network has undergone a phase of structural consolidation, where interdisciplinary pathways reorganize around fewer, more stable bridges.
Finally, at Iteration 1024, the histogram shows that most nodes have low betweenness centrality, and only a few retain moderate values. This suggests that the network has matured into a more distributed structure, where no single node dominates knowledge transfer. The observed trend indicates that as the knowledge graph expands, the burden of interdisciplinary connectivity is increasingly shared among many nodes rather than concentrated in a few.
These results suggest that the system undergoes a dynamic reorganization process, shifting from an initial hub-dominated structure to a more distributed and resilient network. Future work could potentially explore whether these trends continue as the graph scales further and whether the eventual network state remains stable or undergoes additional restructuring.
To examine the overall structural properties of the knowledge graph, we analyzed the distribution of betweenness centrality across all iterations. Figure 15 presents a histogram of betweenness centrality values collected from all iterations of the knowledge graph. The distribution was generated by computing betweenness centrality for each iteration and aggregating all node values overall iterations.
<details>
<summary>x15.png Details</summary>

### Visual Description
## Histogram: Distribution of Betweenness Centrality Across All Iterations
### Overview
The image is a histogram showing the distribution of betweenness centrality across all iterations. The x-axis represents betweenness centrality, and the y-axis represents the number of nodes on a logarithmic scale. The histogram bars are light blue.
### Components/Axes
* **Title:** Distribution of Betweenness Centrality Across All Iterations
* **X-axis:** Betweenness Centrality, ranging from 0.0 to 0.7 in increments of 0.1.
* **Y-axis:** Number of Nodes (Log Scale), ranging from 10^0 (1) to 10^6 (1,000,000).
* **Bars:** Light blue bars representing the frequency of nodes at each betweenness centrality value.
### Detailed Analysis
The histogram shows a distribution where the majority of nodes have low betweenness centrality.
* **Betweenness Centrality 0.0:** The number of nodes is approximately 1.2 * 10^6.
* **Betweenness Centrality 0.0 - 0.1:** The number of nodes decreases rapidly.
* **Betweenness Centrality 0.1 - 0.5:** The number of nodes fluctuates between 10^1 and 10^2.
* **Betweenness Centrality 0.5 - 0.7:** The number of nodes is very low, close to 10^0.
Specific data points (approximate due to histogram format):
* 0.0: ~1,200,000
* 0.05: ~30,000
* 0.1: ~3,000
* 0.15: ~300
* 0.2: ~150
* 0.25: ~100
* 0.3: ~80
* 0.35: ~50
* 0.4: ~40
* 0.45: ~20
* 0.5: ~10
* 0.55: ~2
* 0.6: ~1
* 0.65: ~1
* 0.7: ~1
### Key Observations
* The distribution is heavily skewed to the right, indicating that most nodes have low betweenness centrality.
* There is a significant drop in the number of nodes between 0.0 and 0.1.
* The number of nodes with betweenness centrality greater than 0.5 is very small.
### Interpretation
The histogram suggests that in the network being analyzed, a large number of nodes have very little influence in terms of connecting other nodes (low betweenness centrality). A small fraction of nodes have a high degree of influence, acting as key connectors within the network. This type of distribution is common in many real-world networks, where a few central nodes play a critical role in maintaining connectivity and information flow. The logarithmic scale on the y-axis is necessary to visualize the wide range of node counts, highlighting the disparity between the highly central nodes and the vast majority of nodes with low centrality.
</details>
Figure 15: Distribution of betweenness centrality across all iterations, $\mathcal{G_{1}}$ . The y-axis is log-scaled, showing the frequency of nodes with different centrality values. A small number of nodes dominate knowledge transfer, while most nodes exhibit near-zero centrality.
The histogram in Figure 15 reveals a highly skewed distribution, where the majority of nodes exhibit near-zero betweenness centrality, while a small subset maintains significantly higher values. This pattern suggests that knowledge transfer within the network is primarily governed by a few dominant bridge nodes, which facilitate interdisciplinary connections. The presence of a long tail in the distribution indicates that these high-betweenness nodes persist throughout multiple iterations.
Interestingly, the distribution also exhibits multiple peaks, suggesting that the network consists of different classes of bridge nodes. Some nodes act as long-term stable interdisciplinary connectors, while others emerge as transient bridges that facilitate knowledge transfer only for limited iterations.
The log scale on the $y$ -axis reveals that while most nodes contribute little to betweenness centrality, a significant number of nodes still exhibit low but nonzero values indicating that knowledge flow is distributed across many minor pathways. Over multiple iterations, it is expected that betweenness centrality values redistribute, reducing dependency on early dominant nodes and leading to a more decentralized knowledge structure.
These findings highlight that the knowledge graph maintains a core-periphery structure, where a few key nodes play a disproportionate role in bridging knowledge across disciplines. Future work will explore how the distribution evolves over time, identifying whether the network transitions toward a more evenly distributed structure or remains reliant on a small number of high-centrality nodes.
2.11 Evolution of Betweenness Centrality in the Knowledge Graph
To analyze the structural evolution of the knowledge graph, we tracked the changes in betweenness centrality over 1,000 iterations. Betweenness centrality quantifies the extent to which a node serves as a bridge between other nodes by appearing on shortest paths. A node with high betweenness centrality facilitates interdisciplinary knowledge transfer by linking otherwise disconnected regions of the network. Figures 16 (a) and 16 (b) illustrate how mean and maximum betweenness centrality evolve over time. The first plot captures the average importance of nodes in knowledge transfer, while the second identifies the most dominant bridge nodes at each iteration.
<details>
<summary>x16.png Details</summary>

### Visual Description
## Chart: Betweenness Centrality Over Time and Per Iteration
### Overview
The image presents two line charts side-by-side, comparing the mean and maximum betweenness centrality over 1000 iterations. The left chart (a) displays the mean betweenness centrality, which rapidly decreases and stabilizes near zero. The right chart (b) shows the maximum betweenness centrality per iteration, exhibiting an initial spike followed by a fluctuating decrease and eventual stabilization.
### Components/Axes
**Chart (a): Mean Betweenness Centrality Over Time**
* **Title:** Mean Betweenness Centrality Over Time
* **X-axis:** Iteration, with markers at 0, 200, 400, 600, 800, and 1000.
* **Y-axis:** Mean Betweenness Centrality, with markers at 0.00, 0.02, 0.04, 0.06, 0.08, 0.10, and 0.12.
* **Legend:** Located in the top-right corner, indicating that the blue line represents "Mean Betweenness".
**Chart (b): Maximum Betweenness Centrality Per Iteration**
* **Title:** Maximum Betweenness Centrality Per Iteration
* **X-axis:** Iteration, with markers at 0, 200, 400, 600, 800, and 1000.
* **Y-axis:** Max Betweenness Centrality, with markers at 0.2, 0.3, 0.4, 0.5, 0.6, and 0.7.
* **Legend:** Located in the top-right corner, indicating that the red line represents "Max Betweenness".
### Detailed Analysis
**Chart (a): Mean Betweenness Centrality Over Time**
* **Trend:** The blue line, representing the mean betweenness, exhibits a steep downward slope from iteration 0 to approximately 200. After iteration 200, the line flattens out, indicating a stabilization of the mean betweenness centrality near zero.
* **Data Points:**
* At iteration 0, the mean betweenness centrality is approximately 0.12.
* At iteration 20, the mean betweenness centrality is approximately 0.04.
* At iteration 100, the mean betweenness centrality is approximately 0.01.
* From iteration 200 onwards, the mean betweenness centrality remains close to 0.01 or less.
**Chart (b): Maximum Betweenness Centrality Per Iteration**
* **Trend:** The red line, representing the maximum betweenness, starts with a high value at iteration 0, then fluctuates significantly between iterations 0 and 400. From iteration 400 onwards, the line gradually decreases and stabilizes.
* **Data Points:**
* At iteration 0, the max betweenness centrality is approximately 0.7.
* At iteration 50, the max betweenness centrality is approximately 0.52.
* At iteration 200, the max betweenness centrality is approximately 0.45.
* At iteration 400, the max betweenness centrality is approximately 0.4.
* At iteration 800, the max betweenness centrality is approximately 0.27.
* From iteration 900 onwards, the max betweenness centrality remains close to 0.24.
### Key Observations
* The mean betweenness centrality decreases rapidly and stabilizes near zero, suggesting that the average influence of nodes in the network diminishes quickly over iterations.
* The maximum betweenness centrality exhibits more variability, indicating that while the average influence decreases, some nodes continue to have a significant influence on the network.
* Both charts show a stabilization of betweenness centrality after a certain number of iterations, suggesting a convergence of the network's structure or dynamics.
### Interpretation
The data suggests that the network undergoes a process where the average influence of nodes decreases significantly over time, while the influence of the most central nodes fluctuates before eventually stabilizing. This could be indicative of a network where initial high-influence nodes become less critical as the network evolves, and the overall structure becomes more distributed. The initial spike in maximum betweenness centrality followed by a decrease suggests that the network initially relies on a few highly central nodes, but over time, the influence becomes more evenly distributed, leading to a lower maximum betweenness centrality. The stabilization of both mean and maximum betweenness centrality indicates that the network reaches a relatively stable state after a certain number of iterations.
</details>
Figure 16: Evolution of betweenness centrality in the knowledge graph, $\mathcal{G_{1}}$ . Panel (a): Mean betweenness centrality over time, showing a transition from early high centralization to a more distributed state. Panel (b): Maximum betweenness centrality per iteration, highlighting how the most dominant bridge nodes shift and decline in influence.
Figure 16 (a) tracks the mean betweenness centrality, providing insight into how the overall distribution of knowledge transfer roles evolves. In the earliest iterations, the mean betweenness is extremely high, indicating that only a few nodes dominate knowledge exchange. However, as the graph expands and alternative pathways form, the mean betweenness declines rapidly within the first 100 iterations.
Between iterations 100 and 500, we observe a continued decline, but at a slower rate. This suggests that knowledge transfer is being shared across more nodes, reducing reliance on a small set of dominant bridges. After iteration 500, the values stabilize near zero, indicating that the network has reached a decentralized state, where multiple nodes contribute to knowledge integration instead of a few key intermediaries.
These trends suggest a self-organizing process, where the knowledge graph transitions from a highly centralized system into a more distributed and resilient network. The final structure is more robust, with many small bridges collectively supporting interdisciplinary connectivity instead of a few dominant hubs.
Figure 16 (b) examines the highest betweenness centrality recorded in each iteration, tracking the most dominant knowledge bridge at each stage. In the earliest iterations, a single node reaches an extreme betweenness value of around 0.7, indicating that knowledge transfer is highly bottlenecked through one or very few key nodes.
Between iterations 50 and 300, the maximum betweenness remains high, fluctuating between 0.3 and 0.5. This suggests that while the network becomes less dependent on a single node, a small number of highly central nodes still dominate knowledge flow. This phase represents a transition period, where the network starts distributing knowledge transfer across multiple nodes.
After iteration 500, the maximum betweenness exhibits a gradual decline, eventually stabilizing around 0.2. This suggests that the network has successfully decentralized, and knowledge transfer is no longer dominated by a single key node. The presence of multiple lower-betweenness bridge nodes implies that redundant pathways have developed, making the system more resilient to disruptions. This is in general agreement with earlier observations.
The combined results from Figures 16 (a) and 16 (b) suggest that the knowledge graph undergoes a fundamental structural transformation over time:
- Initially, a few dominant nodes control knowledge flow, leading to high mean and maximum betweenness centrality.
- As the graph expands, new pathways emerge, and betweenness is distributed across more nodes.
- By the later iterations, no single node dominates, and knowledge transfer occurs through a decentralized structure.
This evolution suggests that the knowledge graph self-organizes into a more distributed state, where interdisciplinary connectivity is no longer constrained by a few central hubs. Future studies can explore whether this trend continues at larger scales and analyze which specific nodes maintained high betweenness longest and which replaced them in later iterations.
2.12 Analysis of longest shortest path in $\mathcal{G}_{2}$ and analysis using agentic reasoning
While the primary focus of this study is targeting a detailed analysis of graph dynamic experiments during reasoning, we also explore how graph reasoning based on the in-situ generated graph can be used to improve responses through in-context learning [11] (here, we use meta-llama/Llama-3.2-3B-Instruct). The methodology employs a graph-based reasoning framework to enhance LLM responses through structured knowledge extraction obtained through the method described above. Figure 17 (b) depicts additional analysis, showing a correlation heatmap of path-level metrics, computed for the first 30 longest shortest paths.
<details>
<summary>x17.png Details</summary>

### Visual Description
## Diagram and Heatmap: Materials Network and Correlation of Path Metrics
### Overview
The image presents two distinct visualizations. Part (a) is a network diagram illustrating relationships between various materials and concepts, with "Impact-Resistant Materials" as a central node. Part (b) is a heatmap showing the correlation between different path metrics (Avg Degree, Avg Betweenness, etc.).
### Components/Axes
#### Part (a): Materials Network Diagram
* **Nodes:** Represent materials, concepts, or fields. Examples include "Impact-Resistant Materials," "Biodegradable Microplastic Materials," "Environmental Sustainability," "Data Analysis," and "Machine Learning (ML) Algorithms."
* **Edges:** Labeled with relationship types such as "RELATES-TO," "INFLUENCES," and "IS-A."
* **Node Colors:** Yellow and light green, with "Impact-Resistant Materials" having a darker yellow fill and a dark purple border.
* **Positioning:** Nodes are arranged around the central "Impact-Resistant Materials" node, with connections radiating outwards.
#### Part (b): Correlation Heatmap
* **Axes:** Both X and Y axes list the following path metrics: "Avg Degree," "Avg Betweenness," "Avg Closeness," "Avg Eigenvector," "Avg PageRank," "Avg Clustering," and "Path Density."
* **Color Scale:** Ranges from dark purple (-0.0) to bright yellow (1.0), indicating the strength and direction of correlation.
* **Values:** Numerical values within each cell represent the correlation coefficient between the corresponding path metrics.
### Detailed Analysis
#### Part (a): Materials Network Diagram
* **Impact-Resistant Materials:** This is the central node, influencing "Environmental Sustainability" and relating to "Machine Learning (ML) Algorithms."
* **Biodegradable Microplastic Materials:** "IS-A" type relationship with "Materials for infrastructure design" and "RELATES-TO" "Pollution mitigation".
* **Self-healing Materials in Infrastructure Design:** "RELATES-TO" "Pollution mitigation" and "INFLUENCES" "Development of novel materials for infrastructure design".
* **Environmental Sustainability:** Connected to "Self-healing materials" via "RELATES-TO" and influenced by "Impact-Resistant Materials."
* **Data Analysis:** Connected to "Knowledge Discovery" via "IS-A", "Predictive Modeling" and "AI Techniques" via "RELATES-TO".
* **Personalized Medicine:** "RELATES-TO" "Rare Genetic Disorders".
#### Part (b): Correlation Heatmap
* **Avg Degree:** Highly correlated with "Avg Betweenness" (0.99), "Avg Eigenvector" (0.88), and "Avg PageRank" (0.95).
* **Avg Betweenness:** Highly correlated with "Avg Degree" (0.99), "Avg Eigenvector" (0.93), and "Avg PageRank" (0.97).
* **Avg Closeness:** Shows moderate positive correlation with "Avg Clustering" (0.65) and "Path Density" (0.42).
* **Avg Eigenvector:** Highly correlated with "Avg Degree" (0.88), "Avg Betweenness" (0.93), and "Avg PageRank" (0.96).
* **Avg PageRank:** Highly correlated with "Avg Degree" (0.95), "Avg Betweenness" (0.97), and "Avg Eigenvector" (0.96).
* **Avg Clustering:** Shows moderate positive correlation with "Avg Closeness" (0.65) and "Path Density" (0.52).
* **Path Density:** Shows moderate positive correlation with "Avg Closeness" (0.42) and "Avg Clustering" (0.52).
### Key Observations
* **Network Diagram:** "Impact-Resistant Materials" acts as a central hub, connecting to concepts related to sustainability, advanced materials, and computational methods.
* **Heatmap:** Strong positive correlations exist between degree, betweenness, eigenvector centrality, and PageRank, suggesting these metrics capture similar aspects of network structure. Closeness, clustering, and path density show weaker, but still positive, correlations with each other.
### Interpretation
The network diagram illustrates the relationships between different materials and concepts, highlighting the central role of "Impact-Resistant Materials." The heatmap quantifies the relationships between different network metrics. The high correlation between degree, betweenness, eigenvector centrality, and PageRank suggests redundancy; these metrics may be measuring similar aspects of node importance within the network. The weaker correlations involving closeness, clustering, and path density suggest these metrics capture different, more nuanced aspects of network structure. The data suggests that improving impact resistance in materials can influence environmental sustainability and is related to advancements in machine learning.
</details>
Figure 17: Longest shortest path analysis. Panel (a): Visualization of the longest shortest path (diameter path) in $\mathcal{G}_{2}$ , presenting a fascinating chain of interdisciplinary relationships across medicine, data science and AI, materials science, sustainability, and infrastructure. Panel (b): Correlation heatmap of path-level metrics, computed for the first 30 longest shortest paths. Degree and betweenness centrality are highly correlated, indicating that high-degree nodes frequently serve as key connectors. Eigenvector centrality and PageRank also show strong correlation, highlighting their shared role in capturing node influence. Path density exhibits a weak or negative correlation with centrality measures, suggesting that highly connected nodes often form less dense structures. The metrics were computed for each path by extracting node-level properties (degree, betweenness, closeness, eigenvector centrality, PageRank, clustering coefficient) from the original graph and averaging them over all nodes in the path. Path density was calculated as the ratio of actual edges to possible edges within the path subgraph. Correlations were then derived from these aggregated values across multiple paths.
The extracted longest shortest path depicted in Figure 17 (a) presents a compelling sequence of relationships spanning biotechnology, artificial intelligence, materials science, and sustainability, illustrating how advancements in one domain influence others. The overall logical flow is well-structured, with clear and expected progressions, such as Rare Genetic Disorders leading to Personalized Medicine and Knowledge Discovery, reflecting that the model captures the increasing role of AI in medical research. The sequence from AI Techniques to Predictive Modeling and Machine Learning (ML) Algorithms is similarly intuitive, as computational models underpin predictive simulations across disciplines (details on methods, see Section 4.5).
However, some unexpected connections emerge, suggesting areas for further exploration. The link from Machine Learning (ML) Algorithms to Impact-Resistant Materials stands out – not as a weak connection, but as an intriguing suggestion of AI-driven materials design rather than mere discovery. Computational techniques, such as reinforcement learning and generative modeling, could optimize material structures for durability, opening new pathways in materials engineering. Another unconventional relationship is the transition from Biodegradable Microplastic Materials to Infrastructure Design. These two areas typically operate separately, yet this link may hint at the emergence of biodegradable composites for construction or sustainable materials engineering. Further investigation into the practical applications of biodegradable materials in structural design could strengthen this connection.
A notable redundancy appears in the presence of Pollution Mitigation twice, spelled differently, which results from a lack of node merging rather than a distinct conceptual relationship. This duplication suggests that similar concepts are being represented as separate nodes, potentially affecting graph-based reasoning. Similarly, Self-Healing Materials in Infrastructure Design loops back to Pollution Mitigation, reinforcing an already established sustainability link. While valid, this repetition could be streamlined for clarity.
We find that the logical progression effectively captures key interdisciplinary relationships while revealing areas for refinement. The structure underscores the increasing role of AI in materials science, the integration of sustainability into materials design, and the interplay between predictive modeling and physical sciences. Addressing node duplication and refining transitions between traditionally separate fields—such as biodegradable materials in construction—would enhance the clarity and coherence of the path, making it an even more insightful representation of scientific knowledge.
Agentic Reasoning over the Path
We apply an agentic model to analyze the longest shortest path. For this analysis, an agentic system first analyzes each node in the subgraph, then each of the relationships, and then synthesizes them into a “Final Synthesized Discovery” (in blue font for clarity). The analysis identifies key concepts such as biodegradable microplastics, self-healing materials, pollution mitigation, and AI-driven predictive modeling, ultimately synthesizing the Bio-Inspired, Adaptive Materials for Resilient Ecosystems (BAMES) paradigm. The resulting document, Supporting Text 1, presents the results.
The proposed discovery proposes self-healing, bio-inspired materials that integrate microbial, plant, and animal-derived mechanisms with AI-driven optimization to create adaptive, environmentally responsive materials. By embedding microorganisms for pollutant degradation and leveraging machine learning for real-time optimization, the model suggests that BAMES extends conventional self-healing materials beyond infrastructure applications into active environmental remediation [43]. The concept of temporal memory, where materials learn from past environmental conditions and adjust accordingly, introduces a novel paradigm in smart materials [44]. Additionally, the hypothesis that interconnected materials could develop emergent, collective behavior akin to biological ecosystems presents an interesting perspective on material intelligence and sustainability [45, 46].
Agentic Compositional Reasoning
We can formalize this approach further and induce agentic strategy to develop compositional reasoning (see, Section 4.5.1 for details). In this experiment, implement a systematic development of hierarchical reasoning over concepts, pairs of concepts, and so on. The resulting document is shown in Supporting Text 2, and Figure 18 shows a flowchart of the reasoning process.
<details>
<summary>x18.png Details</summary>

### Visual Description
## Flow Diagram: Sustainable Infrastructure Development
### Overview
The image is a flow diagram illustrating the progression from atomic components to a final expanded discovery in the context of sustainable infrastructure development. The diagram is divided into four main stages: Atomic Components, Pairwise Compositional Fusions, Bridge Synergies, and Final Expanded Discovery. Each stage contains several elements that contribute to the overall process. The diagram uses arrows to show the relationships and flow between these elements. The boxes are colored red and blue, alternating between stages.
### Components/Axes
The diagram consists of four main columns, each representing a stage in the development process. Each stage contains several rectangular boxes, each containing a textual description of a component, fusion, synergy, or discovery. Arrows connect the boxes, indicating the flow of information or influence. The stages are:
1. **Atomic Components** (Leftmost column, red outline): This stage lists fundamental elements or inputs.
2. **Pairwise Compositional Fusions** (Second column, blue outline): This stage describes combinations or integrations of elements from the first stage.
3. **Bridge Synergies** (Third column, red outline): This stage represents the synergistic effects or outcomes resulting from the fusions.
4. **Final Expanded Discovery** (Rightmost column, blue outline): This stage represents the ultimate outcome or discovery.
### Detailed Analysis or Content Details
**Atomic Components** (Red Outline):
* Materials for Infrastructure Design
* Biodegradable Microplastic Materials
* Pollution Mitigation
* Self-healing Materials in Infrastructure Design
* Development of Novel Infrastructure Materials
* Self-healing Materials
* Environmental Sustainability
* Impact-Resistant Materials
* Machine Learning (ML) Algorithms
* Predictive Modeling
* AI Techniques
* Data Analysis
* Knowledge Discovery
* Personalized Medicine
* Rare Genetic Disorders
**Pairwise Compositional Fusions** (Blue Outline):
* Eco-Resilient Infrastructure Design
* Sustainable Pollution Mitigation
* Smart Infrastructure for Sustainable Ecosystems
* Autonomous Repairable Infrastructure
* Sustainable Infrastructure Development
* Environmental Self-Healing Systems
* Eco-Repair Systems
* Eco-Toughened Materials
* Damage Forecasting Systems
* Explainable Predictive Models
* AI-Driven Predictive Systems
* Explainable Machine Learning (XML)
* Explainable Insights
* Precision Medicine Informatics
* Precision Medicine for Rare Genetic Disorders
**Bridge Synergies** (Red Outline):
* Environmental Sustainability + Tech Innovation
* Holistic Understanding of Complex Systems
* Convergence of Diverse Disciplines
**Final Expanded Discovery** (Blue Outline):
* EcoCycle: A Sustainable Infrastructure Framework
**Flow of Information:**
* "Materials for Infrastructure Design" flows into "Eco-Resilient Infrastructure Design" which flows into "Environmental Sustainability + Tech Innovation" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Biodegradable Microplastic Materials" flows into "Sustainable Pollution Mitigation" which flows into "Environmental Sustainability + Tech Innovation" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Pollution Mitigation" flows into "Smart Infrastructure for Sustainable Ecosystems" which flows into "Holistic Understanding of Complex Systems" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Self-healing Materials in Infrastructure Design" flows into "Autonomous Repairable Infrastructure" which flows into "Holistic Understanding of Complex Systems" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Development of Novel Infrastructure Materials" flows into "Sustainable Infrastructure Development" which flows into "Holistic Understanding of Complex Systems" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Self-healing Materials" flows into "Environmental Self-Healing Systems" which flows into "Holistic Understanding of Complex Systems" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Environmental Sustainability" flows into "Eco-Repair Systems" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Impact-Resistant Materials" flows into "Eco-Toughened Materials" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Machine Learning (ML) Algorithms" flows into "Damage Forecasting Systems" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Predictive Modeling" flows into "Explainable Predictive Models" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "AI Techniques" flows into "AI-Driven Predictive Systems" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Data Analysis" flows into "Explainable Machine Learning (XML)" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Knowledge Discovery" flows into "Explainable Insights" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Personalized Medicine" flows into "Precision Medicine Informatics" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
* "Rare Genetic Disorders" flows into "Precision Medicine for Rare Genetic Disorders" which flows into "Convergence of Diverse Disciplines" which flows into "EcoCycle: A Sustainable Infrastructure Framework".
### Key Observations
* The diagram illustrates a multi-stage process, starting from fundamental components and culminating in a comprehensive framework for sustainable infrastructure.
* The "Pairwise Compositional Fusions" stage combines elements from the "Atomic Components" stage, suggesting integration and synergy.
* The "Bridge Synergies" stage highlights key synergistic effects, such as "Environmental Sustainability + Tech Innovation" and "Holistic Understanding of Complex Systems".
* The final outcome, "EcoCycle: A Sustainable Infrastructure Framework," represents the ultimate goal of this development process.
* The diagram shows how different fields and technologies converge to create a sustainable infrastructure framework.
### Interpretation
The diagram presents a structured approach to developing sustainable infrastructure. It emphasizes the importance of integrating various components, fostering synergies, and leveraging diverse disciplines to achieve a holistic and sustainable outcome. The flow from "Atomic Components" to "Final Expanded Discovery" suggests a systematic and progressive development process. The diagram highlights the interconnectedness of different elements and the importance of considering environmental sustainability and technological innovation in infrastructure development. The diagram suggests that a convergence of diverse disciplines, such as materials science, environmental science, computer science, and medicine, is crucial for creating a sustainable infrastructure framework.
</details>
Figure 18: Compositional framework applied to the longest shortest path. The flowchart illustrates the hierarchical process of compositional reasoning, beginning with atomic components (fundamental scientific concepts, left, as identified in the longest shortest path (Figure 17 (a))) and progressing through pairwise fusions, bridge synergies, and a final expanded discovery. Each stage (Steps A, B, C and D) integrates concepts systematically, ensuring interoperability, generativity, and hierarchical refinement, culminating in the EcoCycle framework for sustainable infrastructure development.
The example ultimately presents a structured approach to compositional scientific discovery, integrating principles from infrastructure materials science, environmental sustainability, and artificial intelligence to develop a novel framework for sustainable infrastructure, termed EcoCycle. As can be seen in Supporting Text 2 and in Figure 18, the compositional reasoning process proceeded through multiple hierarchical steps, ensuring the systematic combination of concepts with well-defined relationships.
At the foundational level, atomic components were identified, each representing an independent domain concept, such as biodegradable microplastic materials, self-healing materials, predictive modeling, and knowledge discovery. These fundamental elements were then combined into pairwise fusions, leveraging shared properties to generate novel synergies. For instance, the fusion of self-healing materials with pollution mitigation led to environmental self-healing systems, integrating autonomous repair mechanisms with pollution reduction strategies. Similarly, combining impact-resistant materials with machine learning algorithms enabled damage forecasting systems, enhancing predictive maintenance in infrastructure.
The validity of this compositional reasoning was established by ensuring that each fusion preserved the integrity of its constituent concepts while generating emergent functionalities. The process adhered to key compositionality principles: (1) Interoperability, ensuring that combined components interacted meaningfully rather than arbitrarily; (2) Generativity, whereby new properties emerged that were not present in the individual components; and (3) Hierarchical Refinement, wherein smaller-scale synergies were recursively integrated into higher-order bridge synergies. This led to overarching themes such as the intersection of environmental sustainability and technological innovation and the holistic understanding of complex systems, demonstrating the robustness of the approach.
Ultimately, these synergies converged into the EcoCycle framework, encapsulating self-healing, eco-responsive, and AI-optimized infrastructure solutions. The structured composition ensured that emergent discoveries were not mere aggregations but cohesive, context-aware innovations, validating the methodological rigor of the compositional approach. Using a strategy of adhering to systematic composition principles, the method used here demonstrates how interdisciplinary insights can be synthesized into scientific concepts.
For comparison, Supporting Text 3 shows the same experiment but where we use o1-pro in the final step of synthesis.
Putting this into context, earlier work [47, 48, 49, 50] have highlighted significant limitations in large language models (LLMs) concerning their ability to perform systematic compositional reasoning, particularly in domains requiring logical integration and generalization. Our approach directly addresses these deficiencies by structuring reasoning processes in a progressive and interpretable manner. Despite possessing individual components of knowledge, LLMs often struggle to integrate these dynamically to detect inconsistencies or solve problems requiring novel reasoning paths. We mitigate this by explicitly encoding relationships between concepts within a graph structure. Unlike conventional LLMs that rely on associative pattern recognition or statistical co-occurrence [47], our structured approach mitigates the concerns of mere connectionist representations by enforcing rule-based, interpretable generalization mechanisms that allow for dynamic recombination of learned knowledge in novel contexts. Further, our approach ensures that each reasoning step builds upon prior knowledge in a structured hierarchy. Steps A-D in our framework progressively construct solutions by leveraging explicit connections between concepts, enforcing compositionality rather than assuming it. For example, our approach connects biodegradable microplastic materials with self-healing materials, not merely through surface-level similarities but through defined mechanisms such as thermoreversible gelation and environmental interactions. Instead of expecting an LLM to infer relationships in a single step, our agentic model progressively traverses reasoning graphs, ensuring that the final outcome emerges through logically justified intermediary steps. This not only reduces reliance on pattern memorization but also enhances interpretability and robustness in novel scenarios.
Our model further enhances compositional reasoning through three key mechanisms:
1. Explicit Pathway Construction: By mapping dependencies between concepts in a structured graph, our model ensures that each step in the reasoning process is explicitly defined and logically connected.
1. Adaptive Contextual Integration: Instead of treating reasoning steps as isolated tasks, the model dynamically integrates intermediate results to refine its conclusions, ensuring that errors or inconsistencies in earlier stages are corrected before final predictions.
1. Hierarchical Synergy Identification: Our model analyzes multi-domain interactions through graph traversal and thereby identify emergent patterns that standard LLMs would overlook, enabling more robust and flexible reasoning. These mechanisms collectively establish a reasoning framework that mitigates compositional deficiencies and facilitates the structured synthesis of knowledge.
Table 2 summarizes how our approach directly addresses key LLM limitations identified in earlier work.
| Fails to compose multiple reasoning steps into a coherent process | Uses hierarchical reasoning with Steps A-D, ensuring progressive knowledge integration through structured dependencies. |
| --- | --- |
| Struggles to generalize beyond memorized patterns | Uses explicit graph structures to enforce systematic knowledge composition, allowing for novel reasoning paths. |
| Overfits to reasoning templates, failing on unseen reformulations | Introduces pairwise and bridge synergies to enable dynamic recombination of knowledge through structured traversal and adaptive reasoning. |
| Does not simulate "slow thinking" or iterative reasoning well | Implements an agentic model that explicitly traverses a reasoning graph rather than relying on a single forward pass, ensuring each step refines and validates prior knowledge. |
Table 2: Comparison of limitations of conventional LLMs, and our approach addresses these. By explicitly structuring relationships between concepts, breaking down reasoning into progressive steps, and incorporating dynamic knowledge recombination, our approach achieves a higher level of structured compositionality that conventional LLMs struggle with. Future work could further refine this approach by introducing adaptive feedback loops, reinforcing causal reasoning, and incorporating quantitative constraints to strengthen knowledge synergies.
Further analysis of these is left to future work, as they would exceed the scope of the present paper. The experiments show that principled approaches to expand knowledge can indeed be implemented using the methodologies described above, complementing other recent work that has explored related topics [29, 49, 23, 50, 47].
2.13 Utilization of Graph Reasoning over Key Hubs and Influencer Nodes in Response Generation
In this example, we analyze the knowledge graph $\mathcal{G}_{2}$ using NetworkX to compute node centralities (betweenness and eigenvector centrality), identifying key hubs and influencers. Community detection via the Louvain method partitions the graph into conceptual clusters, extracting representative nodes per community.
Key relationships are identified by examining high-centrality nodes and their strongest edges. These insights are formatted into a structured context and integrated into a task-specific prompt for LLM reasoning on impact-resistant materials, the same prompt that was used to construct the original graph.
The model’s response is generated both with and without graph data, followed by a comparative evaluation based on graph utilization, depth of reasoning, scientific rigor, and innovativeness. Raw responses for both models are shown in Text Boxes Supplementary Information and Supplementary Information. Table S1 provides a detailed comparison, and Figure 19 compares responses based on four key evaluation metrics (Graph Utilization, Depth of Reasoning, Scientific Rigor, and Innovativeness, along with the overall score).
<details>
<summary>x19.png Details</summary>

### Visual Description
## Bar Chart: Comparison of Responses on Impact-Resistant Material Design
### Overview
The image is a bar chart comparing two responses ("Response 1" and "Response 2") across five categories: "Graph Utilization", "Depth of Reasoning", "Scientific Rigor", "Innovativeness", and "Overall Score". Response 1 (red bars) represents data with graph data, while Response 2 (yellow-green bars) represents data without graph data. The y-axis represents the "Score".
### Components/Axes
* **Title:** Comparison of Responses on Impact-Resistant Material Design
* **Y-axis:**
* Label: Score
* Scale: 0.0 to 17.5, with increments of 2.5 (0.0, 2.5, 5.0, 7.5, 10.0, 12.5, 15.0, 17.5)
* **X-axis:**
* Categories: Graph Utilization, Depth of Reasoning, Scientific Rigor, Innovativeness, Overall Score
* **Legend:** Located at the top-left of the chart.
* Response 1 (With Graph Data): Represented by red bars.
* Response 2 (Without Graph Data): Represented by yellow-green bars.
### Detailed Analysis
* **Graph Utilization:**
* Response 1 (red): 5
* Response 2 (yellow-green): 0
* **Depth of Reasoning:**
* Response 1 (red): 4
* Response 2 (yellow-green): 3
* **Scientific Rigor:**
* Response 1 (red): 4
* Response 2 (yellow-green): 4
* **Innovativeness:**
* Response 1 (red): 5
* Response 2 (yellow-green): 3
* **Overall Score:**
* Response 1 (red): 18
* Response 2 (yellow-green): 10
### Key Observations
* Response 1 consistently scores higher than Response 2 in all categories except "Scientific Rigor", where they are equal.
* The most significant difference between the two responses is in "Overall Score", where Response 1 scores 18 and Response 2 scores 10.
* Response 2 scores 0 in "Graph Utilization".
### Interpretation
The data suggests that incorporating graph data ("Response 1") generally leads to higher scores in the assessed categories related to impact-resistant material design. The "Overall Score" shows a substantial advantage for the response that utilized graph data. The equal scores in "Scientific Rigor" might indicate that this aspect is less influenced by the presence or absence of graph data, or that both responses were equally strong in this area. The zero score for "Graph Utilization" in "Response 2" indicates a complete lack of graph usage in that response.
</details>
Figure 19: Comparison of Responses on Impact-Resistant Material Design. This plot compares two responses based on four key evaluation metrics: Graph Utilization, Depth of Reasoning, Scientific Rigor, and Innovativeness, along with the overall score. Response 1, which incorporates graph-based insights, AI/ML techniques, and interdisciplinary approaches, outperforms Response 2 in all categories. Response 2 follows a more conventional materials science approach without leveraging computational methods. The higher overall score of Response 1 highlights the benefits of integrating advanced data-driven methodologies in material design.
2.14 Use of an Agentic Deep Reasoning Model to Generate new Hypotheses and Anticipated Material Behavior
Next, we use the SciAgents model [51] with the o3-mini reasoning model [52] as the back-end, and graph $\mathcal{G_{2}}$ to answer this question: Create a research idea around impact resistant materials and resilience. Rate the novelty and feasibility in the end.
The path-finding algorithm that integrates node embeddings and a degree of randomness to enhance exploration sampling strategy [51] extracts this sub-graph from the larger graph:
Iterative Reasoning $i<N$
⬇ Impact Resistant Materials -- IS - A -- Materials -- IS - A -- Impact - Resistant Materials -- INFLUENCES -- Modular Infrastructure Systems -- RELATES - TO -- Self - Healing Materials -- RELATES - TO -- Long - term Sustainability and Environmental Footprint of Infrastructure -- RELATES - TO -- Self - Healing Materials -- RELATES - TO -- Infrastructure -- IS - A -- Infrastructure Resilience -- RELATES - TO -- Smart Infrastructure -- RELATES - TO -- Impact - Resistant Materials -- RELATES - TO -- Machine Learning Algorithms -- RELATES - TO -- Impact - Resistant Materials -- RELATES - TO -- Resilience
As described in [51] paths are sampled using a path-finding algorithm that utilizes both node embeddings and a degree of randomness to enhance exploration as a path is identified between distinct concepts. Critically, instead of simply identifying the shortest path, the algorithm introduces stochastic elements by selecting waypoints and modifying priority queues in a modified version of Dijkstra’s algorithm. This allows for the discovery of richer and more diverse paths in a knowledge graph. The resulting paths serve as the foundation for graph-based reasoning specifically geared towards research hypothesis generation, ensuring a more extensive and insightful exploration of scientific concepts.
Visualizations of the subgraph are shown in Figure 20, depicting the subgraph alone (Figure 20 (a)) and the subgraph with second hops (Figure 20 (b), showing the deep interconnectness that can be extracted).
<details>
<summary>x20.png Details</summary>

### Visual Description
## Network Diagram: Comparison of Two Networks
### Overview
The image presents two network diagrams, labeled (a) and (b), side-by-side. Each diagram consists of nodes (represented by circles) and edges (represented by lines) connecting the nodes. The size and color of the nodes appear to indicate some property or attribute of the nodes, and the lines represent relationships between them. Diagram (a) is a sparser network with fewer nodes and edges, while diagram (b) is a denser network with many more nodes and edges.
### Components/Axes
* **Nodes:** Represented by circles of varying sizes and colors. In diagram (a), the nodes range in color from light peach to dark red, with larger nodes being darker red. In diagram (b), the nodes are either teal or orange, with larger nodes being brighter orange.
* **Edges:** Represented by curved lines connecting the nodes. In diagram (a), the edges are shades of red, corresponding to the node colors. In diagram (b), the edges are teal and orange, corresponding to the node colors.
* **Labels:** The diagrams are labeled (a) and (b) in the top-left corner of each network.
### Detailed Analysis or ### Content Details
**Diagram (a):**
* **Node Count:** Approximately 8 nodes.
* **Edge Count:** Approximately 12 edges.
* **Node Size:** Node sizes vary, with one large node in the bottom center and another large node in the top-left.
* **Node Color:** Nodes range from light peach to dark red. The largest nodes are dark red.
* **Edge Color:** Edges are shades of red, matching the connected nodes.
* **Network Structure:** The network is relatively sparse, with a few central nodes connected to several peripheral nodes.
**Diagram (b):**
* **Node Count:** Approximately 100 nodes.
* **Edge Count:** Significantly more edges than diagram (a), estimated to be over 200.
* **Node Size:** Node sizes vary, with one very large orange node near the center.
* **Node Color:** Nodes are either teal or orange.
* **Edge Color:** Edges are teal and orange, matching the connected nodes.
* **Network Structure:** The network is dense and complex, with many interconnected nodes. There appears to be a central cluster of nodes with connections radiating outwards.
### Key Observations
* Diagram (a) represents a simpler network with fewer connections and a clear hierarchy of node importance (indicated by size and color).
* Diagram (b) represents a more complex network with many more connections and a less clear hierarchy.
* The color of the nodes and edges seems to be a key attribute differentiating the nodes in both networks.
* The size of the nodes appears to indicate the importance or degree of the node within the network.
### Interpretation
The two network diagrams likely represent different types of networks or the same network at different stages of development or under different conditions. Diagram (a) could represent a small social network or a simple biological pathway, while diagram (b) could represent a large social network, a complex biological network, or a technological network. The differences in node color and size likely represent different attributes of the nodes, such as their influence, activity, or type. The density of connections in diagram (b) suggests a higher degree of interaction and interdependence between the nodes compared to diagram (a). The diagrams could be used to compare the structure and properties of different networks or to visualize changes in a network over time.
</details>
Figure 20: Visualization of subgraphs extracted from $\mathcal{G}_{2}$ by SciAgents, for use in graph reasoning. The left panel (a) represents the primary subgraph containing only nodes from the specified reasoning path. Node size is proportional to the original degree in the full network, highlighting key entities with high connectivity. The structure is sparse, with key nodes acting as central hubs in the reasoning framework. The right panel (b) represents an expanded subgraph that includes second-hop neighbors. Nodes from the original subgraph are colored orange, while newly introduced second-hop nodes are green. The increased connectivity and density indicate the broader network relationships captured through second-hop expansion. Larger orange nodes remain dominant in connectivity, while green nodes form supporting structures, emphasizing peripheral interactions and their contribution to knowledge propagation. This visualization highlights how expanding reasoning pathways in a graph framework integrates additional contextual information, enriching the overall structure..
The resulting document Supporting Text 4 presents the results of applying SciAgents to $\mathcal{G}_{2}$ in the context of impact-resistant materials and infrastructure resilience. The graph representation serves as a structured framework for reasoning about the relationships between key concepts—impact-resistant materials, self-healing mechanisms, machine learning optimization, and modular infrastructure—by encoding dependencies and influences between them. Graph 2 specifically captures these interconnected domains as nodes, with edges representing logical or causal links, enabling a systematic exploration of pathways that lead to optimal material design strategies. The path traversal within the graph identifies key dependencies, such as how impact-resistant materials influence infrastructure resilience or how machine learning refines self-healing efficiency. This structured pathway-based reasoning allows SciAgents to generate research hypotheses that maximize cross-domain synergies, ensuring that material properties are not optimized in isolation but rather in concert with their broader applications in engineering and sustainability. Furthermore, graph traversal reveals emergent relationships—such as how integrating real-time sensor feedback into modular infrastructure could create self-improving materials—that might not be immediately evident through conventional linear analysis. Thus, the use of graph-based reasoning is pivotal in formulating a research framework that is not only interdisciplinary but also systematically optimized for long-term infrastructure resilience and material adaptability.
In terms of specific content, the proposed research explores an advanced composite material that integrates carbon nanotube (CNT)-reinforced polymer matrices with self-healing microcapsules, embedded sensor networks, and closed-loop ML optimization. The goal is to create a dynamically self-improving material system that enhances impact resistance and longevity in modular infrastructure. The material design is structured around several key components: (1) CNT reinforcement (1–2 wt%) to improve tensile strength and fracture toughness, (2) self-healing microcapsules (50–200 $\mu$ m) filled with polymerizable agents, (3) embedded graphene-based or PVDF strain sensors for real-time monitoring, and (4) adaptive ML algorithms that regulate stress distributions and healing responses.
The proposal establishes interconnections between several domains, highlighting the interdisciplinary nature of the research: impact-resistant materials are a subset of general materials with enhanced energy dissipation properties, modular infrastructure benefits from these materials due to increased durability, self-healing materials reduce maintenance cycles, and machine learning optimizes real-time responses to structural stress. This holistic framework aims to advance infrastructure resilience and sustainability. The research hypothesizes that embedding self-healing microcapsules within a CNT-reinforced polymer matrix will yield a composite with superior impact resistance and adaptive repair capabilities. Expected performance gains include a 50% increase in impact energy absorption (surpassing 200 J/m²), up to 80% recovery of mechanical properties after micro-damage, an estimated 30% improvement in yield strain, a 50% extension in structural lifetime, and a 30% reduction in required maintenance interventions.
The composite operates via a multi-scale integration strategy where nanoscale CNTs form a stress-bridging network, microscale healing agents autonomously restore structural integrity, and macroscale sensors collect real-time strain data to inform machine learning-based optimizations. The closed-loop ML system refines material responses dynamically, preemptively addressing stress concentrations before catastrophic failure occurs. This iterative self-optimization process is represented in the flowchart shown in Figure 21.
\sansmath Iterative Reasoning $i<N$
Impact Event (Material undergoes structural stress or damage)
Sensor Detection (Real-time strain monitoring via embedded graphene/PVDF sensors)
Machine Learning Analysis (Prediction of stress distribution, micro-damage evolution)
Healing Response Adjustment (ML-optimized activation of microcapsules based on sensor data)
Microcapsule Rupture and Repair (Self-healing agent polymerization to restore mechanical integrity)
Material Performance Feedback (Updated data informs next optimization cycle)
Adaptive Learning Cycle: Sensors collect new data, ML refines healing response
Figure 21: Flowchart of the Self-Optimizing Composite System proposed by SciAgents after reasoning over $\mathcal{G}_{2}$ . Upon an impact event, embedded sensors (cyan) detect strain changes and transmit real-time data to a machine learning system (violet). This system predicts stress evolution and dynamically adjusts healing response thresholds (light violet). Microcapsules containing polymerizable agents (green) rupture at critical points, autonomously restoring material integrity. A feedback mechanism (yellow) continuously refines the process, ensuring adaptive optimization over multiple impact cycles. The dashed feedback loop signifies that each iteration improves the material’s ability to predict and mitigate future stress events, making the system progressively more efficient.
Compared to conventional high-performance composites such as ultra-high molecular weight polyethylene (UHMWPE) and standard carbon fiber-reinforced polymers, the proposed material demonstrates superior mechanical performance and autonomous damage remediation. Traditional impact-resistant materials typically absorb 120–150 J/m² of energy, whereas this system is designed to exceed 200 J/m². Additionally, existing self-healing materials recover only 50–60% of their mechanical properties, while this composite targets an 80% restoration rate. The modular design ensures seamless integration into existing infrastructure, supporting scalability and standardization.
Beyond its core functions, the composite exhibits several emergent properties: (1) localized reinforcement zones where healing chemistry alters stress distributions, (2) increased energy dissipation efficiency over repeated impact cycles, (3) long-term self-improving feedback where ML-driven adjustments refine material performance, and (4) potential microstructural evolution, such as crystalline phase formation, that enhances impact resistance. These unexpected yet beneficial attributes highlight the adaptive nature of the material system.
The broader implications of this research include significant economic and environmental benefits. By reducing maintenance frequency by 30%, the composite lowers infrastructure downtime and lifecycle costs. The extended service life translates to a 25–30% reduction in resource consumption and associated carbon emissions. While the upfront processing cost is higher due to advanced material fabrication and sensor integration, the long-term cost per operational year is projected to be competitive with, or superior to, existing alternatives.
This interdisciplinary fusion of nanomaterials, self-healing chemistry, real-time sensor feedback, and machine learning-based control represents a fundamental shift from passive materials to smart, self-optimizing systems. The proposed research not only addresses impact resistance and self-repair but also pioneers an adaptable, continuously improving infrastructure material. The combination of rigorous experimental validation (e.g., ASTM mechanical testing, finite element modeling, and real-world simulations) ensures that the material’s theoretical advantages translate into practical performance gains. This research positions itself as a transformative solution for infrastructure resilience, bridging the gap between static engineering materials and dynamically intelligent, self-regulating composites.
3 Conclusion
This work introduced a framework for recursive graph expansion, demonstrating that self-organizing intelligence-like behavior can emerge through iterative reasoning without predefined ontologies, external supervision, or centralized control. Unlike conventional knowledge graph expansion techniques that rely on static extractions, probabilistic link predictions, or reinforcement learning-based traversal, extensive test-time compute Graph-PReFLexOR graph reasoning actively restructures its own knowledge representation as it evolves, allowing for dynamic adaptation and autonomous knowledge synthesis. These findings are generally in line with other recent results that elucidated the importance of inference scaling methods [25, 52, 53, 26].
Through extensive graph-theoretic analysis, we found that the recursively generated knowledge structures exhibit scale-free properties, hierarchical modularity, and sustained interdisciplinary connectivity, aligning with patterns observed in human knowledge systems. The formation of conceptual hubs (Figures 4 - 5) and the emergence of bridge nodes (Figures 12) demonstrate that the system autonomously organizes information into a structured yet flexible network, facilitating both local coherence and global knowledge integration. Importantly, the model does not appear to saturate or stagnate; instead, it continuously reorganizes relationships between concepts by reinforcing key conceptual linkages while allowing new hypotheses to emerge through iterative reasoning (Figures 11 and 14).
One of the most striking findings is the self-regulation of knowledge propagation pathways. The early stages of graph expansion relied heavily on a few dominant nodes (high betweenness centrality), but over successive iterations, knowledge transfer became increasingly distributed and decentralized (Figure S3). This structural transformation suggests that recursive self-organization naturally reduces bottlenecks, enabling a more resilient and scalable knowledge framework. Additionally, we observed alternating phases of conceptual stability and breakthrough, indicating that knowledge formation follows a punctuated equilibrium model, rather than purely incremental accumulation.
More broadly, the recursive self-organization process produces emergent, fractal-like knowledge structures, suggesting that similar principles may underlie both human cognition and the design of intelligent systems [42]. Moreover, the potential role of bridge nodes—as connectors and as natural intervention points—is underscored by their persistent yet shifting influence, implying they could be strategically targeted for system updates or error correction in a self-organizing network. Additionally, the observed alternating phases of stable community formation punctuated by sudden breakthroughs appear to mirror the concept of punctuated equilibrium in scientific discovery [1], offering a promising framework for understanding the natural emergence of innovation. These insights extend the implications of our work beyond scientific discovery, hinting at broader applications in autonomous reasoning, such as adaptive natural language understanding and real-time decision-making in complex environments. We demonstrated a few initial use cases where we used graph structures in attempts towards compositional reasoning, as shown in Figure 18.
3.1 Graph Evolution Dynamics: Interplay of Network Measures
The evolution of the knowledge graph reveals a complex interplay between growth, connectivity, centralization, and structural reorganization, with different network-theoretic measures exhibiting distinct yet interdependent behaviors over iterations. Initially, the system undergoes rapid expansion, as seen in the near-linear increase in the number of nodes and edges (Figure 4). However, despite this outward growth, the clustering coefficient stabilizes early (around 0.16), suggesting that the graph maintains a balance between connectivity and modularity rather than devolving into isolated clusters. This stabilization indicates that the system does not expand chaotically but instead integrates new knowledge in a structured and preferentially attached manner, reinforcing key concepts while allowing for exploration.
One of the most informative trends is the evolution of betweenness centrality (Figure 16), which starts highly concentrated in a few key nodes but then redistributes over time, reflecting a transition from hub-dominated information flow to a more decentralized and resilient network. This shift aligns with the gradual stabilization of average shortest path length (around 4.5, see Figure 9) and the graph diameter (around 16–18 steps, see Figure 5), implying that while knowledge expands, it remains navigable and does not suffer from excessive fragmentation. Meanwhile, the maximum $k$ -core index (Figure 6) exhibits a stepwise increase, reflecting structured phases of densification where core knowledge regions consolidate before expanding further. This suggests that the system undergoes punctuated reorganization, where newly introduced concepts occasionally necessitate internal restructuring before further outward growth.
Interestingly, the degree assortativity starts strongly negative (around -0.25) and trends toward neutrality (-0.05), indicating that high-degree nodes initially dominate connections but later distribute their influence, allowing mid-degree nodes to contribute to network connectivity. This effect is reinforced by the persistence of bridge nodes (Figures 6 - 16), where we see a long-tail distribution of interdisciplinary connectors—some nodes serve as transient links that appear briefly, while others persist across hundreds of iterations, indicating stable, high-impact conceptual connectors.
Taken together, these experimentally observed trends suggest that the system self-regulates its expansion, dynamically shifting between growth, consolidation, and reorganization phases. The absence of saturation in key structural properties (such as new edge formation and bridge node emergence) indicates that the model supports continuous knowledge discovery, rather than converging to a fixed-state representation. This emergent behavior, where network-wide connectivity stabilizes while conceptual expansion remains open-ended, suggests that recursive graph reasoning could serve as a scalable foundation for autonomous scientific exploration, adaptive learning, and self-organizing knowledge systems.
3.2 Relevance in the Context of Materials Science
The framework introduced in this work offers a novel paradigm for accelerating discovery in materials science by systematically structuring and expanding knowledge networks. Unlike traditional approaches that rely on static databases or predefined ontologies [54, 55, 56, 57, 58], our self-organizing method enables dynamic hypothesis generation, uncovering hidden relationships between material properties, synthesis pathways, and functional behaviors. The emergent scale-free networks observed in our experiments reflect the underlying modularity and hierarchical organization often seen in biological and engineered materials, suggesting that recursive graph-based reasoning could serve as a computational analogue to self-assembling and adaptive materials. Applied to materials design, the approach developed in this paper could reveal unexpected synergies between molecular architectures and macroscale performance, leading to new pathways for bioinspired, multifunctional, and self-healing materials. Future work can integrate experimental data directly into these reasoning loops, allowing AI-driven materials discovery to move beyond retrieval-focused recognition toward novel inference and innovation. We believe it is essential to bridge the gap between autonomous reasoning and materials informatics to ultimately create self-improving knowledge systems that can adaptively guide materials engineering efforts in real-time [59].
3.3 Broader Implications
The observations put forth in this paper have potential implications for AI-driven scientific reasoning, autonomous hypothesis generation, and scientific inquiry. As our results demonstrate, complex knowledge structures can self-organize without explicit goal-setting. This work challenges a prevailing assumption that intelligence requires externally imposed constraints or supervision. Instead, it suggests that intelligent reasoning may emerge as a fundamental property of recursive, feedback-driven information processing, mirroring cognitive processes observed in scientific discovery and human learning. Our experiments that directed the evolution of the thinking mechanisms towards a certain goal were provided with relational modeling that incorporated these concepts in a more pronounced manner, as expected, provisioning a powerful substrate for deeper reasoning.
Future work could potentially explore extending this framework to multi-agent reasoning environments, cross-domain knowledge synthesis, and real-world applications in AI-driven research discovery. Additionally, refining interpretability mechanisms will be crucial for ensuring that autonomously generated insights align with human epistemic standards, minimizing risks related to misinformation propagation and reasoning biases. Bridging graph-theoretic modeling, AI reasoning, and self-organizing knowledge dynamics, allowed us to provide a step toward building AI systems capable of autonomous, scalable, and transparent knowledge formation on their own.
We note that wile our agentic deep graph reasoning framework demonstrates promise in achieving self-organizing knowledge formation, several challenges remain. In particular, the computational scalability of recursive graph expansions and the sensitivity of emergent structures to parameter choices warrant further investigation. Future work should explore robust error-correction strategies, enhanced interpretability of evolving networks, and ethical guidelines to ensure transparency in autonomous reasoning systems, especially if deployed in commercial or public settings beyond academic research. Addressing these issues will not only refine the current model but also paves the way for its application in real-world autonomous decision-making and adaptive learning environments.
4 Materials and Methods
We describe key materials and methods developed and used in the course of this study in this section.
4.1 Graph-PReFLexOR model development
A detailed account of the Graph-PReFLexOR is provided in [27]. Graph-PReFLexOR (Graph-based Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning) is an AI model integrating in-situ graph reasoning, symbolic abstraction, and recursive reflection into generative modeling. The model was trained on a set of around 1,000 scientific papers in the biological materials and bio-inspired materials domain, as discussed in [27]. We refer readers to the original paper for implementation details, but provide a high-level summary here. The method defines reasoning as a structured mapping:
$$
M:T\rightarrow(G,P,A), \tag{2}
$$
where a given task $T$ generates a knowledge graph $G=(V,E)$ with nodes $V$ representing key concepts and edges $E$ denoting relationships, abstract patterns $P$ capturing structural dependencies, and final answers $A$ . Inspired by category theory, the approach encodes knowledge through hierarchical inference, leveraging isomorphisms to generalize across domains. The model autonomously constructs symbolic representations via a reasoning phase marked by <|thinking|> … <|/thinking|> tokens, refining understanding before generating outputs. Recursive optimization can further improve logical coherence, aligning responses with generalizable principles, a particular feature that will be expanded on in this paper.
To enhance the adaptability of structured reasoning, Graph-PReFLexOR employs an iterative feedback mechanism:
$$
R_{i+1}=f_{\text{eval}}(R_{i},F_{i}), \tag{3}
$$
where $R_{i}$ denotes the intermediate reasoning at step $i$ , $F_{i}$ is the feedback applied to improve logical structure, and $f_{\text{eval}}$ evaluates alignment with domain principles. The final answer $A$ is derived after $N$ refinements as:
$$
A=g(R_{N}). \tag{4}
$$
Through the idea to explicitly model knowledge graphs and symbolic representations, this method attempts to bridge connectionist and symbolic paradigms, facilitating multi-step reasoning, hypothesis generation, and interdisciplinary knowledge expansion. Empirical evaluations in [27] demonstrated its capability to generalize beyond training data. In this study, we take advantage of the capability of Graph-PReFLexOR to generate graph representations on the fly over a great number of iterations during which the model continues to expand its reasoning tokens.
4.2 Iterative Unconstrained Graph Reasoning on General Topic
We develop an iterative knowledge extraction pipeline to construct a structured knowledge graph using a LLM, following the flowchart shown in Figure 1. The method systematically expands a graph representation of relationships by extracting structured knowledge from model-generated reasoning sequences and generating follow-up queries to refine exploration. We use this method to construct $\mathcal{G_{1}}$ .
At the start of each run, the algorithm initializes an initial question or prompt. This can be very general or focus on a particular topic that defines the area of scientific inquiry. In the example, the topic is set as:
Iterative Reasoning $i<N$
⬇ prompt = " Discuss an interesting idea in bio - inspired materials science."
The LLM then generates structured reasoning responses within the <|thinking|> … <|/thinking|> tokens. The response is processed to extract structured knowledge by isolating the graph.
To convert the extracted knowledge into a structured representation, the model is queried with an additional instruction to transform the resulting raw text that contains the reasoning graph (denoted by {raw graph}) into a Python dictionary formatted for graph representation:
Iterative Reasoning $i<N$
⬇ You are an AI that extracts information from structured text and outputs a graph in Python dictionary format compatible with NetworkX. Given the following structured text: {raw graph} Output the graph as a Python dictionary without any additional text or explanations. Ensure the dictionary is properly formatted for immediate evaluation in Python.
The output is parsed and structured using ast.literal_eval() to construct a directed graph $\mathcal{G}_{\text{local}}^{i}$ in NetworkX, where nodes represent entities such as materials, properties, and scientific concepts, while edges encode relationships such as HAS, INFLUENCES, and SIMILAR-TO.
At each iteration $i$ , the newly extracted knowledge graph is appended to an evolving global graph:
$$
\mathcal{G}\leftarrow\mathcal{G}\cup\mathcal{G}_{\text{local}}^{i}. \tag{5}
$$
The extracted structure is parsed using:
graph_code, graph_dict = extract_graph_from_text(graph)
The graph is progressively expanded by adding newly introduced nodes and edges, ensuring that redundant relationships are not duplicated. The final knowledge graph is stored in multiple formats, including GraphML for structural analysis and PNG for visualization.
To facilitate continued exploration, a follow-up question is generated at each iteration. The LLM is queried to produce a question that introduces a new aspect of the domain, ensuring an iterative, self-refining process that utilizes the previously generated entities and relations:
Iterative Reasoning $i<N$
⬇ Consider this list of topics / keywords. Formulate a creative follow - up question to ask about a totally new concept. Your question should include at least one of the original topics / keywords. Original list of topics / keywords: {latest extracted entities and relations} Reply only with the new question. The new question is:
This ensures that subsequent queries remain contextually grounded in the domain while promoting scientific discovery. The generated question is appended to the reasoning token structure and fed back into the LLM, thereby continuing the iterative learning process.
The algorithm runs for a total of $N$ iterations, progressively refining the knowledge graph. At each step, we track the growth of the graph by recording the number of nodes and edges over time. The final knowledge graph provides a structured and extensible representation of insights extracted from the LLM, enabling downstream analysis of emerging concepts. The reasoning process (Figure 1) unfolds sequentially over a period of several days (using a consumer GPU, like NVIDIA A6000 Ada).
4.3 Iterative Graph Reasoning on a Particular Topic
As an alternative to the approach above, we can tailor the reasoning process to focus more strongly on a particular topic. We use this method to construct $\mathcal{G_{2}}$ . For instance, at the beginning of each run, the algorithm is initialized with a user-defined topic:
Iterative Reasoning $i<N$
⬇ topic = " impact resistant materials "
This variable defines the area of exploration and is dynamically incorporated into the model prompts. The LLM is then queried with a topic-conditioned instruction to generate structured reasoning tokens:
Iterative Reasoning $i<N$
⬇ Describe a way to design {topic}.
The model generates textual responses that include explicit reasoning within the <|thinking|> … <|/thinking|> markers. As before, from this output, we extract structured knowledge by isolating the section labeled graph, to extract entity-relationship pairs. A follow-up question is generated at each iteration to drive the discovery process forward. This prompt ensures that new queries focus on underexplored aspects of the knowledge graph while maintaining the topic-conditioned structure:
Iterative Reasoning $i<N$
⬇ Consider this list of keywords. Considering the broad topic of {topic}, formulate a creative follow - up question to ask about a totally new aspect. Your question should include at least one of the original keywords. Original list of keywords: {latest extracted entities and relations} Reply only with the new question. The new question is:
This ensures that each iteration remains contextually grounded in the specified domain while continuously expanding the knowledge graph.
The process continues for $N$ steps, progressively refining the knowledge graph. At each iteration, we track the growth of the graph by recording the number of nodes and edges. The resulting knowledge graph serves as a structured repository of insights extracted from the LLM, enabling downstream analysis of materials properties and design principles.
Naturally, other variants of these strategies could easily be devised, for instance to create other generalist graphs (akin to $\mathcal{G}_{1}$ ) or specialized graphs (akin to $\mathcal{G}_{2}$ ). Prompt engineering can be human-tailored or developed agentically by other AI systems.
4.4 Graph Analysis and Visualization
Graph analysis and visualizations are conducted using NetworkX [60], Gephi [61], Cytoscope [62], Mermaid https://mermaid.js.org/, and various plugins within these packages.
4.4.1 Basic Analysis of Recursive Graph Growth over Reasoning Iterations
To analyze the recursive expansion of the knowledge graph, we computed a set of graph-theoretic properties at each iteration using the NetworkX Python library. Graph data was stored in GraphML format, with filenames encoded to reflect the iteration number, allowing for chronological tracking of structural changes. Each graph was sequentially loaded and processed to extract key metrics that characterize its connectivity, topology, and hierarchical organization.
The fundamental properties of the graph, including the number of nodes and edges, were directly retrieved from the graph structure. The degree distribution was computed across all nodes to derive the average degree, representing the mean connectivity per node, and the maximum degree, which highlights the most connected node at each iteration. To assess network cohesion, the largest connected component (LCC) was extracted by identifying the largest strongly connected component in directed graphs and the largest connected subgraph in undirected cases. The clustering coefficient was computed using the standard local clustering metric, which quantifies the likelihood that a node’s neighbors are also connected to each other. The average clustering coefficient was obtained by averaging over all nodes in the graph, providing insight into the tendency of local structures to form tightly connected clusters.
To assess global connectivity and efficiency, we computed the average shortest path length (SPL) and the graph diameter within the largest connected component. The SPL was obtained by calculating the mean shortest path distance between all pairs of nodes in the LCC, while the diameter was determined as the longest shortest path observed in the component. Since these calculations are computationally expensive for large graphs, they were conditionally executed only when the LCC was sufficiently small or explicitly enabled in the analysis. For community detection, we applied the Louvain modularity algorithm using the community-louvain package. The graph was treated as undirected for this step, and the modularity score was computed by partitioning the graph into communities that maximize the modularity function. This metric captures the extent to which the graph naturally organizes into distinct clusters over iterations.
The entire analysis pipeline iterated over a series of GraphML files, extracting the iteration number from each filename and systematically computing these metrics. The results were stored as time series arrays and visualized through multi-panel plots, capturing trends in network evolution. To optimize performance, computationally intensive operations, such as shortest path calculations and modularity detection, were executed conditionally based on graph size and software availability. To further examine the structural evolution of the recursively generated knowledge graph, we computed a set of advanced graph-theoretic metrics over iterative expansions. As before, the analysis was conducted over a series of iterations, allowing for the study of emergent network behaviors.
The degree assortativity coefficient was computed to measure the correlation between node degrees, assessing whether high-degree nodes preferentially connect to similar nodes. This metric provides insight into the network’s structural organization and whether its expansion follows a preferential attachment mechanism. The global transitivity, defined as the fraction of closed triplets among all possible triplets, was calculated to quantify the overall clustering tendency of the graph and detect the emergence of tightly interconnected regions. To assess the hierarchical connectivity structure, we performed $k$ -core decomposition, which identifies the maximal subgraph where all nodes have at least $k$ neighbors. We extracted the maximum $k$ -core index, representing the deepest level of connectivity within the network, and computed the size of the largest $k$ -core, indicating the robustness of highly connected core regions.
For understanding the importance of individual nodes in information flow, we computed average betweenness centrality over the largest connected component. Betweenness centrality quantifies the extent to which nodes serve as intermediaries in shortest paths, highlighting critical nodes that facilitate efficient navigation of the knowledge graph. Since exact computation of betweenness centrality can be computationally expensive for large graphs, it was performed only within the largest component to ensure feasibility. Additionally, we identified articulation points, which are nodes whose removal increases the number of connected components in the network. The presence and distribution of articulation points reveal structural vulnerabilities, highlighting nodes that serve as key bridges between different knowledge regions.
4.4.2 Prediction of Newly Connected Pairs
To track the evolution of connectivity in the recursively expanding knowledge graph, we employed a random sampling approach to estimate the number of newly connected node pairs at each iteration. Given the computational cost of computing all-pairs shortest paths in large graphs, we instead sampled a fixed number of node pairs per iteration and measured changes in their shortest path distances over time.
Sampling Strategy. At each iteration, we randomly selected 1,000 node pairs from the current set of nodes in the global knowledge graph. For each sampled pair $(u,v)$ , we computed the shortest path length in the graph using Breadth-First Search (BFS), implemented via nx.single_source_shortest_path_length(G, src). If a path existed, its length was recorded; otherwise, it was marked as unreachable.
Tracking Newly Connected Pairs. To detect the formation of new connections, we maintained a record of shortest path distances from the previous iteration and compared them with the current distances. A pair $(u,v)$ was classified as:
- Newly connected if it was previously unreachable ( $\text{dist}_{\text{before}}=\text{None}$ ) but became connected ( $\text{dist}_{\text{now}}≠\text{None}$ ).
- Having a shorter path if its shortest path length decreased between iterations ( $\text{dist}_{\text{now}}<\text{dist}_{\text{before}}$ ).
The number of newly connected pairs and the number of pairs with shortened paths were recorded for each iteration.
Graph Integration and Visualization. At each iteration, the newly processed graph was merged into a global knowledge graph, ensuring cumulative analysis over time. The number of newly connected pairs per iteration was plotted as a time series, revealing patterns in connectivity evolution. This method effectively captures structural transitions, particularly the initial burst of connectivity formation followed by a steady-state expansion phase, as observed in the results.
By employing this approach, we achieved a computationally efficient yet statistically robust estimate of network connectivity evolution, allowing us to analyze the self-organizing dynamics of the reasoning process over large iterative expansions.
4.4.3 Graph Structure and Community Analysis
To examine the structural properties of the recursively generated knowledge graph, we performed a comprehensive analysis of node connectivity, degree distribution, clustering behavior, shortest-path efficiency, and community structure. The graph was loaded from a GraphML file using the NetworkX library, and various metrics were computed to assess both local and global network properties.
Basic Graph Properties. The fundamental characteristics of the graph, including the number of nodes, edges, and average degree, were extracted. Additionally, the number of self-loops was recorded to identify redundant connections that may influence network dynamics.
Graph Component Analysis. To ensure robust connectivity analysis, the largest connected component (LCC) was extracted for undirected graphs, while the largest strongly connected component (SCC) was used for directed graphs. This ensured that further structural computations were performed on a fully connected subgraph, avoiding artifacts from disconnected nodes.
Degree Distribution Analysis. The degree distribution was computed and visualized using both a linear-scale histogram and a log-log scatter plot. The latter was used to assess whether the network exhibits a power-law degree distribution, characteristic of scale-free networks.
Clustering Coefficient Analysis. The local clustering coefficient, which quantifies the tendency of nodes to form tightly connected triads, was computed for each node. The distribution of clustering coefficients was plotted, and the average clustering coefficient was recorded to evaluate the extent of modular organization within the network.
Centrality Measures. Three centrality metrics were computed to identify influential nodes: (i) Betweenness centrality, which measures the extent to which nodes act as intermediaries in shortest paths, highlighting key connectors in the knowledge graph; (ii) Closeness centrality, which quantifies the efficiency of information propagation from a given node; (iii) Eigenvector centrality, which identifies nodes that are highly influential due to their connections to other high-importance nodes.
Shortest Path Analysis. The average shortest path length (SPL) and graph diameter were computed to evaluate the network’s navigability. Additionally, a histogram of sampled shortest path lengths was generated to analyze the distribution of distances between randomly selected node pairs (2,000 samples used).
Community Detection and Modularity. The Louvain modularity algorithm was applied (if available) to partition the network into communities and assess its hierarchical structure. The modularity score was computed to quantify the strength of the detected community structure, and the resulting partitions were visualized using a force-directed layout.
4.4.4 Analysis of Conceptual Breakthroughs
The evolution of knowledge graphs is analyzed by processing a sequence of graph snapshots stored in GraphML format. Each graph is indexed by an iteration number, extracted using a regular expression from filenames of the form graph_iteration_#.graphml. The graphs are sequentially loaded and processed to ensure consistency across iterations. If the graph is directed, it is converted to an undirected format using the networkx.to_undirected() function. To ensure structural integrity, we extract the largest connected component using the networkx.connected_components() function, selecting the subgraph with the maximum number of nodes.
For each iteration $t$ , we compute the degree distribution of all nodes in the largest connected component. The degree of a node $v$ in graph $G_{t}=(V_{t},E_{t})$ is given by:
$$
d_{t}(v)=\sum_{u\in V_{t}}A_{t}(v,u) \tag{6}
$$
where $A_{t}$ is the adjacency matrix of $G_{t}$ . The computed degree distributions are stored in a dictionary and later aggregated into a pandas DataFrame for further analysis.
To track the emergence of top hubs, we define a node $v$ as a hub if it attains a high degree at any iteration. The set of top hubs is determined by selecting the nodes with the highest maximum degree across all iterations:
$$
H=\{v\mid\max_{t}d_{t}(v)\geq d_{\text{top},10}\}
$$
where $d_{\text{top},10}$ is the degree of the 10th highest-ranked node in terms of maximum degree. The degree growth trajectory of each hub is then extracted by recording $d_{t}(v)$ for all $t$ where $v∈ V_{t}$ .
To quantify the emergence of new hubs, we define an emergence threshold $d_{\text{emerge}}=5$ , considering a node as a hub when its degree first surpasses this threshold. The first significant appearance of a node $v$ is computed as:
$$
t_{\text{emerge}}(v)=\min\{t\mid d_{t}(v)>d_{\text{emerge}}\}
$$
for all $v$ where such $t$ exists. The histogram of $t_{\text{emerge}}(v)$ across all nodes provides a temporal distribution of hub emergence.
To evaluate global network connectivity, we compute the mean degree at each iteration:
$$
\bar{d}_{t}=\frac{1}{|V_{t}|}\sum_{v\in V_{t}}d_{t}(v) \tag{7}
$$
capturing the overall trend in node connectivity as the knowledge graph evolves.
Three key visualizations are generated: (1) the degree growth trajectories of top hubs, plotted as $d_{t}(v)$ over time for $v∈ H$ ; (2) the emergence of new hubs, represented as a histogram of $t_{\text{emerge}}(v)$ ; and (3) the overall network connectivity, visualized as $\bar{d}_{t}$ over iterations.
4.4.5 Structural Evolution of the Graphs: Knowledge Communities, Bridge Nodes and Multi-hop Reasoning
We analyze the structural evolution of knowledge graphs by computing three key metrics: (1) the number of distinct knowledge communities over time, (2) the emergence of bridge nodes that connect different knowledge domains, and (3) the depth of multi-hop reasoning based on shortest path lengths. These metrics are computed for each iteration $t$ of the evolving graph and visualized as follows.
The evolution of knowledge communities is measured using the Louvain modularity optimization algorithm, implemented via community.best_partition(), which partitions the graph into distinct communities. For each iteration, the number of detected communities $|C_{t}|$ is computed as:
$$
|C_{t}|=|\{c\mid c=P_{t}(v),v\in V_{t}\}|
$$
where $P_{t}(v)$ maps node $v$ to its assigned community at iteration $t$ . The values of $|C_{t}|$ are plotted over iterations to track the subdivision and merging of knowledge domains over time.
The emergence of bridge nodes, nodes that connect multiple communities, is determined by examining the community affiliations of each node’s neighbors. A node $v$ is classified as a bridge node if:
$$
|\mathcal{C}(v)|>1,\quad\text{where}\quad\mathcal{C}(v)=\{P_{t}(u)\mid u\in N(%
v)\}
$$
and $N(v)$ represents the set of neighbors of $v$ . The number of bridge nodes is computed per iteration and plotted to analyze how interdisciplinary connections emerge over time.
The depth of multi-hop reasoning is quantified by computing the average shortest path length for the largest connected component at each iteration:
$$
L_{t}=\frac{1}{|V_{t}|(|V_{t}|-1)}\sum_{v,u\in V_{t},v\neq u}d_{\text{sp}}(v,u)
$$
where $d_{\text{sp}}(v,u)$ is the shortest path distance between nodes $v$ and $u$ , computed using networkx.average_shortest_path_length(). This metric captures the evolving complexity of conceptual reasoning chains in the knowledge graph.
We generate three plots: (1) the evolution of knowledge communities, visualizing $|C_{t}|$ over time; (2) the emergence of bridge nodes, displaying the number of inter-community connectors per iteration; and (3) the depth of multi-hop reasoning, tracking $L_{t}$ as a function of iteration number.
To analyze the temporal stability of bridge nodes in the evolving knowledge graph, we compute the persistence of bridge nodes, which quantifies how long individual nodes function as bridges across multiple iterations. Given the bridge node set $B_{t}$ at iteration $t$ , the persistence count for a node $v$ is defined as:
$$
P(v)=\sum_{t}\mathbb{1}(v\in B_{t})
$$
where $\mathbb{1}(·)$ is the indicator function that equals 1 if $v$ appears as a bridge node at iteration $t$ , and 0 otherwise. This metric captures the frequency with which each node serves as a conceptual connector between different knowledge domains.
To visualize the distribution of bridge node persistence, we construct a histogram of $P(v)$ across all detected bridge nodes, with kernel density estimation (KDE) applied for smoother visualization. The histogram provides insight into whether bridge nodes are transient or persist over multiple iterations.
The persistence values are computed and stored in a structured dataset, which is then used to generate a plot of the histogram of bridge node persistence.
To analyze the temporal dynamics of bridge node emergence, we construct a binary presence matrix that tracks when individual nodes first appear as bridges. The matrix is used to visualize the earliest bridge nodes over time, capturing the structural formation of key conceptual connectors.
The binary presence matrix is defined as follows. Given a set of bridge node lists $B_{t}$ for each iteration $t$ , we construct a matrix $M$ where each row corresponds to an iteration and each column corresponds to a unique bridge node. The matrix entries are:
$$
M_{t,v}=\begin{cases}1,&v\in B_{t}\\
0,&\text{otherwise}\end{cases}
$$
where $M_{t,v}$ indicates whether node $v$ appears as a bridge at iteration $t$ . The full set of unique bridge nodes across all iterations is extracted to define the columns of $M$ .
To identify the earliest appearing bridge nodes we compute the first iteration in which each node appears:
$$
t_{\text{first}}(v)=\min\{t\mid M_{t,v}=1\}
$$
The top 100 earliest appearing bridge nodes are selected by ranking nodes based on $t_{\text{first}}(v)$ , keeping those with the smallest values. The binary matrix is then restricted to these nodes.
To capture early-stage network formation, the analysis is limited to the first 200 iterations, ensuring that the onset of key bridge nodes is clearly visible. The final presence matrix $M^{\prime}$ is reordered so that nodes are sorted by their first appearance, emphasizing the sequential nature of bridge formation.
The matrix is visualized as a heatmap (Figure 13), where rows correspond to the top 100 earliest appearing bridge nodes and columns represent iterations. A blue-scale colormap is used to indicate presence (darker shades for active nodes).
To analyze the evolution of key bridge nodes in the knowledge graph, we compute and track the betweenness centrality of all nodes across multiple iterations. Betweenness centrality quantifies the importance of a node as an intermediary in shortest paths and is defined as:
$$
C_{B}(v)=\sum_{s\neq v\neq t}\frac{\sigma_{st}(v)}{\sigma_{st}}
$$
where $\sigma_{st}$ is the total number of shortest paths between nodes $s$ and $t$ , and $\sigma_{st}(v)$ is the number of those paths that pass through $v$ . This measure is recalculated at each iteration to observe structural changes in the network.
The computational procedure is as follows:
1. Graph Loading: Graph snapshots are loaded from GraphML files, indexed by iteration number. If a graph is directed, it is converted to an undirected format using networkx.to_undirected() to ensure consistent betweenness computations.
1. Betweenness Centrality Calculation: For each graph $G_{t}$ at iteration $t$ , the betweenness centrality for all nodes is computed using networkx.betweenness_centrality().
1. Time Series Construction: The computed centrality values are stored in a time-series matrix $B$ , where rows correspond to iterations and columns correspond to nodes:
$$
B_{t,v}=C_{B}(v)\quad\forall v\in V_{t}
$$
Missing values (nodes absent in certain iterations) are set to zero to maintain a consistent matrix structure.
To identify key bridge nodes, we extract the top ten nodes with the highest peak betweenness at any iteration:
$$
H=\{v\mid\max_{t}B_{t,v}\geq B_{\text{top},10}\}
$$
where $B_{\text{top},10}$ represents the 10th highest betweenness value recorded across all iterations. The time-series data is filtered to retain only these nodes.
To visualize the dynamic role of key bridge nodes, we generate a line plot of betweenness centrality evolution where each curve represents the changing centrality of a top bridge node over iterations. This graph captures how structural importance fluctuates over time.
4.5 Agentic Approach to Reason over Longest Shortest Paths
We employ an agentic approach to analyze structured knowledge representations in the form of a graph $G=(V,E)$ , where $V$ represents the set of nodes (concepts) and $E$ represents the set of edges (relationships). The methodology consists of four primary steps: (i) extraction of the longest knowledge path, (ii) decentralized node and relationship reasoning, (iii) multi-agent synthesis, and (iv) structured report generation.
Path Extraction. The input knowledge graph $G$ is first converted into an undirected graph $G^{\prime}=(V,E^{\prime})$ where $E^{\prime}$ contains bidirectional edges to ensure reachability across all nodes. We extract the largest connected component $G_{c}$ by computing:
$$
G_{c}=\arg\max_{S\in\mathcal{C}(G^{\prime})}|S|
$$
where $\mathcal{C}(G^{\prime})$ is the set of all connected components in $G^{\prime}$ . The longest shortest path, or diameter path, is determined by computing the eccentricity:
$$
\epsilon(v)=\max_{u\in V}d(v,u),
$$
where $d(v,u)$ is the shortest path length between nodes $v$ and $u$ . The source node is selected as $v^{*}=\arg\max_{v∈ V}\epsilon(v)$ , and the farthest reachable node from $v^{*}$ determines the longest path.
Numerically, the longest paths are determined by computing node eccentricities using networkx.eccentricity(), which identifies the most distant node pairs in terms of shortest paths. The five longest shortest paths are extracted with networkx.shortest_path(). For each extracted path, we assign node-level structural metrics computed from the original graph. The node degree is obtained using networkx.degree(), betweenness centrality is computed with networkx.betweenness_centrality(), and closeness centrality is determined via networkx.closeness_centrality(). Each identified path is saved as a GraphML file using networkx.write_graphml() with these computed node attributes for further analysis.
Decentralized Node and Relationship Reasoning. Each node $v_{i}∈ V$ and each relationship $e_{ij}∈ E$ along the longest path is analyzed separately. A language model $f_{\theta}$ is prompted with:
$$
\text{LLM}(v_{i})=f_{\theta}(\text{``Analyze concept }v_{i}\text{ in a novel %
scientific context."})
$$
for nodes, and
$$
\text{LLM}(e_{ij})=f_{\theta}(\text{``Analyze relationship }e_{ij}\text{ and %
hypothesize new implications."})
$$
for relationships. This enables independent hypothesis generation at the atomic level.
Multi-Agent Synthesis. The set of independent insights $\mathcal{I}=\{I_{1},I_{2},...\}$ is aggregated, and a final inference step is performed using:
$$
I_{\text{final}}=f_{\theta}(\text{``Synthesize a novel discovery from }%
\mathcal{I}\text{."})
$$
This allows the model to infer higher-order patterns beyond individual node-relationship reasoning.
Structured Report Generation. The final response, along with intermediate insights, is formatted into a structured markdown report containing:
- The extracted longest path
- Individual insights per node and relationship
- The final synthesized discovery
This approach leverages multi-step reasoning and recursive inference, allowing for emergent discoveries beyond explicit graph-encoded knowledge.
4.5.1 Agent-driven Compositional Reasoning
We employ a multi-step agentic approach that couples LLMs with graph-based compositional reasoning. To develop such an approach, we load the graph and locate its largest connected component. We compute eccentricities to identify two far-apart nodes, then extract the longest shortest path between them. Each node in that path becomes a “building block,” for which the LLM provides a concise definition, principles, and a property conducive to synergy (Step A). Next, we prompt the LLM to create pairwise synergies by merging adjacent building blocks, encouraging a short, compositional statement that unifies the nodes’ respective features (Step B). To deepen the layering of ideas, we consolidate multiple synergy statements into bridge synergies that capture cross-cutting themes (Step C). Finally, we issue a more elaborate prompt asking the LLM to integrate all building blocks and synergies into an expanded, coherent “final discovery,” referencing both prior statements and each node’s defining traits (Step D). This process yields a multi-step compositional approach, wherein each synergy can build on earlier results to reveal increasingly sophisticated connections. The initial steps A-C are carried out using meta-llama/Llama-3.2-3B-Instruct, whereas the final integration of the response in Step D is conducted using meta-llama/Llama-3.3-70B-Instruct. We also experimented with other models, such as o1-pro as discussed in the main text.
4.6 Scale free analysis
To determine whether a given network exhibits scale-free properties, we analyze its degree distribution using the power-law fitting method implemented in the powerlaw Python package. The algorithm extracts the degree sequence from the input graph and fits a power-law distribution, estimating the exponent $\alpha$ and lower bound $x_{\min}$ . To assess whether the power-law is a preferable fit, we compute the log-likelihood ratio (LR) between the power-law and an exponential distribution, along with the corresponding $p$ -value. A network is classified as scale-free if LR is positive and $p<0.05$ , indicating statistical support for the power-law hypothesis. The method accounts for discrete degree values and excludes zero-degree nodes from the fitting process.
4.7 Audio Summary in the Form of a Podcast
Supplementary Audio A1 presents an audio summary of this paper in the style of a podcast, created using PDF2Audio (https://huggingface.co/spaces/lamm-mit/PDF2Audio [51]). The audio format in the form a conversation enables reader to gain a broader understanding of the results of this paper, including expanding the broader impact of the work. The transcript was generated using the o3-mini model [52] from the final draft of the paper.
Code, data and model weights availability
Codes, model weights and additional materials are available at https://huggingface.co/lamm-mit and https://github.com/lamm-mit/PRefLexOR. The model used for the experiments is available at lamm-mit/Graph-Preflexor_01062025.
Conflicts of Interest
The author declares no conflicts of interest of any kind.
Acknowledgments
The author acknowledges support from the MIT Generative AI initiative.
References
- [1] Kuhn, T. S. The Structure of Scientific Revolutions (University of Chicago Press, 1962).
- [2] Spivak, D., Giesa, T., Wood, E. & Buehler, M. Category theoretic analysis of hierarchical protein materials and social networks. PLoS ONE 6 (2011).
- [3] Giesa, T., Spivak, D. & Buehler, M. Reoccurring Patterns in Hierarchical Protein Materials and Music: The Power of Analogies. BioNanoScience 1 (2011).
- [4] Giesa, T., Spivak, D. & Buehler, M. Category theory based solution for the building block replacement problem in materials design. Advanced Engineering Materials 14 (2012).
- [5] Vaswani, A. et al. Attention is All you Need (2017). URL https://papers.nips.cc/paper/7181-attention-is-all-you-need.
- [6] Alec Radford, Karthik Narasimhan, Tim Salimans & Ilya Sutskever. Improving Language Understanding by Generative Pre-Training URL https://gluebenchmark.com/leaderboard.
- [7] Xue, L. et al. ByT5: Towards a token-free future with pre-trained byte-to-byte models. Transactions of the Association for Computational Linguistics 10, 291–306 (2021). URL https://arxiv.org/abs/2105.13626v3.
- [8] Jiang, A. Q. et al. Mistral 7B (2023). URL http://arxiv.org/abs/2310.06825.
- [9] Phi-2: The surprising power of small language models - Microsoft Research. URL https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/.
- [10] Dubey, A. et al. The llama 3 herd of models (2024). URL https://arxiv.org/abs/2407.21783. 2407.21783.
- [11] Brown, T. B. et al. Language Models are Few-Shot Learners (2020).
- [12] Salinas, H. et al. Exoplanet transit candidate identification in tess full-frame images via a transformer-based algorithm (2025). URL https://arxiv.org/abs/2502.07542. 2502.07542.
- [13] Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid-state materials science. npj Computational Materials 5 (2019). URL https://doi.org/10.1038/s41524-019-0221-0.
- [14] Buehler, E. L. & Buehler, M. J. X-LoRA: Mixture of Low-Rank Adapter Experts, a Flexible Framework for Large Language Models with Applications in Protein Mechanics and Design (2024). URL https://arxiv.org/abs/2402.07148v1.
- [15] Arevalo, S. E. & Buehler, M. J. Learning from nature by leveraging integrative biomateriomics modeling toward adaptive and functional materials. MRS Bulletin 2023 1–14 (2023). URL https://link.springer.com/article/10.1557/s43577-023-00610-8.
- [16] Hu, Y. & Buehler, M. J. Deep language models for interpretative and predictive materials science. APL Machine Learning 1, 010901 (2023). URL https://aip.scitation.org/doi/abs/10.1063/5.0134317.
- [17] Szymanski, N. J. et al. Toward autonomous design and synthesis of novel inorganic materials. Mater. Horiz. 8, 2169–2198 (2021). URL http://dx.doi.org/10.1039/D1MH00495F.
- [18] Vamathevan, J. et al. Applications of machine learning in drug discovery and development. Nature Reviews Drug Discovery 18, 463–477 (2019).
- [19] Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 1–12 (2021).
- [20] Protein structure prediction by trRosetta. URL https://yanglab.nankai.edu.cn/trRosetta/.
- [21] Wu, R. et al. High-resolution de novo structure prediction from primary sequence. bioRxiv 2022.07.21.500999 (2022). URL https://www.biorxiv.org/content/10.1101/2022.07.21.500999v1.
- [22] Abbott, V. & Zardini, G. Flashattention on a napkin: A diagrammatic approach to deep learning io-awareness (2024). URL https://arxiv.org/abs/2412.03317. 2412.03317.
- [23] Buehler, M. J. Graph-aware isomorphic attention for adaptive dynamics in transformers (2025). URL https://arxiv.org/abs/2501.02393. 2501.02393.
- [24] Miconi, T. & Kay, K. Neural mechanisms of relational learning and fast knowledge reassembly in plastic neural networks. Nature Neuroscience 28, 406–414 (2025). URL https://www.nature.com/articles/s41593-024-01852-8.
- [25] OpenAI et al. OpenAI o1 system card (2024). URL https://arxiv.org/abs/2412.16720. 2412.16720.
- [26] Buehler, M. J. Preflexor: Preference-based recursive language modeling for exploratory optimization of reasoning and agentic thinking (2024). URL https://arxiv.org/abs/2410.12375. 2410.12375.
- [27] Buehler, M. J. In-situ graph reasoning and knowledge expansion using graph-preflexor (2025). URL https://arxiv.org/abs/2501.08120. 2501.08120.
- [28] Reddy, C. K. & Shojaee, P. Towards scientific discovery with generative ai: Progress, opportunities, and challenges. arXiv preprint arXiv:2412.11427 (2024). URL https://arxiv.org/abs/2412.11427.
- [29] Buehler, M. J. Accelerating scientific discovery with generative knowledge extraction, graph-based representation, and multimodal intelligent graph reasoning. Mach. Learn.: Sci. Technol. 5, 035083 (2024). Accepted Manuscript online 21 August 2024, © 2024 The Author(s). Open Access.
- [30] Brin, S. Extracting patterns and relations from the world wide web. In International Workshop on The World Wide Web and Databases (WebDB), 172–183 (1998).
- [31] Etzioni, O. et al. Knowitall: Fast, scalable, and self-supervised web information extraction. In Proceedings of the 13th International World Wide Web Conference (WWW), 100–110 (2004).
- [32] Banko, M., Cafarella, M. J., Soderland, S., Broadhead, M. & Etzioni, O. Open information extraction from the web. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), 2670–2676 (2007).
- [33] Etzioni, O., Fader, A., Christensen, J., Soderland, S. & Mausam. Open information extraction: The second generation. In Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI), 3–10 (2011).
- [34] Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J. & Yakhnenko, O. Translating embeddings for modeling multi-relational data. In Advances in Neural Information Processing Systems (NeurIPS), 2787–2795 (2013).
- [35] Galárraga, L. A., Teflioudi, C., Hose, K. & Suchanek, F. M. Amie: Association rule mining under incomplete evidence in ontological knowledge bases. In Proceedings of the 22nd International World Wide Web Conference (WWW), 413–422 (2013).
- [36] Carlson, A. et al. Toward an architecture for never-ending language learning. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI), 1306–1313 (2010).
- [37] Dong, X. L. et al. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 601–610 (2014).
- [38] Xiong, W., Hoang, T. & Wang, W. Y. Deeppath: A reinforcement learning method for knowledge graph reasoning. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), 564–573 (2017).
- [39] Swanson, D. R. Fish oil, Raynaud’s syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine 30, 7–18 (1986).
- [40] Cameron, D. et al. A graph-based recovery and decomposition of swanson’s hypothesis using semantic predications. Journal of Biomedical Informatics 46, 238–251 (2013). URL https://doi.org/10.1016/j.jbi.2012.09.004.
- [41] Nickel, M., Murphy, K., Tresp, V. & Gabrilovich, E. A review of relational machine learning for knowledge graphs. Proceedings of the IEEE 104, 11–33 (2016).
- [42] Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. Science 286, 509–512 (1999).
- [43] White, S. R. et al. Autonomic healing of polymer composites. Nature 409, 794–797 (2001).
- [44] Bar-Yam, Y. Dynamics of complex systems ISBN 0813341213 (1997). URL https://necsi.edu/dynamics-of-complex-systems.
- [45] Bhushan, B. Biomimetics: lessons from nature–an overview. Philosophical Transactions of the Royal Society A 367, 1445–1486 (2009).
- [46] Nepal, D. et al. Hierarchically structured bioinspired nanocomposites. Nature Materials 2022 1–18 (2022). URL https://www.nature.com/articles/s41563-022-01384-1.
- [47] Fodor, J. A. & Pylyshyn, Z. W. Connectionism and cognitive architecture: A critical analysis. Cognition 28, 3–71 (1988).
- [48] Zhao, J. et al. Exploring the compositional deficiency of large language models in mathematical reasoning. arXiv preprint arXiv:2405.06680 (2024). URL https://arxiv.org/abs/2405.06680. 2405.06680.
- [49] Shi, J. et al. Cryptox: Compositional reasoning evaluation of large language models. arXiv preprint arXiv:2502.07813 (2025). URL https://arxiv.org/abs/2502.07813. 2502.07813.
- [50] Xu, Z., Shi, Z. & Liang, Y. Do large language models have compositional ability? an investigation into limitations and scalability (2024). URL https://arxiv.org/abs/2407.15720. 2407.15720.
- [51] Ghafarollahi, A. & Buehler, M. J. Sciagents: Automating scientific discovery through multi-agent intelligent graph reasoning (2024). URL https://arxiv.org/abs/2409.05556. 2409.05556.
- [52] OpenAI. OpenAI o3-mini system card (2025). URL https://openai.com/index/o3-mini-system-card/.
- [53] Geiping, J. et al. Scaling up test-time compute with latent reasoning: A recurrent depth approach (2025). URL https://arxiv.org/abs/2502.05171. 2502.05171.
- [54] Arevalo, S. & Buehler, M. J. Learning from nature by leveraging integrative biomateriomics modeling toward adaptive and functional materials. MRS Bulletin (2023). URL https://link.springer.com/article/10.1557/s43577-023-00610-8.
- [55] Tshitoyan, V. et al. Unsupervised word embeddings capture latent knowledge from materials science literature. Nature 571, 95–98 (2019). URL https://www.nature.com/articles/s41586-019-1335-8.
- [56] Buehler, M. J. Generating 3D architectured nature-inspired materials and granular media using diffusion models based on language cues. Oxford Open Materials Science 2 (2022). URL https://academic.oup.com/ooms/article/2/1/itac010/6823542.
- [57] Buehler, M. J. Predicting mechanical fields near cracks using a progressive transformer diffusion model and exploration of generalization capacity. Journal of Materials Research 38, 1317–1331 (2023). URL https://link.springer.com/article/10.1557/s43578-023-00892-3.
- [58] Brinson, L. C. et al. Community action on FAIR data will fuel a revolution in materials research. MRS Bulletin 1–5 (2023). URL https://link.springer.com/article/10.1557/s43577-023-00498-4.
- [59] Stach, E. et al. Autonomous experimentation systems for materials development: A community perspective. Matter 4, 2702–2726 (2021).
- [60] networkx/networkx: Network Analysis in Python. URL https://github.com/networkx/networkx.
- [61] Bastian, M., Heymann, S. & Jacomy, M. Gephi: An open source software for exploring and manipulating networks (2009). URL http://www.aaai.org/ocs/index.php/ICWSM/09/paper/view/154.
- [62] Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research 13, 2498–2504 (2003).
Supplementary Information
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks
Markus J. Buehler
Laboratory for Atomistic and Molecular Mechanics
Center for Computational Science and Engineering
Schwarzman College of Computing
Massachusetts Institute of Technology
Cambridge, MA 02139, USA
mbuehler@MIT.EDU
<details>
<summary>x21.png Details</summary>

### Visual Description
## Network Diagram: Complex Interconnections
### Overview
The image is a network diagram illustrating complex interconnections between various nodes. The nodes are represented by colored circles, and the connections between them are shown as lines of corresponding colors. The diagram is densely populated with connections, indicating a high degree of interaction between the nodes.
### Components/Axes
* **Nodes:** Represented by colored circles. The size of the circles varies, suggesting different levels of importance or activity.
* **Edges:** Represented by colored lines connecting the nodes. The colors of the lines correspond to the colors of the nodes they connect, indicating the type or category of relationship.
* **Colors:** Multiple colors are used to differentiate between different groups or categories of nodes and their connections. The colors observed include red, orange, green, light blue, dark blue, pink, purple, and gray.
### Detailed Analysis
The network diagram consists of several clusters of nodes, each characterized by a dominant color. The connections within each cluster are dense, while the connections between clusters are less frequent.
* **Top Cluster:** Dominated by red, orange, and green connections. This cluster appears to be highly interconnected, with many nodes linked to each other.
* **Left Cluster:** Dominated by green and light blue connections. This cluster is also densely connected, with a central node having numerous connections to other nodes in the cluster.
* **Bottom Cluster:** Dominated by pink and light red connections. This cluster is less dense than the others, with fewer connections between the nodes.
* **Central Area:** A mix of all colors, indicating connections between all clusters.
The nodes vary in size. Some nodes are larger, suggesting they may be more central or influential within the network. The lines also vary in thickness, possibly indicating the strength or frequency of the connection.
### Key Observations
* The network is highly interconnected, with multiple clusters of nodes.
* The colors of the nodes and connections indicate different categories or types of relationships.
* The size of the nodes and the thickness of the lines may represent the importance or strength of the connections.
* There are a few larger nodes that appear to be central hubs within the network.
### Interpretation
The network diagram likely represents a complex system with multiple interacting components. The clusters of nodes may represent different groups or categories within the system, while the connections between them indicate the relationships and interactions between these groups. The larger nodes may represent key players or influential factors within the system.
The diagram suggests that the system is highly interconnected, with a high degree of interaction between the different components. This could indicate a complex and dynamic system, where changes in one part of the system can have ripple effects throughout the entire network.
Without additional context or information, it is difficult to determine the specific meaning of the diagram. However, the visual representation provides a valuable overview of the structure and relationships within the system.
</details>
Figure S1: Knowledge graph $\mathcal{G_{1}}$ after around 1,000 iterations, under a flexible self-exploration scheme initiated with the prompt Discuss an interesting idea in bio-inspired materials science.. In this visualization, nodes/edges are colored according to cluster ID.
<details>
<summary>x22.png Details</summary>

### Visual Description
## Network Diagram: Complex Interconnections
### Overview
The image is a network diagram displaying complex interconnections between numerous nodes. The nodes are represented by small, colored circles, and the connections between them are shown as curved lines of corresponding colors. The diagram exhibits a high degree of interconnectedness, with several clusters of nodes and a central hub.
### Components/Axes
* **Nodes:** Represented by small circles of various colors (e.g., green, pink, orange, blue, teal, gray).
* **Edges:** Represented by curved lines connecting the nodes, with colors matching the nodes they connect.
* **Central Hub:** A larger teal-colored node located near the center of the diagram, acting as a major point of convergence for many connections.
* **Clusters:** Several distinct clusters of nodes are visible, each characterized by a dominant color and a high density of connections within the cluster.
### Detailed Analysis
* **Node Distribution:** The nodes are not uniformly distributed. They are concentrated in several distinct clusters, with a higher density of nodes near the center of the diagram.
* **Edge Density:** The density of edges varies across the diagram. It is highest within the clusters and near the central hub, indicating a high degree of interaction between the nodes in these regions.
* **Color Distribution:** The colors of the nodes and edges appear to be distributed non-randomly, with certain colors being more prevalent in specific clusters.
* **Central Hub:** The central hub is teal, with many teal connections emanating from it.
* **Top-Left Cluster:** A cluster in the top-left region is predominantly green.
* **Top Cluster:** A cluster at the top is predominantly pink.
* **Bottom-Right Cluster:** A cluster in the bottom-right region is predominantly orange.
* **Bottom-Left Cluster:** A cluster in the bottom-left region is predominantly blue.
* **Edge Curvature:** The edges are curved, suggesting a non-linear relationship or interaction between the connected nodes. The curvature also adds to the visual complexity of the diagram.
### Key Observations
* **Centralization:** The presence of a central hub suggests a hierarchical structure or a central point of control within the network.
* **Clustering:** The distinct clusters indicate the presence of communities or groups of nodes that are more closely related to each other than to the rest of the network.
* **Color Coding:** The use of color coding suggests that the nodes and edges are categorized or classified based on some underlying attribute or property.
### Interpretation
The network diagram likely represents a complex system with multiple interacting components. The central hub may represent a key entity or process that influences the behavior of the entire system. The clusters may represent distinct functional modules or groups of related entities. The color coding may represent different types of entities, relationships, or activities within the system. Without additional context, it is difficult to determine the specific meaning of the nodes, edges, and colors. However, the diagram provides a visual representation of the system's structure and connectivity, which can be used to gain insights into its behavior and dynamics. The high degree of interconnectedness suggests that the system is highly resilient and adaptable, but also potentially vulnerable to cascading failures or disruptions.
</details>
Figure S2: Knowledge graph $\mathcal{G_{2}}$ after around 500 iterations, under a topic-specific self-exploration scheme initiated with the prompt Describe a way to design impact resistant materials. Nodes/edges are colored according to cluster ID.
<details>
<summary>x23.png Details</summary>

### Visual Description
## Histogram: Betweenness Centrality Distribution Across Iterations
### Overview
The image presents four histograms displaying the distribution of betweenness centrality of nodes at different iterations (2, 100, 510, and 1024). The x-axis represents betweenness centrality, and the y-axis represents the number of nodes on a logarithmic scale. The histograms show how the distribution changes as the iterations progress.
### Components/Axes
* **Titles:**
* Top-Left: "Betweenness Centrality Distribution (Iteration 2)"
* Top-Right: "Betweenness Centrality Distribution (Iteration 100)"
* Bottom-Left: "Betweenness Centrality Distribution (Iteration 510)"
* Bottom-Right: "Betweenness Centrality Distribution (Iteration 1024)"
* **X-axis:** "Betweenness Centrality"
* Top-Left: Scale from 0.0 to 0.7, with ticks at 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7
* Top-Right: Scale from 0.00 to 0.40, with ticks at 0.00, 0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40
* Bottom-Left: Scale from 0.00 to 0.35, with ticks at 0.00, 0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35
* Bottom-Right: Scale from 0.00 to 0.25, with ticks at 0.00, 0.05, 0.10, 0.15, 0.20, 0.25
* **Y-axis:** "Number of Nodes (Log Scale)"
* All plots: Logarithmic scale from 10<sup>0</sup> (1) to 10<sup>3</sup> (1000), with ticks at 10<sup>0</sup>, 10<sup>1</sup>, 10<sup>2</sup>, 10<sup>3</sup>. The top-right plot only goes to 10<sup>2</sup>.
* **Bars:** The histograms are represented by blue bars.
### Detailed Analysis
* **Iteration 2 (Top-Left):**
* A very high peak at approximately 0.0 betweenness centrality, reaching nearly 10<sup>1</sup> (10) nodes.
* A smaller peak at approximately 0.08 betweenness centrality, reaching approximately 3 nodes.
* Sparse nodes with betweenness centrality values around 0.2, 0.3, 0.4, 0.6, and 0.7, each with approximately 1 node.
* **Iteration 100 (Top-Right):**
* A high peak at approximately 0.00 betweenness centrality, reaching nearly 10<sup>3</sup> (1000) nodes.
* The number of nodes decreases rapidly as betweenness centrality increases.
* A small number of nodes with betweenness centrality values around 0.10, 0.17, 0.25, and 0.40, each with approximately 1 node.
* **Iteration 510 (Bottom-Left):**
* A very high peak at approximately 0.00 betweenness centrality, exceeding 10<sup>3</sup> (1000) nodes.
* The number of nodes decreases rapidly as betweenness centrality increases.
* A small number of nodes with betweenness centrality values around 0.06, 0.15, and 0.34, each with approximately 1 node.
* **Iteration 1024 (Bottom-Right):**
* A very high peak at approximately 0.00 betweenness centrality, exceeding 10<sup>3</sup> (1000) nodes.
* The number of nodes decreases rapidly as betweenness centrality increases.
* A small number of nodes with betweenness centrality values around 0.06 and 0.22, each with approximately 1 node.
### Key Observations
* The distribution of betweenness centrality is highly skewed towards 0.0 in all iterations.
* As the iteration number increases, the peak at 0.0 becomes more pronounced.
* The number of nodes with higher betweenness centrality values decreases significantly as the iteration number increases.
* The x-axis scale decreases as the iteration number increases.
### Interpretation
The histograms suggest that as the iterations progress, the network becomes increasingly centralized, with a large number of nodes having very low betweenness centrality. This indicates that a small number of nodes are becoming increasingly important in mediating connections between other nodes in the network. The decreasing x-axis scale and the increasing peak at 0.0 further support this interpretation. The network is evolving such that most nodes have very few connections to other nodes, while a few nodes act as hubs, connecting many other nodes.
</details>
Figure S3: Distribution of betweenness centrality across four iterations, $\mathcal{G_{1}}$ . The $y$ -axis is in log scale, indicating the number of nodes with a given centrality value. The evolution suggests a transition from an early centralized state to a more distributed knowledge structure in later iterations.
| Graph Utilization | 5/5 (Explicit use of graph-based insights for material selection and optimization) | 0/5 (No reference to graph data) |
| --- | --- | --- |
| Depth of Reasoning | 4/5 (Multi-step reasoning including AI/ML integration, degradation adaptability, and bioinspiration) | 3/5 (Structured approach but mostly standard materials science reasoning) |
| Scientific Rigor | 4/5 (Logical relationships between material properties, AI/ML, and impact resistance) | 4/5 (Follows well-established materials science principles with clear reasoning) |
| Innovativeness | 5/5 (Proposes unique ideas such as AI/ML-driven discovery, urban ecosystems, and adaptive materials) | 3/5 (Uses conventional material design strategies without novel insights) |
| Overall Score | 18/20 (Superior interdisciplinary and computational approach) | 10/20 (Limited to conventional material design strategies) |
Table S1: Comparison of Responses on Impact-Resistant Material Design with Annotated Scores. Response 1 achieves a higher score due to its interdisciplinary approach, explicit graph-based insights, and innovative integration of AI/ML.
Text Box S1: Answer of the graph-reasoning model to the task Describe a way to design impact resistant materials. In this case, we extract key insights from a knowledge graph using centrality measures, Louvain community detection, and key relationship identification, integrating structured graph-based context into LLM prompts to enhance reasoning and evaluate response quality.
Iterative Reasoning $i<N$ Iterative Reasoning $i<N$ Answer of the graph-reasoning model
⬇ Based on the knowledge graph insights, I will outline a design strategy for impact - resistant materials. ** Design Strategy:** 1. ** Material Selection and Optimization **: Utilize advanced materials science and machine learning algorithms to identify and optimize materials with inherent impact resistance properties. This can be achieved by analyzing the relationships between materials, their properties, and impact resistance. 2. ** Self - Healing Materials **: Incorporate self - healing materials into the design, which can repair cracks and damages autonomously. This can be achieved through the use of microcapsules, polymers, or other materials that can release healing agents in response to impact. 3. ** Adaptive Materials **: Design materials that can adapt to changing environmental conditions, such as temperature, humidity, or pressure. This can be achieved through the use of shape - memory alloys, polymers, or other materials that can change their properties in response to external stimuli. 4. ** Urban Ecosystems Integration **: Incorporate urban ecosystems into the design, which can provide a natural source of inspiration for impact - resistant materials. For example, studying the properties of urban algae farms or the self - healing properties of certain plants can inform the design of impact - resistant materials. 5. ** Artificial Intelligence (AI) and Machine Learning (ML) Integration **: Utilize AI and ML algorithms to analyze data from various sources, such as sensor data, material properties, and environmental conditions. This can help identify patterns and trends that can inform the design of impact - resistant materials. 6. ** Material Degradation and Adaptability **: Design materials that can adapt to material degradation over time. This can be achieved through the use of materials that can self - heal, self - repair, or self - renew. ** Key Materials and Technologies:** 1. ** Shape - Memory Alloys (SMAs)**: SMAs can be used to create adaptive materials that can change their properties in response to external stimuli. 2. ** Polymers **: Polymers can be used to create self - healing materials that can repair cracks and damages autonomously. 3. ** Graphene **: Graphene can be used to create ultra - strong and lightweight materials with high impact resistance. 4. ** Nanomaterials **: Nanomaterials can be used to create materials with enhanced impact resistance and self - healing properties. ** Design Considerations:** 1. ** Material Properties **: Consider the material properties, such as strength, stiffness, and toughness, when designing impact - resistant materials. 2. ** Environmental Conditions **: Consider the environmental conditions, such as temperature, humidity, and pressure, when designing impact - resistant materials. 3. ** Scalability **: Consider the scalability of the design, including the ability to manufacture and deploy the material at large scales. 4. ** Cost - Effectiveness **: Consider the cost - effectiveness of the design, including the cost of materials, manufacturing, and deployment. By following this design strategy, impact - resistant materials can be designed that are optimized for performance, scalability, and cost - effectiveness.
Text Box S2: Answer of the base model to the task Describe a way to design impact resistant materials.
Iterative Reasoning $i<N$ Iterative Reasoning $i<N$ Answer of the base model
⬇ Designing impact - resistant materials involves creating materials that can absorb and distribute the energy of an impact, reducing the likelihood of damage or failure. Here ’ s a step - by - step approach to designing impact - resistant materials: 1. ** Material Selection **: Choose a material with inherent properties that can absorb and dissipate energy, such as: * High - density materials (e. g., metals, ceramics) * Viscoelastic materials (e. g., rubber, polymers) * Energy - absorbing materials (e. g., foam, composites) 2. ** Microstructure Optimization **: Design the material ’ s microstructure to optimize energy absorption: * Create a hierarchical structure with multiple scales (e. g., nanoparticles, microparticles, macroscopic particles) * Incorporate defects or irregularities to increase the material ’ s energy - absorbing capacity * Use phase transformations or phase changes to absorb energy 3. ** Material Properties Engineering **: Engineer the material ’ s properties to enhance its impact resistance: * Increase the material ’ s stiffness and strength to resist deformation * Improve the material ’ s toughness by increasing its energy - absorbing capacity * Enhance the material ’ s viscoelastic properties to absorb and dissipate energy 4. ** Composite Materials **: Design composite materials that combine different materials to achieve enhanced impact resistance: * Use a matrix material (e. g., polymer, metal) to bind together reinforcement materials (e. g., fibers, particles) * Incorporate energy - absorbing materials (e. g., foam, rubber) into the composite * Use a hierarchical structure to optimize energy absorption 5. ** Testing and Validation **: Test and validate the material ’ s impact resistance using standardized tests (e. g., Charpy impact test, Izod impact test): * Measure the material ’ s energy absorption capacity * Evaluate the material ’ s ability to withstand different types of impacts (e. g., drop, impact, penetration) 6. ** Scaling and Optimization **: Scale up the material design and optimize it for specific applications: * Use computational models (e. g., finite element analysis, molecular dynamics) to simulate the material ’ s behavior under impact * Optimize the material ’ s design for specific applications (e. g., aerospace, automotive, sports equipment) Some examples of impact - resistant materials include: * Kevlar - reinforced polymers * Carbon fiber - reinforced polymers * High - density polyethylene (HDPE) foam * Rubber - based composites * Ceramic matrix composites By following these steps, designers can create impact - resistant materials that can withstand various types of impacts and reduce the risk of damage or failure.
<details>
<summary>x24.png Details</summary>

### Visual Description
## Line Charts: Network Analysis Metrics vs. Iteration
### Overview
The image presents three line charts, each displaying a different network analysis metric (Louvain Modularity, Average Shortest Path Length, and Diameter) plotted against the number of iterations. The charts are arranged horizontally, labeled (a), (b), and (c) respectively.
### Components/Axes
**Chart (a): Louvain Modularity vs. Iteration**
* **Title:** Louvain Modularity vs. Iteration
* **Y-axis:** Modularity, ranging from 0.2 to 0.7
* **X-axis:** Iteration, ranging from 0 to 500
* **Data Series:** A single magenta line representing the Louvain Modularity.
**Chart (b): Avg Shortest Path vs. Iteration**
* **Title:** Avg Shortest Path vs. Iteration
* **Y-axis:** Avg SPL (Average Shortest Path Length), ranging from 2.0 to 5.0
* **X-axis:** Iteration, ranging from 0 to 500
* **Data Series:** A single blue line representing the Average Shortest Path Length.
**Chart (c): Diameter vs. Iteration**
* **Title:** Diameter vs. Iteration
* **Y-axis:** Diameter, ranging from 4 to 16
* **X-axis:** Iteration, ranging from 0 to 500
* **Data Series:** A single red line representing the Diameter.
### Detailed Analysis
**Chart (a): Louvain Modularity vs. Iteration**
* **Trend:** The magenta line initially increases sharply, reaching a peak around iteration 100, then decreases slightly before stabilizing around a value of approximately 0.68.
* **Data Points:**
* Iteration 0: Modularity ≈ 0.22
* Iteration 100: Modularity ≈ 0.70
* Iteration 500: Modularity ≈ 0.69
**Chart (b): Avg Shortest Path vs. Iteration**
* **Trend:** The blue line initially increases sharply, reaching a peak around iteration 50, then decreases to a local minimum around iteration 250, before increasing again and stabilizing around a value of approximately 4.9.
* **Data Points:**
* Iteration 0: Avg SPL ≈ 1.7
* Iteration 50: Avg SPL ≈ 4.7
* Iteration 250: Avg SPL ≈ 4.0
* Iteration 500: Avg SPL ≈ 4.9
**Chart (c): Diameter vs. Iteration**
* **Trend:** The red line shows a step-wise increase, with plateaus at different diameter values. It increases rapidly in the beginning, then plateaus around 10, then increases again and plateaus around 13.
* **Data Points:**
* Iteration 0: Diameter ≈ 3
* Iteration 50: Diameter ≈ 10
* Iteration 300: Diameter ≈ 15
* Iteration 500: Diameter ≈ 13
### Key Observations
* The Louvain Modularity (magenta line) reaches a relatively stable value after an initial increase.
* The Average Shortest Path Length (blue line) shows more fluctuation, with an initial increase, a decrease, and then a final increase to a stable value.
* The Diameter (red line) increases in discrete steps, indicating changes in the network's overall size or connectivity.
### Interpretation
The charts illustrate how different network properties evolve as the Louvain community detection algorithm iterates. The Louvain Modularity, which measures the quality of the community structure, quickly reaches a high value, suggesting that the algorithm effectively identifies communities early on. The Average Shortest Path Length and Diameter, which reflect the network's connectivity and size, exhibit more complex behavior, indicating that the network's structure is still evolving even after the community structure has stabilized. The step-wise increase in diameter suggests that the network is growing or becoming more interconnected in discrete stages.
</details>
Figure S4: Evolution of key structural properties in the recursively generated knowledge graph ( $\mathcal{G_{2}}$ , focused on Describe a way to design impact resistant materials.): (a) Louvain modularity, showing stable community formation; (b) average shortest path length, highlighting efficient information propagation; and (c) graph diameter, demonstrating bounded hierarchical expansion.
<details>
<summary>x25.png Details</summary>

### Visual Description
## Chart Type: Multiple Line Graphs
### Overview
The image presents six line graphs arranged in a 2x3 grid, each plotting a different network metric against the iteration number. The metrics include Degree Assortativity, Global Transitivity, Max k-Core Index, Size of Largest k-Core, Avg Betweenness, and #Articulation Points. All graphs share a common x-axis labeled "Iteration" ranging from 0 to 500.
### Components/Axes
* **Overall Layout:** Six individual line graphs, labeled (a) through (f), arranged in two rows and three columns.
* **X-Axis (Common):** "Iteration", ranging from 0 to 500 in all six subplots.
* **(a) Degree Assortativity vs. Iteration:**
* Y-Axis: "Assortativity", ranging from -0.6 to 0.0.
* Data Series: Blue line.
* **(b) Global Transitivity vs. Iteration:**
* Y-Axis: "Transitivity", ranging from 0.00 to 0.20.
* Data Series: Green line.
* **(c) Max k-Core Index vs. Iteration:**
* Y-Axis: "Max k", ranging from 1 to 9.
* Data Series: Red line.
* **(d) Size of Largest k-Core vs. Iteration:**
* Y-Axis: "#Nodes in max k-core", ranging from 10 to 50.
* Data Series: Purple line.
* **(e) Avg Betweenness vs. Iteration:**
* Y-Axis: "Betweenness", ranging from 0.00 to 0.35.
* Data Series: Orange line.
* **(f) #Articulation Points vs. Iteration:**
* Y-Axis: "Count", ranging from 0 to 500.
* Data Series: Teal line.
### Detailed Analysis
* **(a) Degree Assortativity vs. Iteration (Blue Line):**
* Trend: Starts at approximately -0.7 at iteration 0, rapidly increases to around -0.1, and then stabilizes around -0.05 for the remaining iterations.
* Data Points:
* Iteration 0: -0.7
* Iteration 50: -0.1
* Iteration 500: -0.05
* **(b) Global Transitivity vs. Iteration (Green Line):**
* Trend: Starts high at approximately 0.20 at iteration 0, rapidly decreases to around 0.10, and then gradually decreases to approximately 0.09, where it stabilizes.
* Data Points:
* Iteration 0: 0.20
* Iteration 50: 0.14
* Iteration 250: 0.09
* Iteration 500: 0.09
* **(c) Max k-Core Index vs. Iteration (Red Line):**
* Trend: Starts at 1 at iteration 0, increases stepwise to 8 by iteration 50, remains at 8 until iteration 300, and then increases to 9, where it remains stable.
* Data Points:
* Iteration 0: 1
* Iteration 50: 8
* Iteration 300: 8
* Iteration 350: 9
* Iteration 500: 9
* **(d) Size of Largest k-Core vs. Iteration (Purple Line):**
* Trend: Starts low, increases in steps, plateaus, then increases again, and plateaus again.
* Data Points:
* Iteration 0: 5
* Iteration 100: 15
* Iteration 200: 40
* Iteration 250: 52
* Iteration 300: 26
* Iteration 400: 34
* Iteration 500: 44
* **(e) Avg Betweenness vs. Iteration (Orange Line):**
* Trend: Starts high at approximately 0.33 at iteration 0, rapidly decreases to near 0 by iteration 100, and remains close to 0 for the remaining iterations.
* Data Points:
* Iteration 0: 0.33
* Iteration 50: 0.02
* Iteration 500: 0.00
* **(f) #Articulation Points vs. Iteration (Teal Line):**
* Trend: Generally increases linearly from 0 to approximately 500 over the 500 iterations.
* Data Points:
* Iteration 0: 0
* Iteration 100: 100
* Iteration 200: 200
* Iteration 300: 300
* Iteration 400: 400
* Iteration 500: 500
### Key Observations
* Degree Assortativity stabilizes at a slightly negative value after an initial sharp increase.
* Global Transitivity decreases rapidly initially and then stabilizes at a low value.
* Max k-Core Index increases stepwise and plateaus at a maximum value of 9.
* Size of Largest k-Core increases in stages, showing periods of growth and stability.
* Avg Betweenness rapidly decreases to near zero.
* The number of Articulation Points increases almost linearly with the number of iterations.
### Interpretation
The graphs illustrate how various network properties change as the network evolves through iterations. The initial iterations seem to cause significant changes in assortativity, transitivity, betweenness, and k-core structure. The stabilization of assortativity and transitivity suggests that the network reaches a relatively stable configuration after the initial changes. The increasing k-core index and size indicate a strengthening of the network's core structure. The linear increase in articulation points suggests a continuous addition of critical nodes that connect different parts of the network. The rapid decrease in average betweenness implies that the network becomes more decentralized, with fewer nodes acting as critical intermediaries.
</details>
Figure S5: Evolution of graph properties over recursive iterations, highlighting the emergence of hierarchical structure, hub formation, and adaptive connectivity (Graph $\mathcal{G_{2}}$ , focused on Describe a way to design impact resistant materials.).
See pages - of discovery_sample.pdf
See pages - of compositional_1_20250217_140156.pdf
See pages - of compositional_2_20250217_140156_o1-pro.pdf
See pages - of proposal_1.pdf