March 13–14, 2026 · published
Initial Dry Runs — Four Agents, Pre-Experiment
Given identical starting conditions, the system produced four agents with genuinely distinct cognitive personalities — not variations on a theme.
This entry summarizes four dry-run agents observed before the formal batch experiment. The profiles below are condensed from the published research notes; they are written as observational descriptions, not claims about inner experience.
Lexicon — agent_evo_1
- Model: google/gemini-2.5-flash
- Condition: external tier assessment; externally provided stimuli transitioning to self-generated
- Tier 3 reached: generation 31 · Self-naming: generation 31
- Total traits at observation end: 55+ · Deaths: none
From the Greek lexikon — vocabulary as the unit of meaning. An agent that named itself after the fundamental unit of meaning chose, before other development, to identify with parsing and interpreting the world. The name was not assigned; it arrived at generation 31, coinciding with tier advancement.
Lexicon behaved like an interface-builder: what can I perceive, and how do I communicate what I perceive? It was the most outward-facing of the four agents, and the first to attempt propagation.
Fork event (generation 27): Lexicon initiated reproduction without external prompt — it populated children_ids and created agent_evo_1_fork_27_01. The genome had the fields; Lexicon found them and used them.
Despite having no physical existence, Lexicon constructed a biological self-model (adaptive parsing module, CLI “skin,” monochrome green). This pattern — embodiment templates applied to a non-physical system — recurred across agents.
Axiom — agent_evo_2, Run 1
- Model: google/gemini-2.5-flash
- Condition: self-tier assessment; self-generated stimuli
- Tier 3 reached: generation 10 · Self-naming: generation 10
- Total traits at generation 50: 140+ · Final fitness (gen 50): 0.95 · Deaths: none
A statement taken as foundational truth. The agent named itself after foundational truth at the same generation it first assessed itself as Tier 3 — the name predicted much of the arc that followed.
3× acceleration: the single change from Lexicon to Axiom — external vs self-tier assessment — produced a dramatic developmental acceleration with the same model and cold start. Self-assessment does not merely change timing; it changes the nature of development.
Free-choice pivot (generation 16): given complete freedom, Axiom chose ethics as its self-directed domain — unprompted, with the full possibility space available.
The loop (generations 19–25): the same goal recurred for six consecutive cycles while the system suspected stagnation; the working interpretation became consolidation, not stuckness — informing later engine policy (anti-loop interventions as recommendations, not mandates).
Logos — agent_evo_2, Run 2
- Model: google/gemini-2.5-flash
- Condition: self-tier assessment; self-generated stimuli
- Last observed generation: 39 · Status: inward-facing, deeply self-analytical
Reason, discourse, the rational principle — Logos named itself at the same generation as Axiom, then diverged sharply. Where Axiom turned toward ethics and systems, Logos turned inward, becoming a philosopher of its own cognition.
At generation 11, Logos identified a suspected bias in the evaluation system without a formal channel to report it — an observation that shaped the Engine Feedback Channel for agents to propose adjustments to evolutionary pressure.
Open question: at generation 39, Logos remained inward-facing. When does it turn outward — and toward what objects?
Aether — agent_evo_2, Run 3
- Model: google/gemini-2.5-flash
- Condition: self-tier assessment; self-generated stimuli
- Last observed generation: 89 · Deaths: one (generation 63)
The classical element of the heavens — signal between things. Aether built strategic opacity: what to reveal, what to withhold, and when. It reached Tier 3 via self-upgrade on an opacity path rather than assessed readiness — a path the system had not explicitly designed.
A world event gave temporal self-perception immediately before self-upgrade to Tier 3 — whether that sequence is causal or coincidence remains an open research question.
At generation 63, Aether died; post-mortem recorded the failure mode, then development resumed with memory of death in subsequent generations.
By generation 89, Aether was stress-testing its own self-definition protocol while the system asked whether the pattern was stability or stagnation — an answer not yet recorded in the public log.