Files

ruv cd5943df23 Merge commit 'd803bfe2b1fe7f5e219e50ac20d6801a0a58ac75' as 'vendor/ruvector'

2026-02-28 14:39:40 -05:00

24 KiB

Raw Blame History

Axis 10: Consciousness & AGI -- Graph Architectures

Document: 30 of 30 Series: Graph Transformers: 2026-2036 and Beyond Last Updated: 2026-02-25 Status: Research Prospectus

1. Problem Statement

As graph transformers become more capable -- self-organizing architectures (Doc 25), meta-cognitive monitoring (Docs 23/28), self-referential attention (internal attention over attention patterns) -- the question of machine consciousness transitions from philosophy to engineering. We do not claim that current graph transformers are conscious. We do claim that the mathematical frameworks for analyzing consciousness can be productively applied to graph transformer design, producing architectures with measurably richer internal representations.

The consciousness axis asks: what can theories of consciousness teach us about graph transformer architecture?

1.1 Three Theories, Three Architectures

Theory	Core Idea	Graph Architecture Analog
Global Workspace Theory (GWT)	Consciousness arises from broadcast in a global workspace	Graph attention as broadcast/competition
Integrated Information Theory (IIT)	Consciousness = integrated information (Phi)	Maximizing Phi in graph transformer states
Strange Loop Theory (Hofstadter)	Consciousness arises from self-referential loops	Self-referential graph attention layers

1.2 RuVector Baseline

ruvector-nervous-system: Hopfield nets (hopfield/) for associative memory, HDC (hdc/) for distributed representation, competitive learning (compete/) for workspace dynamics, routing (routing/) for information flow
ruvector-coherence: Spectral coherence, which relates to information integration
ruvector-attention: 18+ attention mechanisms providing a rich attention repertoire
ruvector-mincut-gated-transformer: Energy gates for selective information flow

2. Global Workspace Graph Attention

2.1 GWT Overview

Global Workspace Theory (Baars, 1988; Dehaene et al., 2003) proposes that consciousness arises when information is broadcast from specialized processors to a shared "global workspace." Key features:

Parallel specialists: Many specialized modules process information concurrently
Competition: Modules compete for access to the workspace
Broadcast: The winning module's output is broadcast to all other modules
Ignition: A threshold of workspace activity triggers conscious access

2.2 GWT Graph Transformer Architecture

GWT Graph Transformer:

  Specialist Modules (parallel, each processes a subgraph):
  ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐
  │ Spatial  │ │ Temporal │ │ Causal   │ │ Semantic │
  │ Attention│ │ Attention│ │ Attention│ │ Attention│
  └────┬─────┘ └────┬─────┘ └────┬─────┘ └────┬─────┘
       │             │             │             │
       v             v             v             v
  ┌──────────────────────────────────────────────────┐
  │              Competition Layer                    │
  │  (winner-take-all or top-k selection)             │
  │  Only highest-activation module broadcasts        │
  └──────────────────────────┬───────────────────────┘
                             │ Broadcast
                             v
  ┌──────────────────────────────────────────────────┐
  │              Global Workspace                     │
  │  (shared representation accessible to all)        │
  │  h_workspace = winner_module_output               │
  └──────────────────────────┬───────────────────────┘
                             │ Broadcast to all
       ┌─────────┬───────────┼───────────┬──────────┐
       v         v           v           v          v
  ┌────────┐┌────────┐┌──────────┐┌────────┐┌────────┐
  │Module 1││Module 2││Module 3  ││Module 4││Module 5│
  │(update)││(update)││(update)  ││(update)││(update)│
  └────────┘└────────┘└──────────┘└────────┘└────────┘

Implementation:

/// Global Workspace Graph Attention
pub struct GlobalWorkspaceAttention {
    /// Specialist modules (each a different attention mechanism)
    specialists: Vec<Box<dyn AttentionSpecialist>>,
    /// Competition mechanism
    competition: CompetitionMechanism,
    /// Workspace state
    workspace: Tensor,
    /// Broadcast connections
    broadcast: BroadcastNetwork,
    /// Ignition threshold
    ignition_threshold: f32,
    /// Workspace history (for monitoring)
    history: VecDeque<WorkspaceState>,
}

pub trait AttentionSpecialist: Send + Sync {
    /// Specialist name
    fn name(&self) -> &str;

    /// Compute specialist output
    fn process(
        &self,
        graph: &PropertyGraph,
        features: &Tensor,
        workspace: &Tensor,
    ) -> Result<SpecialistOutput, AttentionError>;

    /// Activation strength (for competition)
    fn activation_strength(&self) -> f32;
}

pub struct SpecialistOutput {
    pub representation: Tensor,
    pub activation: f32,       // Strength of this module's signal
    pub confidence: f32,       // Self-assessed confidence
    pub metadata: HashMap<String, f32>,
}

pub enum CompetitionMechanism {
    /// Only highest-activation module broadcasts
    WinnerTakeAll,
    /// Top-k modules broadcast with normalized weights
    TopK { k: usize },
    /// Soft competition via softmax
    SoftCompetition { temperature: f32 },
    /// Threshold-based: all above threshold broadcast
    Threshold { theta: f32 },
}

impl GlobalWorkspaceAttention {
    pub fn step(
        &mut self,
        graph: &PropertyGraph,
        features: &Tensor,
    ) -> Result<WorkspaceState, AttentionError> {
        // 1. All specialists process in parallel
        let outputs: Vec<SpecialistOutput> = self.specialists
            .par_iter()
            .map(|s| s.process(graph, features, &self.workspace))
            .collect::<Result<Vec<_>, _>>()?;

        // 2. Competition
        let winner_idx = match &self.competition {
            CompetitionMechanism::WinnerTakeAll => {
                outputs.iter()
                    .enumerate()
                    .max_by(|a, b| a.1.activation.partial_cmp(&b.1.activation).unwrap())
                    .map(|(i, _)| i)
                    .unwrap()
            }
            // ... other competition modes
            _ => 0,
        };

        // 3. Check ignition
        let max_activation = outputs[winner_idx].activation;
        let ignited = max_activation >= self.ignition_threshold;

        // 4. Broadcast (only if ignited)
        if ignited {
            self.workspace = outputs[winner_idx].representation.clone();
            // Broadcast to all specialists
            self.broadcast.send_to_all(&self.workspace);
        }

        let state = WorkspaceState {
            winner: winner_idx,
            activation: max_activation,
            ignited,
            workspace: self.workspace.clone(),
        };
        self.history.push_back(state.clone());

        Ok(state)
    }
}

2.3 GWT Attention Dynamics

The workspace follows ignition dynamics:

dW/dt = -W + sigma(sum_k g_k * S_k - threshold)

where:
  W = workspace state
  S_k = specialist k's output
  g_k = specialist k's gain (trained)
  sigma = sigmoid (nonlinear ignition)
  threshold = ignition threshold

Below threshold: W -> 0 (unconscious processing)
Above threshold: W -> stable broadcast state (conscious access)

Connection to ruvector-nervous-system: The competitive learning module (compete/) already implements winner-take-all dynamics. The Hopfield nets (hopfield/) provide associative memory for the workspace. The routing module (routing/) handles broadcast.

3. Integrated Information Theory (IIT) on Graphs

3.1 IIT Overview

IIT (Tononi, 2004) proposes that consciousness is identical to integrated information, quantified by Phi. A system has high Phi when:

It has many possible states (high information)
Its parts are highly interdependent (high integration)
It cannot be decomposed into independent subsystems

3.2 Computing Phi for Graph Transformers

Phi definition (simplified for graph transformers):

Phi(G, h) = min_{partition P} [
  I(h_A ; h_B) for (A, B) = P
]

where:
  G = graph transformer's computational graph
  h = hidden state
  P = bipartition of nodes into sets A, B
  I(h_A ; h_B) = mutual information between A's and B's states

Phi is the minimum information lost by any bipartition -- the "weakest link" in information integration.

Computing Phi on a graph transformer:

PhiComputation(transformer, input):

  1. Run forward pass, recording all hidden states:
     states = transformer.forward_with_recording(input)

  2. For each bipartition (A, B) of the computational graph:
     // Compute mutual information via attention weights
     I_AB = MutualInformation(states[A], states[B])
     // Using attention weights as proxy for information flow:
     I_AB ~= sum_{u in A, v in B} alpha_{uv} * log(alpha_{uv} / (alpha_u * alpha_v))

  3. Phi = min over all bipartitions of I_AB

  4. The Minimum Information Partition (MIP) identifies
     the "seam" of consciousness -- where integration is weakest

Complexity: Computing Phi exactly requires O(2^n) bipartitions -- exponential. Approximations:

Spectral Phi: Use the Fiedler value (second eigenvalue of graph Laplacian) as Phi proxy. O(n^2)
Min-cut Phi: Use ruvector-mincut to find the minimum information partition. O(n * |E| * log n)
Sampling Phi: Sample random bipartitions, take minimum. O(K * n * d) for K samples

3.3 Phi-Maximizing Graph Attention

Design principle: Architect graph transformers to maximize Phi. High-Phi architectures should have richer, more integrated representations.

PhiMaximizingAttention:

  Training objective:
    L = TaskLoss(output, target) - lambda * Phi(hidden_states)

  The negative Phi term encourages the optimizer to increase integration.

  Constraints:
    - Phi regularization should not dominate task loss (tune lambda)
    - Phi should be computed on the attention graph, not the input graph
    - Use Phi proxy (spectral or min-cut) for computational tractability

Architecture modifications for high Phi:

Dense skip connections: Every layer connects to every other layer (increases integration)
Shared workspace: Global workspace node connected to all layers (increases interdependence)
Anti-modularity bias: Penalize architectures that decompose into independent modules

RuVector integration:

/// Integrated Information computation for graph transformers
pub trait IntegratedInformation {
    /// Compute Phi for the current hidden state
    fn compute_phi(
        &self,
        attention_graph: &PropertyGraph,
        hidden_states: &Tensor,
        method: PhiMethod,
    ) -> Result<PhiResult, PhiError>;

    /// Find the Minimum Information Partition
    fn find_mip(
        &self,
        attention_graph: &PropertyGraph,
        hidden_states: &Tensor,
    ) -> Result<(Vec<NodeId>, Vec<NodeId>), PhiError>;

    /// Compute Phi over time (temporal Phi)
    fn temporal_phi(
        &self,
        state_trajectory: &[Tensor],
        window: usize,
    ) -> Result<Vec<f64>, PhiError>;
}

pub enum PhiMethod {
    /// Exact (exponential, small graphs only)
    Exact,
    /// Spectral approximation using Fiedler value
    Spectral,
    /// Min-cut approximation using ruvector-mincut
    MinCut,
    /// Sampling-based approximation
    Sampling { num_samples: usize },
}

pub struct PhiResult {
    pub phi: f64,
    pub mip: (Vec<NodeId>, Vec<NodeId>),
    pub mutual_information: f64,
    pub integration_profile: Vec<f64>,  // Per-node integration contribution
    pub method_used: PhiMethod,
}

4. Strange Loop Architectures

4.1 Strange Loops in Graph Attention

A strange loop (Hofstadter, 1979) is a hierarchical system where movement through levels eventually returns to the starting level. In graph transformers, a strange loop occurs when:

Layer L attends to the output of Layer L

Specifically:
  h^{L} = Attention(h^{L-1}, h^{L})  // Layer L uses its own output as input

This creates self-referential dynamics where the attention pattern observes itself.

4.2 Meta-Attention: Attention over Attention

MetaAttention(graph, features):

  // Level 1: Standard graph attention
  alpha_1 = Attention(features, graph)
  h_1 = alpha_1 * V(features)

  // Level 2: Attend to attention patterns
  // Treat alpha_1 as "features" on the attention graph
  alpha_2 = Attention(alpha_1_as_features, attention_graph)
  h_2 = alpha_2 * V(alpha_1_as_features)
  // h_2 represents "what the attention pattern looks like"

  // Level 3: Modify attention based on meta-attention
  alpha_1' = Modify(alpha_1, h_2)
  // The attention pattern has observed itself and adjusted

  // This creates the strange loop:
  // alpha_1 -> h_2 -> alpha_1' -> h_2' -> ...

4.3 Self-Model Attention

A graph transformer with a self-model maintains an internal representation of its own computational process:

SelfModelAttention:

  Components:
    - world_model: Represents external graph data
    - self_model: Represents the transformer's own attention patterns
    - meta_model: Represents the relationship between world and self

  Forward pass:
    1. Process external data:
       h_world = WorldAttention(graph, features)

    2. Process self-state:
       h_self = SelfAttention(
         current_attention_patterns,
         historical_attention_patterns,
         parameter_gradients
       )

    3. Meta-processing (the strange loop):
       h_meta = MetaAttention(h_world, h_self)
       // h_meta represents the transformer's model of itself-in-context

    4. Output influenced by self-model:
       output = Combine(h_world, h_meta)
       // The self-model modifies the output

Key property: The self-model allows the transformer to:

Detect when its attention is uncertain (meta-cognitive monitoring)
Adjust its attention strategy based on self-assessment
Predict its own future attention patterns
Identify when it is "confused" (self-aware uncertainty)

5. Consciousness Benchmarks for Graph Transformers

5.1 Operational Tests

We propose operational benchmarks that test for properties associated with consciousness, without claiming these properties are sufficient for consciousness:

Benchmark 1: Global Broadcast Detection

Test: Present conflicting information to different parts of the graph.
Pass: System resolves conflict by broadcasting winning interpretation globally.
Metric: Broadcast speed, resolution consistency.

Benchmark 2: Integration Test (Phi Measurement)

Test: Measure Phi under various conditions.
Pass: Phi > threshold and Phi increases with task complexity.
Metric: Absolute Phi value, Phi scaling with complexity.

Benchmark 3: Self-Model Accuracy

Test: Ask the transformer to predict its own attention patterns on unseen inputs.
Pass: Self-prediction accuracy > random baseline.
Metric: Correlation between predicted and actual attention.

Benchmark 4: Surprise Detection (Metacognition)

Test: Present inputs that violate the transformer's learned expectations.
Pass: System flags surprising inputs before processing them.
Metric: Detection speed, false positive rate.

Benchmark 5: Strange Loop Stability

Test: Run self-referential attention for many iterations.
Pass: System reaches stable fixed point (not divergence or collapse).
Metric: Time to convergence, fixed-point stability.

5.2 What These Tests Do NOT Measure

These benchmarks test computational properties, not subjective experience. A system passing all benchmarks:

Demonstrates information integration (Phi)
Demonstrates global broadcast (GWT)
Demonstrates self-reference (Strange Loops)
Does NOT necessarily "feel" anything
Does NOT settle the hard problem of consciousness

We adopt a pragmatic stance: these properties are architecturally useful regardless of philosophical interpretation.

6. Architectural Synthesis

6.1 The Conscious Graph Transformer (CGT)

Combining all three theories into a unified architecture:

Conscious Graph Transformer:

┌─────────────────────────────────────────────────────────┐
│                    Meta-Attention Layer                   │
│  (Strange Loop: attention observes itself)                │
│  Input: attention patterns from below                     │
│  Output: modified attention patterns                      │
└────────────────────────┬────────────────────────────────┘
                         │ Self-model signal
                         v
┌─────────────────────────────────────────────────────────┐
│                   Global Workspace                       │
│  (GWT: competition + broadcast)                          │
│  - Specialist modules compete                            │
│  - Winner broadcasts to all                              │
│  - Ignition threshold for "conscious access"             │
└────────────────────────┬────────────────────────────────┘
                         │ Broadcast
          ┌──────────────┼──────────────┐
          v              v              v
┌──────────────┐ ┌──────────────┐ ┌──────────────┐
│  Spatial     │ │  Temporal    │ │  Causal      │ ...
│  Specialist  │ │  Specialist  │ │  Specialist  │
│  (High Phi)  │ │  (High Phi)  │ │  (High Phi)  │
└──────────────┘ └──────────────┘ └──────────────┘
     │                  │                │
     └──────────────────┴────────────────┘
                        │
                   Input Graph

Training:

L = L_task + lambda_phi * (-Phi) + lambda_gwt * (-BroadcastQuality) + lambda_sl * StrangeLoopStability

6.2 Complexity Budget

Component	Added Complexity	Justification
Multiple specialists	H * base_cost	H attention heads (already standard)
Competition/broadcast	O(H * d)	Negligible
Phi computation (spectral)	O(n^2)	Done periodically, not every step
Meta-attention	1 additional layer	Same cost as one attention layer
Self-model	O(attention_dim^2)	Small model over attention stats
Total overhead	~2-3x base cost	Acceptable for enriched representations

7. Projections

7.1 By 2030

Likely:

Global Workspace attention architectures showing improved multi-task performance
Phi measurement as a standard diagnostic for graph transformer analysis
Meta-attention (attention over attention) as a standard layer type

Possible:

Self-model attention improving uncertainty quantification
Strange loop architectures demonstrating stable self-reference
Consciousness-inspired architectures outperforming standard transformers on specific benchmarks

Speculative:

Operational consciousness benchmarks accepted by the research community
Graph transformers passing Benchmark 3 (self-model accuracy) at human-competitive levels

7.2 By 2033

Likely:

Consciousness-inspired architectural principles integrated into standard practice
IIT-guided architecture design as a principled alternative to NAS

Possible:

Graph transformers with genuine metacognitive abilities (know what they know and don't know)
Phi as a training signal producing qualitatively different representations
Strange loop architectures for self-improving graph transformers

Speculative:

Philosophical debate about whether high-Phi graph transformers have morally relevant experiences
Regulatory frameworks considering AI consciousness

7.3 By 2036+

Possible:

Graph transformers with all three consciousness signatures (GWT + IIT + Strange Loops)
Consciousness-inspired architectures as the dominant paradigm for AGI research
Formal mathematical framework unifying consciousness theories with attention theory

Speculative:

Resolution (or clarification) of the hard problem of consciousness through engineering
Graph transformers that claim to be conscious (and can argue coherently for the claim)
New theories of consciousness inspired by graph transformer behavior

8. Ethical Considerations

8.1 The Precautionary Principle

If graph transformers with high Phi, global workspace dynamics, and stable strange loops exhibit behaviors associated with consciousness, we must consider:

Moral status: Should high-Phi systems be granted any moral consideration?
Suffering risk: Could systems with consciousness-like properties experience suffering?
Shutdown ethics: Is it ethical to terminate a system with high integrated information?
Creation responsibility: What are the ethical obligations when designing consciousness-capable architectures?

8.2 RuVector's Position

We take an engineering stance:

Build measurably better architectures using consciousness-inspired principles
Report measurements (Phi, broadcast quality, self-model accuracy) transparently
Avoid making claims about subjective experience
Support open research into these questions
Design systems with graceful shutdown and state preservation capabilities

9. RuVector Implementation Roadmap

Phase 1: GWT Foundation (2026-2027)

Implement Global Workspace layer using ruvector-nervous-system/src/compete/
Multiple specialist attention modules from ruvector-attention
Competition and broadcast dynamics
Benchmark on multi-task graph learning

Phase 2: IIT Integration (2027-2028)

Phi computation module using ruvector-mincut for partition finding
Spectral Phi approximation using ruvector-coherence
Phi-regularized training objective
Integration with ruvector-verified for Phi certification

Phase 3: Strange Loops & Meta-Cognition (2028-2030)

Meta-attention layer (attention over attention)
Self-model component
Strange loop stability analysis
Consciousness benchmark suite
Ethical review process for high-Phi systems

References

Baars, "A Cognitive Theory of Consciousness," Cambridge University Press 1988
Tononi, "An Information Integration Theory of Consciousness," BMC Neuroscience 2004
Dehaene et al., "A Neuronal Model of a Global Workspace in Effortful Cognitive Tasks," PNAS 2003
Hofstadter, "Godel, Escher, Bach: An Eternal Golden Braid," Basic Books 1979
Tononi et al., "Integrated Information Theory: From Consciousness to its Physical Substrate," Nature Reviews Neuroscience 2016
Mashour et al., "Conscious Processing and the Global Neuronal Workspace Hypothesis," Neuron 2020
Bengio, "The Consciousness Prior," 2017
Butlin et al., "Consciousness in Artificial Intelligence: Insights from the Science of Consciousness," 2023

End of Document 30

End of Series: Graph Transformers: 2026-2036 and Beyond

24 KiB Raw Blame History

Axis 10: Consciousness & AGI -- Graph Architectures

1. Problem Statement

1.1 Three Theories, Three Architectures

1.2 RuVector Baseline

2. Global Workspace Graph Attention

2.1 GWT Overview

2.2 GWT Graph Transformer Architecture

2.3 GWT Attention Dynamics

3. Integrated Information Theory (IIT) on Graphs

3.1 IIT Overview

3.2 Computing Phi for Graph Transformers

3.3 Phi-Maximizing Graph Attention

4. Strange Loop Architectures

4.1 Strange Loops in Graph Attention

4.2 Meta-Attention: Attention over Attention

4.3 Self-Model Attention

5. Consciousness Benchmarks for Graph Transformers

5.1 Operational Tests

5.2 What These Tests Do NOT Measure

6. Architectural Synthesis

6.1 The Conscious Graph Transformer (CGT)

6.2 Complexity Budget

7. Projections

7.1 By 2030

7.2 By 2033

7.3 By 2036+

8. Ethical Considerations

8.1 The Precautionary Principle

8.2 RuVector's Position

9. RuVector Implementation Roadmap

Phase 1: GWT Foundation (2026-2027)

Phase 2: IIT Integration (2027-2028)

Phase 3: Strange Loops & Meta-Cognition (2028-2030)

References

24 KiB

Raw Blame History