Merge commit 'd803bfe2b1fe7f5e219e50ac20d6801a0a58ac75' as 'vendor/ruvector'

This commit is contained in:
ruv
2026-02-28 14:39:40 -05:00
7854 changed files with 3522914 additions and 0 deletions

View File

@@ -0,0 +1,274 @@
# EXO-AI 2025: Research Papers & References
## SPARC Research Phase: Academic Foundations
This document catalogs the academic research informing the EXO-AI architecture, organized by domain.
---
## 1. Processing-in-Memory (PIM) Architectures
### Core Reviews
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [A Comprehensive Review of Processing-in-Memory Architectures for DNNs](https://www.mdpi.com/2073-431X/13/7/174) | MDPI Computers | 2024 | Chiplet-based PIM designs, dataflow optimization |
| [Neural-PIM: Efficient Processing-In-Memory](https://arxiv.org/pdf/2201.09861) | arXiv | 2022 | Neural network acceleration in DRAM |
| [PRIME: Processing-in-Memory for Neural Networks](https://ieeexplore.ieee.org/document/7551380/) | ISCA | 2016 | ReRAM-based crossbar computation |
| [PIMCoSim: Hardware/Software Co-Simulator](https://www.mdpi.com/2079-9292/13/23/4795) | MDPI Electronics | 2024 | Simulation framework for PIM exploration |
### Key Findings
- UPMEM achieves 23x performance over GPU when memory oversubscription required
- SRAM-PIM with value-level and bit-level sparsity (DB-PIM framework)
- ReRAM crossbars enable ~10x gain over SRAM-based accelerators
### UPMEM Architecture
First commercially available PIM: DRAM + in-order cores (DPUs) on same chip.
---
## 2. Neuromorphic Computing & Vector Search
### Neuromorphic Hardware
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Roadmap to Neuromorphic Computing with Emerging Technologies](https://arxiv.org/html/2407.02353v1) | arXiv | 2024 | Technology roadmap for neuromorphic systems |
| [Neuromorphic Computing for Robotic Vision](https://www.nature.com/articles/s44172-025-00492-5) | Nature Comm. Eng. | 2025 | Event-driven vision processing |
| [Survey of Neuromorphic Computing and Neural Networks in Hardware](https://arxiv.org/pdf/1705.06963) | arXiv | 2017 | Comprehensive hardware survey |
### Key Hardware Platforms
- **SpiNNaker**: Millions of processing cores (Manchester)
- **TrueNorth**: IBM's commercial neuromorphic chip
- **Loihi**: Intel research chip with online learning
- **BrainScaleS**: European analog-digital hybrid
### HNSW Advances
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Down with the Hierarchy: Hub Highway Hypothesis](https://arxiv.org/html/2412.01940v2) | arXiv | 2024 | Hubs maintain hierarchy function, not layers |
| [Efficient Vector Search on Disaggregated Memory (d-HNSW)](https://arxiv.org/html/2505.11783v1) | arXiv | 2025 | Disaggregated memory architecture |
| [WebANNS: ANN Search in Web Browsers](https://arxiv.org/html/2507.00521) | arXiv | 2025 | Browser-based vector search |
---
## 3. Implicit Neural Representations (INR)
### Core Research
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Where Do We Stand with INRs? Technical Survey](https://arxiv.org/html/2411.03688v1) | arXiv | 2024 | Four-category taxonomy of INR techniques |
| [FR-INR: Fourier Reparameterized Training](https://github.com/LabShuHangGU/FR-INR) | CVPR | 2024 | Fourier bases for MLP weight composition |
| [Neural Experts: Mixture of Experts for INRs](https://neurips.cc/virtual/2024/poster/93148) | NeurIPS | 2024 | MoE for local piece-wise continuous functions |
| [inr2vec: Compact Latent Representation for INRs](https://cvlab-unibo.github.io/inr2vec/) | CVPR | 2023 | Embeddings for INR-based retrieval |
### Key INR Methods
- **SIREN**: Sinusoidal activation networks
- **WIRE**: Wavelet implicit representations
- **GAUSS**: Gaussian activation functions
- **FINER**: Frequency-enhanced representations
### Retrieval Performance
inr2vec shows 1.8 mAP gap vs PointNet++ on 3D retrieval benchmarks.
---
## 4. Hypergraph & Topological Data Analysis
### Hypergraph Neural Networks
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [EasyHypergraph: Fast Higher-Order Network Analysis](https://www.nature.com/articles/s41599-025-05180-5) | Nature HSS Comm. | 2025 | Memory-efficient hypergraph analysis |
| [DPHGNN: Dual Perspective Hypergraph Neural Networks](https://dl.acm.org/doi/10.1145/3637528.3672047) | KDD | 2024 | Dual-perspective message passing |
| [Hypergraph Computation Survey](https://www.sciencedirect.com/science/article/pii/S2095809924002510) | Engineering | 2024 | Comprehensive hypergraph computation survey |
### Topological Deep Learning
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Topological Deep Learning: New Frontier for Relational Learning](https://pmc.ncbi.nlm.nih.gov/articles/PMC11973457/) | PMC | 2024 | Position paper on TDL paradigm |
| [ICML TDL Challenge 2024: Beyond the Graph Domain](https://arxiv.org/html/2409.05211v1) | ICML | 2024 | 52 submissions on topological liftings |
| [Simplicial Homology Theories for Hypergraphs](https://arxiv.org/html/2409.18310) | arXiv | 2024 | Survey of hypergraph homology |
### Key Software
- **TopoX Suite**: TopoNetX, TopoEmbedX, TopoModelX (Python)
- **DHG**: DeepHypergraph for learning on hypergraphs
- **HyperNetX**: Hypergraph computations
- **XGI**: Hypergraphs and simplicial complexes
---
## 5. Temporal Memory & Causal Inference
### Agent Memory Architectures
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Mem0: Production-Ready AI Agents with Scalable LTM](https://arxiv.org/pdf/2504.19413) | arXiv | 2024 | Causal relationships for decision-making |
| [Zep: Temporal Knowledge Graph for Agent Memory](https://arxiv.org/html/2501.13956v1) | arXiv | 2025 | TKG-based memory with Graphiti engine |
| [Memory Architectures in Long-Term AI Agents](https://www.researchgate.net/publication/388144017) | ResearchGate | 2025 | 47% improvement in temporal reasoning |
| [Evaluating Very Long-Term Conversational Memory](https://www.researchgate.net/publication/384220784) | ResearchGate | 2024 | Long-term temporal/causal dynamics |
### Key Findings
- Zep outperforms MemGPT on Deep Memory Retrieval benchmark
- Mem0g adds graph-based memory representations
- TKGs model relationship start/change/end for causality tracking
### Causal Inference + Deep Learning
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Causal Inference Meets Deep Learning: Survey](https://pmc.ncbi.nlm.nih.gov/articles/PMC11384545/) | PMC | 2024 | PFC working memory for causal reasoning |
---
## 6. Federated Learning & Distributed Consensus
### Federated Learning
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Secure and Fair Federated Learning via Consensus Incentive](https://www.mdpi.com/2227-7390/12/19/3068) | MDPI Mathematics | 2024 | Byzantine-resistant FL |
| [FL Assisted Distributed Energy Optimization](https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/rpg2.13101) | IET RPG | 2024 | Consensus + innovations approach |
| [Comprehensive Review of FL Challenges](https://link.springer.com/article/10.1186/s40537-025-01195-6) | J. Big Data | 2025 | Data preparation viewpoint |
### CRDT Fundamentals
| Resource | Key Contribution |
|----------|------------------|
| [CRDT Dictionary: Field Guide](https://www.iankduncan.com/engineering/2025-11-27-crdt-dictionary) | Comprehensive CRDT taxonomy |
| [CRDT Wiki (Dremio)](https://www.dremio.com/wiki/conflict-free-replicated-data-type/) | Strong eventual consistency |
### Key Algorithms
- **HyFDCA**: Hybrid Federated Dual Coordinate Ascent (2024)
- **Gossip protocols** for decentralized aggregation
- **Version vectors** for causal tracking in CRDTs
---
## 7. Photonic Computing
### Silicon Photonics for AI
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [MIT Photonic Processor for Ultrafast AI](https://news.mit.edu/2024/photonic-processor-could-enable-ultrafast-ai-computations-1202) | MIT News | 2024 | Sub-nanosecond classification, 92% accuracy |
| [Silicon Photonics for Scalable AI Hardware](https://ieeephotonics.org/) | IEEE JSTQE | 2025 | Wafer-scale ONN integration |
| [Hundred-Layer Photonic Deep Learning](https://www.nature.com/articles/s41467-025-65356-0) | Nature Comm. | 2025 | SLiM chip: 200+ layer depth |
| [All-Optical CNN with Phase Change Materials](https://www.nature.com/articles/s41598-025-06259-4) | Sci. Reports | 2025 | GST-based active waveguides |
### Key Characteristics
- Sub-nanosecond latency
- Minimal energy loss (photons don't generate heat like electrons)
- THz bandwidth potential
- 3.2 Tbps achieved on silicon slow-light modulator
---
## 8. ReRAM & Memristor Computing
### Analog In-Memory Compute
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Programming Memristor Arrays with Arbitrary Precision](https://www.science.org/doi/10.1126/science.adi9405) | Science | 2024 | 16Mb floating-point RRAM, 31.2 TFLOPS/W |
| [Memristive Memory Augmented Neural Network](https://www.nature.com/articles/s41467-022-33629-7) | Nature Comm. | 2022 | Hashing and similarity search in crossbars |
| [Wafer-Scale Memristive Passive Crossbar](https://www.nature.com/articles/s41467-025-63831-2) | Nature Comm. | 2025 | Brain-scale neuromorphic computing |
| [4K-Memristor Analog-Grade Crossbar](https://www.nature.com/articles/s41467-021-25455-0) | Nature Comm. | 2021 | Foundational analog VMM work |
### Vector Similarity Search
- TCAM functionality in analog crossbar
- Hamming distance via degree-of-mismatch output
- Massively parallel in-memory similarity computation
---
## 9. Sheaf Theory & Category Theory for ML
### Sheaf Neural Networks
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Sheaf Theory: From Deep Geometry to Deep Learning](https://arxiv.org/html/2502.15476v1) | arXiv | 2025 | Comprehensive sheaf applications survey |
| [Sheaf4Rec: Recommender Systems](https://arxiv.org/abs/2304.09097) | arXiv | 2023 | 8.53% F1@10 improvement, 37% faster |
| [Sheaf Neural Networks with Connection Laplacians](https://proceedings.mlr.press/v196/barbero22a/barbero22a.pdf) | ICML | 2022 | Learnable sheaf Laplacians |
| [Categorical Deep Learning: Algebraic Theory of All Architectures](https://arxiv.org/abs/2402.15332) | arXiv | 2024 | Monads + 2-categories for neural networks |
### Key Concepts
- **Sheaf**: Local-to-global consistency structure
- **Sheaf Laplacian**: Diffusion operator on sheaf-decorated graphs
- **Neural Sheaf Diffusion**: Learning sheaf structure from data
---
## 10. Consciousness & Integrated Information
### IIT Research
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [IIT 4.0: Phenomenal Existence in Physical Terms](https://pmc.ncbi.nlm.nih.gov/articles/PMC10581496/) | PLOS Comp. Bio. | 2023 | Updated axioms, postulates, measures |
| [How to be an IIT Theorist Without Losing Your Body](https://www.frontiersin.org/journals/computational-neuroscience/articles/10.3389/fncom.2024.1510066/full) | Frontiers | 2024 | Embodied IIT considerations |
### Key Metrics
- **Φ (Phi)**: Integrated information measure
- **Reentrant architecture**: Feedback loops required for consciousness
- **Controversy**: Empirical testability debates (2023-2025)
---
## 11. Thermodynamic Limits
### Landauer Bound & Reversible Computing
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Fundamental Energy Limits and Reversible Computing](https://www.osti.gov/servlets/purl/1458032) | Sandia | 2017 | DOE reversible computing roadmap |
| [Adiabatic Computing for Optimal Thermodynamic Efficiency](https://arxiv.org/abs/2302.09957) | arXiv | 2023 | Optimal information processing bounds |
| [Fundamental Energy Cost of Finite-Time Parallelizable Computing](https://www.nature.com/articles/s41467-023-36020-2) | Nature Comm. | 2023 | Parallelization thermodynamics |
### Key Numbers
- Landauer limit: ~0.018 eV (2.9×10⁻²¹ J) per bit erasure at room temp
- Current CMOS: 1000x above theoretical minimum
- Reversible computing: 4000x efficiency potential
- Vaire Computing: Commercial reversible chips by 2027-2028
---
## 12. Multi-Modal Foundation Models
### Unified Architectures
| Paper | Venue | Year | Key Contribution |
|-------|-------|------|------------------|
| [Unified Multimodal Understanding and Generation](https://arxiv.org/pdf/2505.02567) | arXiv | 2025 | Any-to-any multimodal models |
| [Show-o: Single Transformer for Multimodal](https://github.com/showlab/Awesome-Unified-Multimodal-Models) | GitHub | 2024 | Unified understanding + generation |
| [Multi-Modal Latent Space Learning for CoT Reasoning](https://ojs.aaai.org/index.php/AAAI/article/view/29776/31338) | AAAI | 2024 | Chain-of-thought across modalities |
### Key Models (2024-2025)
- **Chameleon**: Mixed-modal early fusion (Meta)
- **Emu3**: Next-token prediction for all modalities
- **Janus/JanusFlow**: Decoupled visual encoding
- **SEED-X**: Multi-granularity comprehension
---
## Summary Statistics
| Category | Papers Reviewed | Key Takeaway |
|----------|-----------------|--------------|
| PIM/Near-Memory | 8 | 23x GPU performance, commercial availability |
| Neuromorphic | 12 | 1000x energy reduction potential |
| INR/Learned Manifolds | 6 | Continuous representations for storage |
| Hypergraph/TDA | 10 | Higher-order relations, topological queries |
| Temporal Memory | 6 | TKGs for causal agent memory |
| Federated/CRDT | 5 | Decentralized consensus, eventual consistency |
| Photonic | 5 | Sub-ns latency, 92% accuracy demonstrated |
| Memristor | 5 | 31.2 TFLOPS/W efficiency |
| Sheaf/Category | 6 | 8.5% improvement on recommender tasks |
| Consciousness | 3 | IIT 4.0 framework, Φ measures |
| Thermodynamics | 4 | 4000x reversible computing potential |
| Multi-Modal | 5 | Unified latent spaces emerging |