Files
wifi-densepose/docs/adr/ADR-031-rvf-example-repository.md
ruv d803bfe2b1 Squashed 'vendor/ruvector/' content from commit b64c2172
git-subtree-dir: vendor/ruvector
git-subtree-split: b64c21726f2bb37286d9ee36a7869fef60cc6900
2026-02-28 14:39:40 -05:00

174 lines
7.1 KiB
Markdown

# ADR-031: RVF Example Repository — 24 Demonstrations Across Four Categories
- **Status**: Accepted
- **Date**: 2026-02-14
- **Supersedes**: None
- **Related**: ADR-029 (RVF Canonical Format), ADR-030 (Computational Container)
## Context
RVF (RuVector Format) is the unified agentic AI format — storage, transfer, and cognitive runtime in one file. The existing six examples (`basic_store`, `progressive_index`, `quantization`, `wire_format`, `crypto_signing`, `filtered_search`) demonstrate core storage and indexing features but do not cover:
- Agentic AI patterns (agent memory, swarm coordination, reasoning traces)
- Practical production patterns (RAG, recommendations, caching, deduplication)
- Vertical domain applications (genomics, finance, medical, legal)
- Exotic capabilities (quantum state, neuromorphic search, self-booting, eBPF)
- Runtime targets (browser/WASM, edge/IoT, serverless, ruvLLM inference)
Without concrete examples, users cannot discover or adopt the full scope of RVF.
## Decision
Create 24 new runnable examples organized into four categories, plus a cross-cutting runtime-targets group. Each example is a standalone `fn main()` in `examples/rvf/examples/` with inline documentation explaining the pattern.
### Category A: Agentic AI (6 examples)
| # | Example | File | What It Demonstrates |
|---|---------|------|---------------------|
| A1 | Agent Memory | `agent_memory.rs` | Persistent agent memory with witness audit trails, session recall |
| A2 | Swarm Knowledge | `swarm_knowledge.rs` | Multi-agent shared knowledge base with concurrent writes |
| A3 | Reasoning Trace | `reasoning_trace.rs` | Store chain-of-thought reasoning with lineage derivation |
| A4 | Tool Cache | `tool_cache.rs` | Cache tool call results with metadata filters and TTL |
| A5 | Agent Handoff | `agent_handoff.rs` | Transfer agent state between instances via RVF file |
| A6 | Experience Replay | `experience_replay.rs` | RL-style experience replay buffer with priority sampling |
### Category B: Practical Production (5 examples)
| # | Example | File | What It Demonstrates |
|---|---------|------|---------------------|
| B1 | Semantic Search | `semantic_search.rs` | Document search engine with metadata-filtered k-NN |
| B2 | Recommendation Engine | `recommendation.rs` | Item recommendations with collaborative filtering embeddings |
| B3 | RAG Pipeline | `rag_pipeline.rs` | Retrieval-augmented generation: chunk, embed, retrieve, rerank |
| B4 | Embedding Cache | `embedding_cache.rs` | LRU embedding cache with temperature tiering and eviction |
| B5 | Dedup Detector | `dedup_detector.rs` | Near-duplicate detection with threshold-based clustering |
### Category C: Vertical Domains (4 examples)
| # | Example | File | What It Demonstrates |
|---|---------|------|---------------------|
| C1 | Genomic Pipeline | `genomic_pipeline.rs` | DNA k-mer embeddings with `.rvdna` domain profile and lineage |
| C2 | Financial Signals | `financial_signals.rs` | Market signal embeddings with TEE attestation witness chains |
| C3 | Medical Imaging | `medical_imaging.rs` | Radiology embedding search with `.rvvis` profile |
| C4 | Legal Discovery | `legal_discovery.rs` | Legal document similarity with `.rvtext` profile and audit trails |
### Category D: Exotic Capabilities (5 examples)
| # | Example | File | What It Demonstrates |
|---|---------|------|---------------------|
| D1 | Self-Booting Service | `self_booting.rs` | RVF with embedded unikernel that boots as a microservice |
| D2 | eBPF Accelerator | `ebpf_accelerator.rs` | eBPF hot-path acceleration for sub-microsecond lookups |
| D3 | Hyperbolic Taxonomy | `hyperbolic_taxonomy.rs` | Hierarchy-aware search in hyperbolic space |
| D4 | Multi-Modal Fusion | `multimodal_fusion.rs` | Text + image embeddings in one RVF file with cross-modal search |
| D5 | Sealed Cognitive Engine | `sealed_engine.rs` | Full cognitive engine: vectors + LoRA + GNN + kernel + witness chain |
### Category E: Runtime Targets (4 examples)
| # | Example | File | What It Demonstrates |
|---|---------|------|---------------------|
| E1 | Browser WASM | `browser_wasm.rs` | Client-side vector search via 5.5 KB WASM microkernel |
| E2 | Edge IoT | `edge_iot.rs` | Constrained device with rvlite-style minimal API |
| E3 | Serverless Function | `serverless_function.rs` | Cold-start optimized RVF for Lambda/Cloud Functions |
| E4 | ruvLLM Inference | `ruvllm_inference.rs` | LLM KV cache + LoRA adapter management backed by RVF |
## Implementation
### File Organization
```
examples/rvf/
Cargo.toml # Updated with 24 new [[example]] entries
examples/
# Existing (6)
basic_store.rs
progressive_index.rs
quantization.rs
wire_format.rs
crypto_signing.rs
filtered_search.rs
# Agentic (6)
agent_memory.rs
swarm_knowledge.rs
reasoning_trace.rs
tool_cache.rs
agent_handoff.rs
experience_replay.rs
# Practical (5)
semantic_search.rs
recommendation.rs
rag_pipeline.rs
embedding_cache.rs
dedup_detector.rs
# Vertical (4)
genomic_pipeline.rs
financial_signals.rs
medical_imaging.rs
legal_discovery.rs
# Exotic (5)
self_booting.rs
ebpf_accelerator.rs
hyperbolic_taxonomy.rs
multimodal_fusion.rs
sealed_engine.rs
# Runtime Targets (4)
browser_wasm.rs
edge_iot.rs
serverless_function.rs
ruvllm_inference.rs
```
### Example Structure
Each example follows this pattern:
```rust
//! # Example Title
//!
//! Category: Agentic | Practical | Vertical | Exotic | Runtime
//!
//! **What this demonstrates:**
//! - Feature A
//! - Feature B
//!
//! **RVF segments used:** VEC, INDEX, WITNESS, ...
//!
//! **Run:** `cargo run --example example_name`
fn main() {
// Self-contained, deterministic, no external dependencies
}
```
### Design Constraints
1. **Self-contained**: Each example runs without external services (databases, APIs, models)
2. **Deterministic**: Seeded RNG produces identical output across runs
3. **Fast**: Each completes in < 2 seconds on commodity hardware
4. **Documented**: Module-level doc comments explain the pattern and RVF segments used
5. **Buildable**: All examples compile against existing RVF crate APIs
### Dependencies
No new crate dependencies beyond what `examples/rvf/Cargo.toml` already provides:
- `rvf-runtime`, `rvf-types`, `rvf-wire`, `rvf-manifest`, `rvf-index`, `rvf-quant`, `rvf-crypto`
- `rand`, `tempfile`, `ed25519-dalek`
## Consequences
### Positive
- Users can discover all RVF capabilities through runnable code
- Each category targets a different audience (AI engineers, domain specialists, systems programmers)
- Examples serve as integration tests for advanced API surface
- The repository becomes a reference implementation catalog
### Negative
- 24 additional files to maintain (mitigated by CI: `cargo build --examples`)
- Some examples simulate external systems (LLM tokens, genomic data) with synthetic data
- Examples may drift from API as crates evolve (mitigated by workspace-level `cargo test`)
### Neutral
- Examples are not benchmarks; performance numbers are illustrative
- Domain-specific examples (genomics, finance) use synthetic data, not real datasets