dearsky/wifi-densepose

ruv cd5943df23 Merge commit 'd803bfe2b1fe7f5e219e50ac20d6801a0a58ac75' as 'vendor/ruvector'

2026-02-28 14:39:40 -05:00

README.md

ruvector-scipix Examples

This directory contains runnable examples of ruvector-scipix, ranging from single-image OCR to batch, streaming, HTTP server, browser, and agent-based processing, plus an accuracy test harness.

Quick Start

All examples can be run using:

cargo run --example <example_name> -- [arguments]

Examples Overview

1. Simple OCR (`simple_ocr.rs`)

Basic OCR functionality with single image processing.

Demonstrates:

Loading and processing a single image
OCR recognition
Output in multiple formats (plain text, LaTeX)
Confidence scores

Usage:

cargo run --example simple_ocr -- path/to/image.png

Example Output:

Plain Text: x² + 2x + 1 = 0
LaTeX: x^{2} + 2x + 1 = 0
Confidence: 95.3%

2. Batch Processing (`batch_processing.rs`)

Parallel processing of multiple images with progress tracking.

Demonstrates:

Directory-based batch processing
Parallel/concurrent processing
Progress bar visualization
Statistics and metrics
JSON output

Usage:

cargo run --example batch_processing -- /path/to/images output.json

Features:

Automatic CPU core detection for optimal parallelism
Real-time progress visualization
Per-file error handling
Aggregate statistics
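
The sketch below illustrates the batch pattern described above (core detection, per-file error handling, aggregate statistics) using only the standard library. It is not taken from `batch_processing.rs`: `process_batch` and `BatchStats` are hypothetical names, and the `process` closure stands in for whatever OCR call the example actually makes.

```rust
use std::thread;

/// Aggregate counts for one batch run (hypothetical type).
#[derive(Debug, Default, PartialEq)]
pub struct BatchStats {
    pub succeeded: usize,
    pub failed: usize,
}

/// Runs `process` over `items` using one worker thread per available CPU core.
/// Results come back in input order; a failing item does not stop the batch.
pub fn process_batch<T, R, F>(items: &[T], process: F) -> (Vec<Result<R, String>>, BatchStats)
where
    T: Sync,
    R: Send,
    F: Fn(&T) -> Result<R, String> + Sync,
{
    // Fall back to a single worker if the core count cannot be determined.
    let workers = thread::available_parallelism().map(|n| n.get()).unwrap_or(1);
    // Ceiling division so every item lands in exactly one chunk.
    let chunk_len = ((items.len() + workers - 1) / workers).max(1);

    let results: Vec<Result<R, String>> = thread::scope(|scope| {
        let handles: Vec<_> = items
            .chunks(chunk_len)
            .map(|chunk| {
                let process = &process;
                scope.spawn(move || chunk.iter().map(process).collect::<Vec<_>>())
            })
            .collect();
        // Joining in spawn order preserves the input order.
        handles
            .into_iter()
            .flat_map(|handle| handle.join().expect("worker thread panicked"))
            .collect()
    });

    let succeeded = results.iter().filter(|r| r.is_ok()).count();
    let stats = BatchStats { succeeded, failed: results.len() - succeeded };
    (results, stats)
}
```

Calling `process_batch(&paths, |p| run_ocr(p))`, where `run_ocr` is a placeholder for the real recognition call, returns one `Result` per file plus the totals. Static chunking keeps the sketch short; a work queue would balance uneven image sizes better.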

3. API Server (`api_server.rs`)

REST API server for OCR processing.

Demonstrates:

HTTP server with Axum framework
Single and batch image processing endpoints
Health check endpoint
Graceful shutdown
CORS support
Multipart file uploads

Usage:

# Start server
cargo run --example api_server

# In another terminal, test the API
curl -X POST -F "image=@equation.png" http://localhost:8080/ocr
curl http://localhost:8080/health

Endpoints:

GET /health - Health check
POST /ocr - Process single image
POST /batch - Process multiple images

4. Streaming Processing (`streaming.rs`)

Streaming PDF processing with real-time results.

Demonstrates:

Large document processing
Streaming results as pages are processed
Real-time progress reporting
Incremental JSON output
Memory-efficient processing

Usage:

cargo run --example streaming -- document.pdf output/

Features:

Processes pages concurrently (4 at a time)
Saves individual page results immediately
Generates final document summary
Per-page timing statistics
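
As a rough illustration of the features above, the following standard-library sketch processes pages in windows of a fixed size and hands each finished page to a sink before the next window starts. `stream_pages`, `PageResult`, and the `recognize` and `sink` closures are hypothetical names for this sketch, not the API used in `streaming.rs`.

```rust
use std::thread;
use std::time::{Duration, Instant};

/// Result for one page, including how long recognition took (hypothetical type).
#[derive(Debug)]
pub struct PageResult {
    pub page: usize,
    pub text: String,
    pub elapsed: Duration,
}

/// Processes pages `0..page_count` at most `window` at a time and passes each
/// finished page to `sink` before the next window starts, so only one window
/// of results is held in memory at once.
pub fn stream_pages<F, S>(page_count: usize, window: usize, recognize: F, mut sink: S)
where
    F: Fn(usize) -> String + Sync,
    S: FnMut(PageResult),
{
    let window = window.max(1);
    let mut start = 0;
    while start < page_count {
        let end = (start + window).min(page_count);
        let finished: Vec<PageResult> = thread::scope(|scope| {
            let handles: Vec<_> = (start..end)
                .map(|page| {
                    let recognize = &recognize;
                    scope.spawn(move || {
                        let started = Instant::now();
                        let text = recognize(page);
                        PageResult { page, text, elapsed: started.elapsed() }
                    })
                })
                .collect();
            handles
                .into_iter()
                .map(|handle| handle.join().expect("page worker panicked"))
                .collect()
        });
        // Emit in page order; a real sink might write one JSON file per page.
        finished.into_iter().for_each(&mut sink);
        start = end;
    }
}
```

With `window` set to 4 this matches the concurrency level stated above. The per-page `elapsed` field is what a timing summary could be built from.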

5. Custom Pipeline (`custom_pipeline.rs`)

Custom OCR pipeline with preprocessing and post-processing.

Demonstrates:

Image preprocessing (denoising, sharpening, binarization)
Post-processing filters
LaTeX validation
Confidence filtering
Custom output formatting
Otsu's thresholding

Usage:

cargo run --example custom_pipeline -- image.png

Pipeline Steps:

Preprocessing:
- Denoising
- Contrast enhancement
- Sharpening
- Binarization (Otsu's method)
- Deskewing
Post-processing:
- Confidence filtering
- LaTeX validation
- Spell checking
- Custom formatting
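
For readers unfamiliar with the binarization step, here is a minimal standard-library sketch of Otsu's method on 8-bit grayscale pixels. The function names are illustrative and the code is not copied from `custom_pipeline.rs`, which may work on `image` crate buffers instead of raw slices.

```rust
/// Returns the Otsu threshold for 8-bit grayscale pixels: the level that
/// maximizes between-class variance when pixels at or below it are treated
/// as background. Returns `None` for an empty image.
pub fn otsu_threshold(pixels: &[u8]) -> Option<u8> {
    if pixels.is_empty() {
        return None;
    }
    // Histogram of gray levels.
    let mut hist = [0u64; 256];
    for &p in pixels {
        hist[p as usize] += 1;
    }
    let total = pixels.len() as f64;
    let sum_all: f64 = hist
        .iter()
        .enumerate()
        .map(|(level, &n)| level as f64 * n as f64)
        .sum();

    let (mut weight_bg, mut sum_bg) = (0.0_f64, 0.0_f64);
    let (mut best_level, mut best_variance) = (0u8, -1.0_f64);
    for level in 0..256usize {
        weight_bg += hist[level] as f64;
        sum_bg += level as f64 * hist[level] as f64;
        if weight_bg == 0.0 {
            continue; // no background pixels yet
        }
        let weight_fg = total - weight_bg;
        if weight_fg == 0.0 {
            break; // every pixel is already background
        }
        let mean_bg = sum_bg / weight_bg;
        let mean_fg = (sum_all - sum_bg) / weight_fg;
        let variance = weight_bg * weight_fg * (mean_bg - mean_fg).powi(2);
        if variance > best_variance {
            best_variance = variance;
            best_level = level as u8;
        }
    }
    Some(best_level)
}

/// Maps pixels above `threshold` to white (255) and the rest to black (0).
pub fn binarize(pixels: &[u8], threshold: u8) -> Vec<u8> {
    pixels.iter().map(|&p| if p > threshold { 255 } else { 0 }).collect()
}
```

When several levels tie, this sketch keeps the lowest one. For an image with a single gray level there is no meaningful split, and the function returns `Some(0)`.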

6. WASM Browser Demo (`wasm_demo.html`)

Browser-based OCR demonstration.

Demonstrates:

WebAssembly integration
Browser-based image upload
Drag-and-drop interface
Real-time visualization
Client-side processing

Setup:

# Build WASM module (when available)
wasm-pack build --target web

# Serve the demo
python3 -m http.server 8000
# Open http://localhost:8000/examples/wasm_demo.html

Features:

Modern, responsive UI
Drag-and-drop file upload
Live preview
Real-time results
No server required (runs in browser)

7. Agent-Based Processing (`lean_agentic.rs`)

Distributed OCR processing with agent coordination.

Demonstrates:

Multi-agent coordination
Distributed task processing
Fault tolerance
Load balancing
Agent statistics

Usage:

cargo run --example lean_agentic -- /path/to/documents

Features:

Spawns multiple OCR agents (default: 4)
Automatic task distribution
Per-agent statistics
Throughput metrics
JSON result export

Architecture:

Coordinator
├── Agent 1 (tasks: 12)
├── Agent 2 (tasks: 15)
├── Agent 3 (tasks: 11)
└── Agent 4 (tasks: 13)
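
One way to picture the coordinator/agent layout above is a shared task queue drained by worker threads. The sketch below uses only the standard library; `run_agents` and `AgentStats` are hypothetical names, and the real `lean_agentic.rs` may coordinate agents differently (for example with async tasks). Fault tolerance is not shown.

```rust
use std::sync::{mpsc, Arc, Mutex};
use std::thread;

/// Per-agent counters reported back to the coordinator (hypothetical type).
#[derive(Debug, Clone, PartialEq)]
pub struct AgentStats {
    pub agent_id: usize,
    pub tasks_completed: usize,
}

/// Spawns `agent_count` worker threads that pull tasks from one shared queue.
/// An idle agent takes the next task, so the load balances itself.
pub fn run_agents<T, F>(tasks: Vec<T>, agent_count: usize, work: F) -> Vec<AgentStats>
where
    T: Send + 'static,
    F: Fn(usize, T) + Send + Sync + 'static,
{
    let (sender, receiver) = mpsc::channel::<T>();
    let receiver = Arc::new(Mutex::new(receiver));
    let work = Arc::new(work);

    let handles: Vec<_> = (1..=agent_count.max(1))
        .map(|agent_id| {
            let receiver = Arc::clone(&receiver);
            let work = Arc::clone(&work);
            thread::spawn(move || {
                let mut tasks_completed = 0;
                loop {
                    // The lock is released at the end of this statement,
                    // so it is not held while the task is being processed.
                    let next = receiver.lock().expect("queue lock poisoned").recv();
                    match next {
                        Ok(task) => {
                            work(agent_id, task);
                            tasks_completed += 1;
                        }
                        Err(_) => break, // queue closed and drained
                    }
                }
                AgentStats { agent_id, tasks_completed }
            })
        })
        .collect();

    for task in tasks {
        sender.send(task).expect("task queue closed unexpectedly");
    }
    drop(sender); // closing the channel tells the agents to stop

    handles
        .into_iter()
        .map(|handle| handle.join().expect("agent thread panicked"))
        .collect()
}
```

The returned `AgentStats` values correspond to the per-agent task counts in the diagram. How tasks split between agents depends on scheduling, so the counts vary from run to run while their sum always equals the number of tasks.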

8. Accuracy Testing (`accuracy_test.rs`)

OCR accuracy testing against ground truth datasets.

Demonstrates:

Dataset-based testing
Multiple accuracy metrics
Category-based analysis
Statistical correlation
Comprehensive reporting

Usage:

cargo run --example accuracy_test -- dataset.json

Dataset Format:

[
  {
    "image_path": "tests/images/quadratic.png",
    "ground_truth_text": "x^2 + 2x + 1 = 0",
    "ground_truth_latex": "x^{2} + 2x + 1 = 0",
    "category": "quadratic"
  }
]

Metrics Calculated:

Text Accuracy - Overall string similarity
Character Error Rate (CER) - Character-level errors
Word Error Rate (WER) - Word-level errors
LaTeX Accuracy - LaTeX format correctness
Confidence Correlation - Pearson correlation between confidence and accuracy
Category Breakdown - Per-category statistics
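
To make the metric definitions concrete, here is a standard-library sketch using the usual textbook formulas: CER and WER as Levenshtein edit distance divided by reference length, and the Pearson coefficient for confidence correlation. The exact definitions in `accuracy_test.rs` are not shown in this README and may differ (for example in normalization), so treat these functions as an assumption.

```rust
/// Levenshtein edit distance between two token sequences.
fn edit_distance<T: PartialEq>(a: &[T], b: &[T]) -> usize {
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    for (i, x) in a.iter().enumerate() {
        let mut cur = vec![i + 1];
        for (j, y) in b.iter().enumerate() {
            let substitute = prev[j] + usize::from(x != y);
            let delete = prev[j + 1] + 1;
            let insert = cur[j] + 1;
            cur.push(substitute.min(delete).min(insert));
        }
        prev = cur;
    }
    prev[b.len()]
}

/// Edit distance normalized by reference length; an empty reference scores
/// 0.0 against an empty hypothesis and 1.0 otherwise.
fn error_rate<T: PartialEq>(reference: &[T], hypothesis: &[T]) -> f64 {
    if reference.is_empty() {
        return if hypothesis.is_empty() { 0.0 } else { 1.0 };
    }
    edit_distance(reference, hypothesis) as f64 / reference.len() as f64
}

/// Character Error Rate.
pub fn cer(reference: &str, hypothesis: &str) -> f64 {
    let r: Vec<char> = reference.chars().collect();
    let h: Vec<char> = hypothesis.chars().collect();
    error_rate(&r, &h)
}

/// Word Error Rate, splitting on whitespace.
pub fn wer(reference: &str, hypothesis: &str) -> f64 {
    let r: Vec<&str> = reference.split_whitespace().collect();
    let h: Vec<&str> = hypothesis.split_whitespace().collect();
    error_rate(&r, &h)
}

/// Pearson correlation coefficient. Returns `None` when the inputs differ in
/// length, have fewer than two points, or either series has zero variance.
pub fn pearson(xs: &[f64], ys: &[f64]) -> Option<f64> {
    if xs.len() != ys.len() || xs.len() < 2 {
        return None;
    }
    let n = xs.len() as f64;
    let mean_x = xs.iter().sum::<f64>() / n;
    let mean_y = ys.iter().sum::<f64>() / n;
    let (mut cov, mut var_x, mut var_y) = (0.0, 0.0, 0.0);
    for (x, y) in xs.iter().zip(ys) {
        let (dx, dy) = (x - mean_x, y - mean_y);
        cov += dx * dy;
        var_x += dx * dx;
        var_y += dy * dy;
    }
    if var_x == 0.0 || var_y == 0.0 {
        return None;
    }
    Some(cov / (var_x * var_y).sqrt())
}
```

Because the error rates divide by the reference length, they can exceed 1.0 when the hypothesis is much longer than the reference.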

Example Output:

Total Cases: 100
Successful: 98 (98.0%)
Average Confidence: 92.5%
Average Text Accuracy: 94.2%
Average CER: 3.1%
Average WER: 5.8%
Confidence Correlation: 0.847

Category Breakdown:
  quadratic: 25 cases, 96.3% accuracy
  linear: 30 cases, 98.1% accuracy
  calculus: 20 cases, 89.7% accuracy

Common Patterns

Error Handling

All examples use anyhow::Result for error handling:

use anyhow::{Context, Result};

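fn main() -> Result<()> {
    // Take the image path from the first command-line argument.
    let path = std::env::args().nth(1).context("Missing image path argument")?;
    let image = image::open(&path)
        .with_context(|| format!("Failed to open image: {path}"))?;
    println!("Loaded {}x{} image", image.width(), image.height());
    Ok(())
}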

Logging

Examples use env_logger for debug output:

# Run with debug logging
RUST_LOG=debug cargo run --example simple_ocr -- image.png

# Run with info logging (default)
RUST_LOG=info cargo run --example simple_ocr -- image.png

Configuration

OCR engine configuration:

use ruvector_scipix::OcrConfig;

let config = OcrConfig {
    confidence_threshold: 0.7,
    max_image_size: 4096,
    enable_preprocessing: true,
    // Remaining options keep their default values.
    ..OcrConfig::default()
};

Dependencies

Core dependencies used in examples:

anyhow - Error handling
tokio - Async runtime
image - Image processing
serde/serde_json - Serialization
indicatif - Progress bars
axum - HTTP server (api_server)
env_logger - Logging

Building Examples

Build all examples:

cargo build --examples

Build specific example:

cargo build --example simple_ocr

Run with optimizations:

cargo run --release --example batch_processing -- images/ output.json

Testing Examples

Create test images:

# Create test directory
mkdir -p test_images

# Add a test image under the name used by the commands below
cp /path/to/math_equation.png test_images/equation.png

Run examples:

# Simple OCR
cargo run --example simple_ocr -- test_images/equation.png

# Batch processing
cargo run --example batch_processing -- test_images/ results.json

# Accuracy test (requires dataset)
cargo run --example accuracy_test -- test_dataset.json

Integration Guide

Using in Your Project

Add dependency:

[dependencies]
ruvector-scipix = "0.1.0"

Basic usage:

use ruvector_scipix::{OcrEngine, OcrConfig};

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let config = OcrConfig::default();
    let engine = OcrEngine::new(config).await?;

    let image = image::open("equation.png")?;
    let result = engine.recognize(&image).await?;

    println!("Text: {}", result.text);
    Ok(())
}

Advanced usage: see the individual examples for patterns such as:

Custom pipelines
Batch processing
API integration
Agent-based processing

Performance Tips

Batch Processing:
- Use parallel processing for multiple images
- Adjust concurrency based on CPU cores
- Enable model caching for repeated runs
Memory Management:
- Stream large documents instead of loading all at once
- Use appropriate image resolution (downscale if needed)
- Clear cache periodically for long-running processes
Accuracy vs Speed:
- Higher confidence thresholds give more accurate output at the cost of slower processing
- Preprocessing improves accuracy but adds overhead
- Balance based on your use case
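
The downscaling tip can be sketched as a small helper that computes target dimensions before resizing. `fit_within` is a hypothetical name, and treating a limit such as `max_image_size: 4096` as a cap on the longer side is an assumption of this sketch, not documented behavior.

```rust
/// Returns dimensions that fit within `max_side` on the longer edge while
/// preserving aspect ratio. Images already within the limit are unchanged.
pub fn fit_within(width: u32, height: u32, max_side: u32) -> (u32, u32) {
    let longest = width.max(height);
    if longest <= max_side {
        return (width, height);
    }
    let scale = max_side as f64 / longest as f64;
    // Round to the nearest pixel, but never collapse a side to zero.
    let scaled = |side: u32| ((side as f64 * scale).round() as u32).max(1);
    (scaled(width), scaled(height))
}
```

The resulting pair can then be passed to whatever resize routine the pipeline uses, so that oversized scans are shrunk once up front rather than processed at full resolution.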

Troubleshooting

Common Issues

"Model not found"

# Download models first
./scripts/download_models.sh

"Out of memory"

Reduce batch size or concurrent workers
Downscale large images before processing
Enable streaming for PDFs

"Low confidence scores"

Enable preprocessing pipeline
Improve image quality (resolution, contrast)
Check for skewed or rotated images

Contributing

When adding new examples:

Add the .rs file to examples/
Update Cargo.toml with example entry
Document in this README
Include usage examples and expected output
Add error handling and logging
Keep examples self-contained

Resources

License

All examples are provided under the same license as ruvector-scipix.
