Amplitude-based detection is fragile in non-static environments #51
I have a core question after reading the implementation, focused on the practical applicability boundary of CSI amplitude features in real-world environments.
In the current code, there is logic that applies thresholding on amplitude statistics to produce `human_detected`/`confidence`. From my experience, in real (non-enclosed, non-static) environments, CSI amplitude is affected by many non-human factors, which makes the mapping between "amplitude fluctuations" and "human activity" unstable and increases the false-positive risk. Only in relatively enclosed, stable-layout, static environments do amplitude-based features tend to show repeatable correlation with human activity.

Could you please clarify the following verifiable information so users can understand the intended scope and limitations of this project?
Thanks.
Thanks for the sharp question — this is exactly the kind of scrutiny WiFi sensing projects need. You're right that raw CSI amplitude alone is fragile. Let me answer each question with specific code references.
1. Does the project assume a fixed scene / static environment / fixed-position calibration?
Short answer: Yes, for the rule-based RSSI classifier. No, for the Rust CSI pipeline.
There are two separate detection paths in this project:
- **Python RSSI path** (`v1/src/sensing/classifier.py`) — the rule-based classifier you identified. It uses fixed thresholds (`presence_variance_threshold=0.5`, `motion_energy_threshold=0.1`) and is explicitly designed for controlled, single-room scenarios. It does assume a relatively static environment. The confidence model is a weighted heuristic (60% base + 20% spectral + 20% cross-receiver agreement), not a learned model. This path is the MVP sensing layer: useful for demos and enclosed rooms, but you're correct that it would produce false positives in dynamic environments.

- **Rust CSI path** (`wifi-densepose-signal` crate) — the SOTA pipeline, with several robustness mechanisms beyond raw amplitude:
  - `signal/src/hampel.rs`
  - `signal/src/csi_ratio.rs`
  - `signal/src/phase_sanitizer.rs`
  - `signal/src/fresnel.rs`
  - `signal/src/subcarrier_selection.rs` — `mincut_subcarrier_partition()` dynamically separates sensitive (human-affected) vs. insensitive subcarriers using graph min-cut. This means the system adapts per frame to which subcarriers carry body information vs. environmental noise.
  - `signal/src/spectrogram.rs` — `gate_spectrogram()` uses learned attention weights to suppress non-human spectral components.
  - `signal/src/motion.rs:536`, `signal/src/motion.rs:553` — `calibrate()` captures an empty-room baseline; detection uses the ratio against that baseline.
  - `signal/src/motion.rs:39-46` — the `MotionDetector` uses a `MotionScore` that combines variance, correlation, phase, and Doppler components. Amplitude alone is weighted at 0.3-0.4, never 1.0.

2. Are there robustness designs for non-enclosed or dynamic environments?
Partially — in the signal processing layer. Not yet in the end-to-end pipeline.
What exists:
- `mincut_subcarrier_partition()` adapts per frame to which subcarriers are sensitive to human motion vs. environmental multipath. This is the most relevant mechanism for non-static scenes.
- Smoothing (`smoothing_factor`) suppresses transient false positives.
- Cross-receiver agreement (`classifier.py:191-197`) — when multiple receivers agree, confidence is boosted; disagreement reduces it.

What is not yet implemented but planned:

- Contrastive CSI embedding (branch `feat/adr-024-contrastive-csi-embedding`). This learns environment-invariant representations via self-supervised contrastive learning. A model pretrained on diverse rooms should generalize to new environments without per-room calibration. The `FingerprintIndex` with the `temporal_baseline` type is specifically designed for anomaly detection relative to a learned baseline, rather than a fixed threshold.
- The `wifi-densepose-mat` crate supports multi-AP fusion, which increases robustness by providing geometric constraints that single-AP amplitude cannot.

Honest assessment: the current system works best in enclosed rooms with fixed AP positions. Open-plan offices, hallways with cross-traffic, or outdoor environments would require the contrastive embedding + multi-AP path to achieve acceptable false-positive rates.
3. Do you have cross-scene validation results or a reproducible evaluation setup?
Not yet. This is a gap.
What exists for reproducibility:
- `v1/data/proof/` — verifiable synthetic CSI with fixed seeds.
- Dataset loaders (`wifi-densepose-train/src/dataset.rs`) for public datasets with multi-room, multi-subject captures that could serve as cross-scene benchmarks.

What doesn't exist yet: cross-scene validation results or a published benchmark harness.
I'll add a "Known Limitations" section to the README to make these boundaries explicit, and we've filed ADR-024 (AETHER) specifically because contrastive pretraining is the established approach in the literature (SelfHAR, Wang et al. 2023) for cross-environment generalization.
TL;DR: You identified a real limitation. The rule-based RSSI classifier is fragile by design and assumes static environments. The Rust CSI pipeline has multiple robustness mechanisms (conjugate mult, Hampel, min-cut subcarrier partition, multi-feature fusion) but hasn't been validated cross-scene. The newly implemented AETHER embedding model (ADR-024) is specifically designed to address cross-environment transfer via contrastive pretraining. Cross-scene benchmarks are needed and not yet done.
Thank you for pushing on this — it's exactly the kind of feedback that improves the project.
Thanks for the thoughtful question — these are exactly the right concerns for real-world CSI deployments. You're correct that raw amplitude thresholding alone is fragile outside controlled environments.
We've been actively working on this. Here's what the codebase now provides across the merged pipeline and the in-flight PR #52:
1. Static/calibrated environment assumption
The original amplitude-based detection (
derive_pose_from_sensinginmain.rs) does assume a relatively static scene — it uses fixed thresholds on variance/energy and was designed as a baseline heuristic, not a production classifier. This is acknowledged.2. Robustness features for dynamic/non-enclosed environments
PR #52 (ADR-024, all 7 phases complete, 272 tests passing) introduces several mechanisms specifically designed to address the fragility you describe:
Contrastive embeddings (SimCLR/InfoNCE): Instead of thresholding raw amplitude, we learn a 128-dim embedding space where similar CSI patterns cluster together. This is inherently more robust than hand-tuned thresholds because the model learns which amplitude variations are semantically meaningful vs. noise.
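For reference, the SimCLR-style InfoNCE objective can be written in a few lines of numpy. This is the generic formulation from the literature, not the project's training code:

```python
import numpy as np

def info_nce(z1: np.ndarray, z2: np.ndarray, tau: float = 0.1) -> float:
    """InfoNCE over a batch of paired embeddings.
    z1[i] and z2[i] are two augmented views of the same CSI window;
    every other row of z2 serves as a negative for row i."""
    # L2-normalise so dot products are cosine similarities.
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / tau                      # (N, N) similarity matrix
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positives sit on the diagonal; loss is their mean negative log-likelihood.
    return float(-np.mean(np.diag(log_probs)))
```

Minimizing this pulls the two views of the same window together while pushing apart views of different windows, which is what makes the embedding less sensitive to hand-tuned amplitude thresholds.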
5 physically-motivated augmentations (`CsiAugmenter`): temporal jitter, subcarrier masking, Gaussian noise injection, phase rotation, and amplitude scaling — all applied during self-supervised pretraining to teach the model invariance to exactly the non-human fluctuations you mention.

Environment drift detection (`EnvironmentDetector`): a 3-sigma detector on embedding distance that flags when the environment has changed (e.g., furniture moved, new reflectors). Entries inserted during drift are tagged `anomalous` in the fingerprint index rather than silently corrupting the baseline.
EnvironmentDetector): 3-sigma detector on embedding distance that flags when the environment has changed (e.g., furniture moved, new reflectors). Entries inserted during drift are taggedanomalousin the fingerprint index rather than silently corrupting the baseline.MicroLoRA per-environment adaptation: Rank-4 LoRA adapters (1,792 params each, 93% smaller than full retrain) allow per-scene fine-tuning without catastrophic forgetting. When you deploy in a new room, you adapt the projection head while preserving learned CSI structure via EWC++ (Elastic Weight Consolidation).
Hard-negative mining: During training, the system selects difficult negative pairs (similar amplitude patterns from different activities) with configurable ratio and warmup, which directly improves discrimination in ambiguous scenarios.
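As a rough illustration of the augmentation families listed above, here is a sketch over a complex CSI window. This is not the `CsiAugmenter` implementation — the parameter ranges are invented, and temporal jitter is omitted for brevity:

```python
import numpy as np

def augment_csi(csi: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Apply physically-motivated augmentations to a complex CSI window
    of shape (time, subcarriers). Illustrative sketch only."""
    out = csi.copy()
    # Global phase rotation: models residual receiver phase offset (CFO/SFO).
    out = out * np.exp(1j * rng.uniform(0, 2 * np.pi))
    # Amplitude scaling: models AGC gain variation.
    out = out * rng.uniform(0.8, 1.2)
    # Subcarrier masking: zero a random contiguous band of subcarriers.
    start = int(rng.integers(0, out.shape[1] // 2))
    out[:, start:start + out.shape[1] // 8] = 0
    # Gaussian noise injection on both I and Q components.
    out = out + rng.normal(0, 0.01, out.shape) + 1j * rng.normal(0, 0.01, out.shape)
    return out
```

The point of each transform is that it perturbs exactly the nuisance factors (gain, phase offset, dropped subcarriers, thermal noise) that should not change the human-activity label.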
3. Cross-scene validation
We don't yet have published cross-scene benchmarks — the ADR references MM-Fi and Wi-Pose as target evaluation datasets (both include multi-room, multi-subject protocols). The training pipeline (`--pretrain`, `--train`, `--embed`, `--build-index` CLI flags) is wired end-to-end and ready for this evaluation. The `FingerprintIndex` supports 4 index types (environment, activity, temporal, person) specifically to enable cross-scene retrieval experiments.

Bottom line: the amplitude-threshold path is a baseline/demo. The contrastive embedding pipeline in PR #52 is the intended production path and directly addresses the environmental fragility you've identified. Contributions and cross-scene evaluation results are very welcome.
Practical finding: subcarrier spread vs. mean amplitude (real ESP32-S3 deployment)
I've been running the Python sensing pipeline with 2x ESP32-S3 nodes streaming ADR-018 CSI to a Raspberry Pi, and hit exactly the fragility described in this issue. Sharing concrete findings that may help others.
Setup
`ws_server.py` with `Esp32UdpCollector` → `RssiFeatureExtractor` → `PresenceClassifier`

Problem: `mean_amplitude` is noise-dominated

When using `np.mean(amplitudes)` per CSI frame as the time-series signal, the mean amplitude fluctuates rapidly frame-to-frame (measurement noise from I/Q quantization), but this noise is spectrally flat — it has no structure at breathing/motion frequencies. A Butterworth LPF at 5 Hz removes the noise but also zeroes out the entire signal.
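For anyone reproducing this, the filtering step described above can be written with standard scipy calls (the 3rd-order / 5 Hz setup is from this post; `fs` must be the per-node frame rate, see the multi-node note below):

```python
import numpy as np
from scipy.signal import butter, filtfilt

def lowpass(signal: np.ndarray, fs: float,
            cutoff: float = 5.0, order: int = 3) -> np.ndarray:
    """Zero-phase Butterworth low-pass matching the 3rd-order / 5 Hz
    setup described above. fs is the CSI frame rate in Hz."""
    # butter() takes the cutoff normalized to the Nyquist frequency.
    b, a = butter(order, cutoff / (fs / 2), btype="low")
    # filtfilt() runs the filter forward and backward: zero phase lag.
    return filtfilt(b, a, signal)
```

Because `filtfilt` is zero-phase, the filtered series stays time-aligned with the raw frames, which matters when correlating across receivers.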
Solution: use per-frame subcarrier spread (`amp_spread`)

Replacing `mean_amplitude` with `np.std(amplitudes)` (standard deviation across subcarriers within a single frame) produces a signal that does contain body-motion information after the same Butterworth LPF (3rd order, 5 Hz cutoff) + 10 Hz downsampling.
The intuition: human body causes frequency-selective fading (multipath). Different subcarriers are attenuated differently depending on body position. The spread (std) across subcarriers captures this multipath diversity and changes at body-motion rates, while the mean averages it out.
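The mean-vs-spread distinction is a one-liner per feature; this sketch shows why the spread can move while the mean stays flat:

```python
import numpy as np

def frame_features(csi_amplitudes: np.ndarray) -> tuple[np.ndarray, np.ndarray]:
    """Per-frame mean and subcarrier spread for a (frames, subcarriers)
    amplitude matrix. The mean averages out frequency-selective fading;
    the spread (std across subcarriers) captures it."""
    mean_amp = csi_amplitudes.mean(axis=1)    # noise-dominated signal
    amp_spread = csi_amplitudes.std(axis=1)   # multipath-diversity signal
    return mean_amp, amp_spread
```

For example, a flat frame `[1, 1, 1, 1]` and a frequency-selective frame `[0, 2, 0, 2]` have the same mean but very different spread — exactly the body-induced fading signature described above.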
Additional changes needed for multi-node
When using 2+ ESP32 nodes, their CSI frames interleave in a single buffer, corrupting sample rate estimation (we saw 504 Hz estimated rate, which pushed all FFT bins above the motion/breathing bands). Fix: per-node ring buffers with separate feature extraction, then cross-receiver agreement for confidence.
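The per-node buffering fix can be sketched like this (names and frame format are illustrative, not the `ws_server.py` code):

```python
from collections import defaultdict, deque

class PerNodeBuffers:
    """Route incoming CSI frames into per-node ring buffers so that
    interleaved multi-node traffic never corrupts a node's sample-rate
    estimate. Hypothetical sketch of the fix described above."""

    def __init__(self, maxlen: int = 512):
        # One bounded deque per node id; old frames fall off automatically.
        self.buffers = defaultdict(lambda: deque(maxlen=maxlen))

    def push(self, node_id: str, timestamp: float, frame) -> None:
        self.buffers[node_id].append((timestamp, frame))

    def sample_rate(self, node_id: str) -> float:
        """Estimate frame rate from this node's timestamps only."""
        ts = [t for t, _ in self.buffers[node_id]]
        if len(ts) < 2:
            return 0.0
        return (len(ts) - 1) / (ts[-1] - ts[0])
```

With two 50 Hz nodes interleaved in a single buffer, a naive estimator sees ~100 Hz; per-node buffers recover the true 50 Hz per stream, keeping the FFT bins inside the motion/breathing bands.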
Thresholds for amplitude-spread signal
The original thresholds (`presence_variance=0.5`, `motion_energy=0.1`) are tuned for RSSI (dBm). For the amplitude-spread signal, we found these work:

- `presence_variance_threshold=0.3`
- `motion_energy_threshold=0.05`

Limitation
Breathing band (0.1–0.5 Hz) is still 0.000 with this approach. Breathing detection likely requires per-subcarrier phase tracking rather than amplitude-only analysis, as the Rust pipeline's conjugate multiplication and phase sanitization are designed to provide.
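The phase trick referenced here is standard: multiplying one RX chain's CSI by the conjugate of another cancels the carrier/sampling phase offsets common to both, leaving geometry-induced phase. A minimal numpy sketch of the general technique (not the `csi_ratio.rs` / `phase_sanitizer.rs` code):

```python
import numpy as np

def conjugate_multiply(csi_a: np.ndarray, csi_b: np.ndarray) -> np.ndarray:
    """Conjugate multiplication between two RX antennas of one receiver.
    CFO/SFO phase offsets are common to both chains, so they cancel in
    the product, leaving phase differences caused by path geometry."""
    return csi_a * np.conj(csi_b)
```

Any common phase offset `e^{j*theta}` applied to both antennas drops out of the product, which is why breathing-scale phase variation becomes recoverable where single-antenna phase is unusable.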
Tested on: ESP32-S3-DevKitC × 2, Raspberry Pi (Debian 13, aarch64), Python 3.11, ESP-IDF v5.2