* feat: recursive resolution + full DNSSEC validation Numa becomes a true DNS resolver — resolves from root nameservers with complete DNSSEC chain-of-trust verification. Recursive resolution: - Iterative RFC 1034 from configurable root hints (13 default) - CNAME chasing (depth 8), referral following (depth 10) - A+AAAA glue extraction, IPv6 nameserver support - TLD priming: NS + DS + DNSKEY for 34 gTLDs + EU ccTLDs - Config: mode = "recursive" in [upstream], root_hints, prime_tlds DNSSEC (all 4 phases): - EDNS0 OPT pseudo-record (DO bit, 1232 payload per DNS Flag Day 2020) - DNSKEY, DS, RRSIG, NSEC, NSEC3 record types with wire read/write - Signature verification via ring: RSA/SHA-256, ECDSA P-256, Ed25519 - Chain-of-trust: zone DNSKEY → parent DS → root KSK (key tag 20326) - DNSKEY RRset self-signature verification (RRSIG(DNSKEY) by KSK) - RRSIG expiration/inception time validation - NSEC: NXDOMAIN gap proofs, NODATA type absence, wildcard denial - NSEC3: SHA-1 iterated hashing, closest encloser proof, hash range - Authority RRSIG verification for denial proofs - Config: [dnssec] enabled/strict (default false, opt-in) - AD bit on Secure, SERVFAIL on Bogus+strict - DnssecStatus cached per entry, ValidationStats logging Performance: - TLD chain pre-warmed on startup (root DNSKEY + TLD DS/DNSKEY) - Referral DS piggybacking from authority sections - DNSKEY prefetch before validation loop - Cold-cache validation: ~1 DNSKEY fetch (down from 5) - Benchmarks: RSA 10.9µs, ECDSA 174ns, DS verify 257ns Also: - write_qname fix for root domain "." (was producing malformed queries) - write_record_header() dedup, write_bytes() bulk writes - DnsRecord::domain() + query_type() accessors - UpstreamMode enum, DEFAULT_EDNS_PAYLOAD const - Real glue TTL (was hardcoded 3600) - DNSSEC restricted to recursive mode only Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: TCP fallback, query minimization, UDP auto-disable Transport resilience for restrictive networks (ISPs blocking UDP:53): - DNS-over-TCP fallback: UDP fail/truncation → automatic TCP retry - UDP auto-disable: after 3 consecutive failures, switch to TCP-first - IPv6 → TCP directly (UDP socket binds 0.0.0.0, can't reach IPv6) - Network change resets UDP detection for re-probing - Root hint rotation in TLD priming Privacy: - RFC 7816 query minimization: root servers see TLD only, not full name Code quality: - Merged find_starting_ns + find_starting_zone → find_closest_ns - Extracted resolve_ns_addrs_from_glue shared helper - Removed overall timeout wrapper (per-hop timeouts sufficient) - forward_tcp for DNS-over-TCP (RFC 1035 §4.2.2) Testing: - Mock TCP-only DNS server for fallback tests (no network needed) - tcp_fallback_resolves_when_udp_blocked - tcp_only_iterative_resolution - tcp_fallback_handles_nxdomain - udp_auto_disable_resets - Integration test suite (4 suites, 51 tests) - Network probe script (tests/network-probe.sh) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: DNSSEC verified badge in dashboard query log - Add dnssec field to QueryLogEntry, track validation status per query - DnssecStatus::as_str() for API serialization - Dashboard shows green checkmark next to DNSSEC-verified responses - Blog post: add "How keys get there" section, transport resilience section, trim code blocks, update What's Next Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use SVG shield for DNSSEC badge, update blog HTML Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: NS cache lookup from authorities, UDP re-probe, shield alignment - find_closest_ns checks authorities (not just answers) for NS records, fixing TLD priming cache misses that caused redundant root queries - Periodic UDP re-probe every 5min when disabled — re-enables UDP after switching from a restrictive network to an open one - Dashboard DNSSEC shield uses fixed-width container for alignment - Blog post: tuck key-tag into trust anchor paragraph Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: TCP single-write, mock server consistency, integration tests - TCP single-write fix: combine length prefix + message to avoid split segments that Microsoft/Azure DNS servers reject - Mock server (spawn_tcp_dns_server) updated to use single-write too - Tests: forward_tcp_wire_format, forward_tcp_single_segment_write - Integration: real-server checks for Microsoft/Office/Azure domains Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: recursive bar in dashboard, special-use domain interception Dashboard: - Add Recursive bar to resolution paths chart (cyan, distinct from Override) - Add RECURSIVE path tag style in query log Special-use domains (RFC 6761/6303/8880/9462): - .localhost → 127.0.0.1 (RFC 6761) - Private reverse PTR (10.x, 192.168.x, 172.16-31.x) → NXDOMAIN - _dns.resolver.arpa (DDR) → NXDOMAIN - ipv4only.arpa (NAT64) → 192.0.0.170/171 - mDNS service discovery for private ranges → NXDOMAIN Eliminates ~900ms SERVFAILs for macOS system queries that were hitting root servers unnecessarily. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: move generated blog HTML to site/blog/posts/, gitignore - Generated HTML now in site/blog/posts/ (gitignored) - CI workflow runs pandoc + make blog before deploy - Updated all internal blog links to /blog/posts/ path - blog/*.md remains the source of truth Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: review feedback — memory ordering, RRSIG time, NS resolution - Ordering::Relaxed → Acquire/Release for UDP_DISABLED/UDP_FAILURES (ARM correctness for cross-thread coordination) - RRSIG time validation: serial number arithmetic (RFC 4034 §3.1.5) + 300s clock skew fudge factor (matches BIND) - resolve_ns_addrs_from_glue collects addresses from ALL NS names, not just the first with glue (improves failover) - is_special_use_domain: eliminate 16 format! allocations per .in-addr.arpa query (parse octet instead) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: API endpoint tests, coverage target - 8 new axum handler tests: health, stats, query-log, overrides CRUD, cache, blocking stats, services CRUD, dashboard HTML - Tests use tower::oneshot — no network, no server startup - test_ctx() builds minimal ServerCtx for isolated testing - `make coverage` target (cargo-tarpaulin), separate from `make all` - 82 total tests (was 74) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
149 lines
4.2 KiB
Rust
149 lines
4.2 KiB
Rust
use std::time::Instant;
|
|
|
|
pub struct ServerStats {
|
|
queries_total: u64,
|
|
queries_forwarded: u64,
|
|
queries_recursive: u64,
|
|
queries_cached: u64,
|
|
queries_blocked: u64,
|
|
queries_local: u64,
|
|
queries_overridden: u64,
|
|
upstream_errors: u64,
|
|
started_at: Instant,
|
|
}
|
|
|
|
#[derive(Clone, Copy, PartialEq, Eq)]
|
|
pub enum QueryPath {
|
|
Local,
|
|
Cached,
|
|
Forwarded,
|
|
Recursive,
|
|
Blocked,
|
|
Overridden,
|
|
UpstreamError,
|
|
}
|
|
|
|
impl QueryPath {
|
|
pub fn as_str(&self) -> &'static str {
|
|
match self {
|
|
QueryPath::Local => "LOCAL",
|
|
QueryPath::Cached => "CACHED",
|
|
QueryPath::Forwarded => "FORWARD",
|
|
QueryPath::Recursive => "RECURSIVE",
|
|
QueryPath::Blocked => "BLOCKED",
|
|
QueryPath::Overridden => "OVERRIDE",
|
|
QueryPath::UpstreamError => "SERVFAIL",
|
|
}
|
|
}
|
|
|
|
pub fn parse_str(s: &str) -> Option<QueryPath> {
|
|
if s.eq_ignore_ascii_case("LOCAL") {
|
|
Some(QueryPath::Local)
|
|
} else if s.eq_ignore_ascii_case("CACHED") {
|
|
Some(QueryPath::Cached)
|
|
} else if s.eq_ignore_ascii_case("FORWARD") {
|
|
Some(QueryPath::Forwarded)
|
|
} else if s.eq_ignore_ascii_case("RECURSIVE") {
|
|
Some(QueryPath::Recursive)
|
|
} else if s.eq_ignore_ascii_case("BLOCKED") {
|
|
Some(QueryPath::Blocked)
|
|
} else if s.eq_ignore_ascii_case("OVERRIDE") {
|
|
Some(QueryPath::Overridden)
|
|
} else if s.eq_ignore_ascii_case("SERVFAIL") {
|
|
Some(QueryPath::UpstreamError)
|
|
} else {
|
|
None
|
|
}
|
|
}
|
|
}
|
|
|
|
impl Default for ServerStats {
|
|
fn default() -> Self {
|
|
Self::new()
|
|
}
|
|
}
|
|
|
|
impl ServerStats {
|
|
pub fn new() -> Self {
|
|
ServerStats {
|
|
queries_total: 0,
|
|
queries_forwarded: 0,
|
|
queries_recursive: 0,
|
|
queries_cached: 0,
|
|
queries_blocked: 0,
|
|
queries_local: 0,
|
|
queries_overridden: 0,
|
|
upstream_errors: 0,
|
|
started_at: Instant::now(),
|
|
}
|
|
}
|
|
|
|
pub fn record(&mut self, path: QueryPath) -> u64 {
|
|
self.queries_total += 1;
|
|
match path {
|
|
QueryPath::Local => self.queries_local += 1,
|
|
QueryPath::Cached => self.queries_cached += 1,
|
|
QueryPath::Forwarded => self.queries_forwarded += 1,
|
|
QueryPath::Recursive => self.queries_recursive += 1,
|
|
QueryPath::Blocked => self.queries_blocked += 1,
|
|
QueryPath::Overridden => self.queries_overridden += 1,
|
|
QueryPath::UpstreamError => self.upstream_errors += 1,
|
|
}
|
|
self.queries_total
|
|
}
|
|
|
|
pub fn total(&self) -> u64 {
|
|
self.queries_total
|
|
}
|
|
|
|
pub fn uptime_secs(&self) -> u64 {
|
|
self.started_at.elapsed().as_secs()
|
|
}
|
|
|
|
pub fn snapshot(&self) -> StatsSnapshot {
|
|
StatsSnapshot {
|
|
uptime_secs: self.uptime_secs(),
|
|
total: self.queries_total,
|
|
forwarded: self.queries_forwarded,
|
|
recursive: self.queries_recursive,
|
|
cached: self.queries_cached,
|
|
local: self.queries_local,
|
|
overridden: self.queries_overridden,
|
|
blocked: self.queries_blocked,
|
|
errors: self.upstream_errors,
|
|
}
|
|
}
|
|
|
|
pub fn log_summary(&self) {
|
|
let uptime = self.started_at.elapsed();
|
|
let hours = uptime.as_secs() / 3600;
|
|
let mins = (uptime.as_secs() % 3600) / 60;
|
|
let secs = uptime.as_secs() % 60;
|
|
|
|
log::info!(
|
|
"STATS | uptime {}h{}m{}s | total {} | fwd {} | recursive {} | cached {} | local {} | override {} | blocked {} | errors {}",
|
|
hours, mins, secs,
|
|
self.queries_total,
|
|
self.queries_forwarded,
|
|
self.queries_recursive,
|
|
self.queries_cached,
|
|
self.queries_local,
|
|
self.queries_overridden,
|
|
self.queries_blocked,
|
|
self.upstream_errors,
|
|
);
|
|
}
|
|
}
|
|
|
|
pub struct StatsSnapshot {
|
|
pub uptime_secs: u64,
|
|
pub total: u64,
|
|
pub forwarded: u64,
|
|
pub recursive: u64,
|
|
pub cached: u64,
|
|
pub local: u64,
|
|
pub overridden: u64,
|
|
pub blocked: u64,
|
|
pub errors: u64,
|
|
}
|