HomeDocs-Technical WhitePaper53-Model Card Template v1.0

Chapter 12 — Monitoring, Drift & Rollback


I. Purpose & Scope


II. Prerequisites & Inputs


III. Monitoring KPIs & Thresholds


IV. Drift Detection

  1. Data drift:
    • Tests: KS/χ²/AD; multivariate MMD/Energy distance; windowed stratification (batch/device/region).
    • Path quantities: interval coverage & band-width trends for T_arr/Phi; align phase within reference window first.
  2. Concept drift:
    • Proxy ground truth / delayed labels: align online feedback with val/test/holdout.
    • Performance decay: ΔMAE/ΔAUC/Δr_phi over thresholds with non-overlapping CIs.
  3. Uncertainty calibration: PIT/calibration curves/Brier; on failure, enable conservative intervals or robust surrogates.

V. Rollback Mechanism

  1. FSM: normal → degrade → rollback → recover → normal, event-driven (gate breach/drift confirmed/resource alerts).
  2. Degrade:
    • Model: route to lower-complexity path / robust surrogates (Huber/quantile).
    • Data: tighten gates, isolate risky slices.
    • Path: switch to fullband/short window or raise Δell guard (without breaking upper bounds).
  3. Rollback execution: lock previous stable version (signature & checksum), keep I/O contract & coverage mode unchanged.
  4. Recovery & verification: progressive canary rollout; after /validate passes G1–G8 and perf/quality thresholds, switch fully.

VI. Normative Path Forms

Align “time → path → phase” before monitoring & alerts; record delta_form; arrays satisfy length & step constraints.


VII. Gate Mapping


VIII. Machine-Readable Configs
A. monitoring_rules.yaml

version: "1.0.0"

windows: { short_s: 300, long_s: 86400 }

kpis:

latency_p95_s: { target: 0.200, alert: 0.250, critical: 0.300 }

throughput_rps: { target_min: 1000 }

q_res: { target_max: 0.20 }

p_dim: { require: 1.0 }

r_phi_lb95: { target_min: 0.60 }

epsilon_flux_p95: { target_max: 0.02 }

delta_t_abs_ns: { target_max: 50 }

delta_tau_ch_ns: { target_max: 5 }

drift:

data: { test: "ks", p_crit: 0.01, strata: ["device","region"] }

concept: { metric: "val/MAE", delta_crit: 0.05, ci_agree: true }

actions:

on_alert: ["degrade"]

on_critical: ["rollback"]

B. rollback_fsm.yaml

version: "1.0.0"

states: [normal, degrade, rollback, recover]

transitions:

- { from: normal, to: degrade, when: "gate_alert or drift_alert" }

- { from: degrade, to: rollback, when: "gate_critical or perf_critical" }

- { from: rollback,to: recover, when: "stable_prev_version_ready" }

- { from: recover, to: normal, when: "validate_pass and perf_ok" }

degrade:

strategies: ["robust_surrogate","tighten_gates","isolate_slices"]

rollback:

version_tag: "v1.2.3-lock"

verify: ["checksum","/validate","SLA/SLO"]

recover:

rollout: { canary_percent: 10, steps: 3, pause_s: 600 }


C. alerts.jsonl (sample)

JSON json
{
  "ts": "2025-09-24T16:10:00Z",
  "level": "critical",
  "event": "gate_fail",
  "gate": "G4",
  "detail": "p_dim < 1.0",
  "action": "rollback"
}

IX. Anti-Patterns & Fixes


X. Cross-References


XI. Checklist


Copyright & License (CC BY 4.0)

Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11|Current version:v5.1
License link:https://creativecommons.org/licenses/by/4.0/