HomeDocs-Technical WhitePaper43-EFT.WP.Data.DatasetCards v1.0

Chapter 12 Quality & Baselines


I. Chapter Purpose & Scope

. no ChineseFix quality gates (pass criteria), coverage metrics, and a unified posture for baseline tasks/metrics; define evaluation protocol, statistical significance, and reproducibility requirements; keep consistency with splits, labels/ontology, metrology, and uncertainty. Keys use snake_case; cross-volume citations follow “Volume+Version+Anchor”; math uses backticks and parentheses with

II. Terminology & Dependencies


III. Fields & Structure (Normative)

quality:

gates: # Quality gates (must all pass before release)

- {name: "label_consistency", threshold: 0.98, metric: "kappa"}

- {name: "leakage", threshold: 0.0, metric: "leakage_rate"}

- {name: "coverage_min", threshold: 0.99, metric: "split_coverage"}

- {name: "checksum_integrity", threshold: 1.0, metric: "sha256_ok_ratio"}

coverage: # Coverage & distribution monitoring

samples: 0 # replace with actual count at release

per_class: {} # {"FRB": 520, "RFI": 2100, ...}

per_region: {} # space/site/channel dimensions, etc.

ci_method: "bootstrap-bca"

target_ci: 0.95

baseline:

tasks: # Baseline task list (cls/retrieval/regression/detection…)

- {name:"cls_frb_vs_rfi", type:"classification", split:"test"}

metrics: # Metrics & definitions

- {name:"accuracy"}

- {name:"f1_macro"}

- {name:"roc_auc"}

- {name:"pr_auc"}

- {name:"ece"} # Expected Calibration Error

- {name:"brier"}

- {name:"rmse"} # regression/time-series

- {name:"map"} # detection/retrieval

eval_protocol: # Evaluation protocol

splits: "frozen" # must use frozen splits

seeds: [0,1,2,3,4]

repeats: 5

ci: {method:"bootstrap-bca", level:0.95}

significance: {test:"permutation", alpha:0.05}

fairness: {by:["class","region"], gap_metric:"abs_diff"}

robustness: {shift_tests:["snr_drop","time_jitter","spec_notch"]}

reports: # Deliverables & traceability

tables: ["quality/summary.csv","quality/per_class.csv"]

plots: ["quality/roc.png","quality/pr.png","quality/calibration.png"]

see:

- "EFT.WP.Core.DataSpec v1.0:EXPORT"

- "EFT.WP.Core.Metrology v1.0:check_dim"

(Consistent with Chapter 11 frozen splits, Chapter 8 labels/ontology, and Chapters 9–10 metrology/uncertainty.)


IV. Quality Gates (Definitions)


V. Coverage & Distribution Monitoring


VI. Baseline Tasks & Metrics


VII. Evaluation Protocol


VIII. Coupling with Uncertainty & Metrology

(Chapter 10), first normalize units/dimensions per Chapter 9, then combine and report. For path-dependent metrics like T_arr, register delta_form, path="gamma(ell)", measure="d ell", and pass check_dim. metrological uncertainty (resampling/bootstrap) and statistical uncertaintyWhen reporting

IX. Reporting & Traceability


X. Machine-Readable Fragment (Drop-in)

quality:

gates:

- {name:"label_consistency", metric:"kappa", threshold:0.98}

- {name:"leakage", metric:"leakage_rate", threshold:0.0}

- {name:"coverage_min", metric:"split_coverage", threshold:0.99}

coverage:

samples: 15000

per_class: {"FRB":520, "RFI":2100, "Noise":12380}

ci_method: "bootstrap-bca"

target_ci: 0.95

baseline:

tasks:

- {name:"cls_frb_vs_rfi", type:"classification", split:"test"}

metrics: [{name:"f1_macro"}, {name:"roc_auc"}, {name:"ece"}, {name:"brier"}]

eval_protocol:

splits: "frozen"

seeds: [0,1,2,3,4]

repeats: 5

ci: {method:"bootstrap-bca", level:0.95}

significance: {test:"permutation", alpha:0.05}

robustness: {shift_tests:["snr_drop","time_jitter","spec_notch"]}

reports:

tables: ["quality/summary.csv","quality/per_class.csv"]

plots: ["quality/roc.png","quality/pr.png","quality/calibration.png"]

see:

- "EFT.WP.Core.DataSpec v1.0:EXPORT"

- "EFT.WP.Core.Metrology v1.0:check_dim"

(Align with export_manifest.artifacts[]/references[].)


XI. Coupling with Export Manifest (Normative)

export_manifest:

artifacts:

- {path:"quality/summary.csv", sha256:"..."}

- {path:"quality/per_class.csv", sha256:"..."}

- {path:"quality/roc.png", sha256:"..."}

- {path:"quality/calibration.png", sha256:"..."}

references:

- "EFT.WP.Core.DataSpec v1.0:EXPORT"

- "EFT.WP.Core.Metrology v1.0:check_dim"

(Artifacts must be verifiable and carry anchors; no shortcodes/aliases.)


XII. Chapter Compliance Checklist


Copyright & License (CC BY 4.0)

Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11|Current version:v5.1
License link:https://creativecommons.org/licenses/by/4.0/