HomeDocs-Technical WhitePaper36-EFT.WP.EDX.Current v1.0

Appendix C. Dataset Cards & Pipeline Cards


I. Purpose & Coverage (Mandatory)

  1. This appendix defines the Dataset Card (v1.0) and Pipeline Card (v1.0) fields, templates, and checks for the full path: acquisition/simulation → alignment → path correction → compute → inversion → PPC → export/release.
  2. Unified arrival dialect (two equivalent forms; path/measure explicit and delta_form recorded):
    • Constant-factored: T_arr = ( 1 / c_ref ) * ( ∫ n_eff d ell )
    • General: T_arr = ( ∫ ( n_eff / c_ref ) d ell )
  3. SI units. Hard gates: check_dim = pass, Re{Z_eft} ≥ 0, KK_consistency = pass, two-dialect T_arr agreement (≤ u(T_arr)).

II. Naming & Versioning


III. Dataset Card (v1.0) — Field Definitions

Field

Req.

Type

Constraints & Notes

dataset_id

str

Globally unique

version

str

semver

source

enum

measurement / simulation / third_party

instruments

list

Model/calibration/ranges

deemb

obj

De-embedding method/version/artifact path

sync

obj

Timebase/scheme/dt_sync_s

binding_ref

str

layout ↔ gamma(ell) binding ID

arrival

obj

form/gamma/measure/c_ref/Tarr/u_Tarr/delta_form

freq_grid_Hz

list[float]

Hz

Z_eft

obj

{real:[], imag:[]}

argZ

list[float]

rad

weights

obj

w_p(omega) (optional)

n_eff

list/obj

Per-segment or effective

qa_gates

obj

Results for check_dim/passivity/KK

hashes

obj

Per-file sha256

created_at

str

ISO 8601

toolchain

obj

Major versions & components


IV. Dataset Card — Minimal Template (copy-ready)

dataset_card:

dataset_id: "EDXCUR-001"

version: "1.0.0"

source: "measurement"

instruments:

- {type:"VNA", model:"—", cal:"2025-08-01", range:"10 MHz–20 GHz"}

deemb: {method:"TRL", version:"1.2", file:"/artifacts/deemb.json"}

sync: {ref:"10MHz", scheme:"shared_ref+trigger", dt_sync_s: 2.0e-12}

binding_ref: "LAY2PATH-xxxx"

arrival:

form: "n_over_c" # or "one_over_c_times_n"

gamma: "explicit"

measure: "d_ell"

c_ref: 299792458.0

Tarr_s: 1.234e-09

u_Tarr_s: 6.0e-12

delta_form: "n_over_c"

freq_grid_Hz: [ ... ]

Z_eft: {real:[...], imag:[...]}

argZ: [ ... ]

weights: {γ_main:[...], γ_side:[...] }

n_eff: [ ... ] # per segment or effective

qa_gates: {check_dim:"pass", passivity:"pass", KK:"pass"}

hashes:

files:

- {path:"/data/raw.s2p", sha256:"..."}

- {path:"/data/aligned.parquet", sha256:"..."}

created_at: "2025-09-15T09:30:00Z"

toolchain: {simstack:"1.0.0", kernels:"0.9.3"}


V. Pipeline Card (v1.0) — Stages & Fields

Field Definitions

Field

Req.

Type

Notes

pipeline_id

str

Unique pipeline ID

version

str

semver

steps

list[obj]

Ordered stage definitions

env_lock

obj

Environment lock (interpreter/deps/container/hardware)

seeds

obj

RNG seeds

artifacts

list[str]

Relative paths to exports


VI. Pipeline Card — Minimal Template (copy-ready)

pipeline_card:

pipeline_id: "EDXCUR-PIPE-001"

version: "1.0.0"

steps:

- {id:"ingest", cmd:"load_raw --input /data/raw.s2p"}

- {id:"align", cmd:"align --deemb /artifacts/deemb.json --sync /artifacts/sync.json"}

- {id:"path_correct", cmd:"pathfix --binding LAY2PATH-xxxx"}

- {id:"compute", cmd:"compute_zeft --arrival n_over_c"}

- {id:"invert", cmd:"invert --priors /cfg/priors.yaml --sampler NUTS"}

- {id:"ppc", cmd:"ppc --band ω1,ω2"}

- {id:"export", cmd:"export_cards --out /release"}

env_lock:

python: "3.11.6"

packages:

- {name:"numpy", version:"1.26.4"}

- {name:"scipy", version:"1.13.1"}

- {name:"pandas", version:"2.2.2"}

container:

image: "registry/edxcurrent:1.0.0"

digest: "sha256:..."

hardware: {cpu:"x86_64", gpu:"none", ram_GB:32}

seeds: {global: 20250915}

artifacts: ["/release/dataset_card.yaml","/release/pipeline_card.yaml"]


VII. Environment Lock (v1.0) — Fields & Template

env_lock:

os: {name:"Ubuntu", version:"22.04"}

interpreter: {name:"Python", version:"3.11.6"}

packages:

- {name:"numpy", version:"1.26.4", hash:"sha256:..."}

- {name:"scipy", version:"1.13.1", hash:"sha256:..."}

- {name:"pandas", version:"2.2.2", hash:"sha256:..."}

container:

image: "registry/edxcurrent:1.0.0"

digest: "sha256:..."

hardware: {cpu:"x86_64", gpu:"none", ram_GB:32}


VIII. QA Gates & Consistency Checks (Unified Record)

qa_gates:

check_dim: "pass" # dimensional consistency for key equalities

passivity: "pass" # min(Re{Z_eft}) ≥ 0

KK: "pass" # K–K consistency

Tarr_dual:

diff_s: 3.0e-12 # |T_arr (two dialects) difference|

u_Tarr_s:6.0e-12

pass: true


IX. Audit Trail & Release Manifest (Templates)

audit_manifest:

run_id: "RUN-2025-09-15-001"

toolchain: {simstack:"1.0.0", kernels:"0.9.3"}

diffs: {params_changed: ["Ks_tau"], reason:"tuning within prior"}

seeds: {global: 20250915}

qa:

check_dim: "pass"

passivity: "pass"

KK: "pass"

logs: "/logs/run_001.txt"

release_manifest:

normative_refs: ["EFT.WP.Core.DataSpec v1.0","Methods.Repro v1.0"]

artifacts:

- "/release/dataset_card.yaml"

- "/release/pipeline_card.yaml"

- "/release/env_lock.yaml"

- "/release/audit_manifest.yaml"

hashes:

dataset_card: "sha256:..."

pipeline_card: "sha256:..."

env_lock: "sha256:..."

audit: "sha256:..."


X. Equivalence & Reproducibility Metrics (Threshold Fields & Pseudocode)

equivalence_gates:

eps_Z_gate: 0.03 # 3% relative error

eps_phi_gate: 0.05 # rad

eps_w_gate: 0.10 # total weight diff

# 1) Two-dialect T_arr consistency

T1 = (1/c_ref) * sum(n_eff[i]*Δell[i]) # n_over_c

T2 = sum((n_eff[i]/c_ref) * Δell[i]) # one_over_c_times_n

assert abs(T1 - T2) <= u_Tarr

# 2) Quick gates

assert min(Re(Z_eft)) >= 0.0

assert KK_consistency(Z_eft) is True

# 3) Equivalence

eps_Z = max_abs(Z_eft_rep - Z_eft_pub) / max_abs(Z_eft_pub)

eps_phi= max_abs(phi_rep - phi_pub)

eps_w = max_over_ω(sum_abs(w_rep - w_pub))

assert eps_Z <= eps_Z_gate and eps_phi <= eps_phi_gate and eps_w <= eps_w_gate


XI. Suggested Release Directory Layout

/release/

dataset_card.yaml

pipeline_card.yaml

env_lock.yaml

audit_manifest.yaml

qa_report.json

artifacts/

deemb.json

sync.json

aligned.parquet

Z_eft.npy


XII. Error Codes (Unified Returns)

errors:

E_BAD_UNITS: "Unit or dimensional mismatch"

E_KK_FAIL: "K–K consistency failed"

E_PASSIVITY: "Passivity failed (Re{Z}<0)"

E_BINDING_MISSING: "Missing binding_ref or path segments"

E_DELTA_FORM_MISSING: "Arrival delta_form not recorded"

E_TARR_MISMATCH: "Two-dialect T_arr mismatch"

E_EQUIV_FAIL: "Equivalence thresholds not met"

E_ENVLOCK_FAIL: "Env-lock reconstruction failed or fingerprint mismatch"

E_QA_FAIL: "QA gate failed"


XIII. Compliance Bundle (one-shot, copy-ready)

dataset_card: { ... per §IV template ... }

pipeline_card: { ... per §VI template ... }

env_lock: { ... per §VII template ... }

qa_gates: { ... per §VIII template ... }

audit_manifest: { ... per §IX template ... }

release_manifest:

normative_refs: ["EFT.WP.Core.DataSpec v1.0","Methods.Repro v1.0"]

artifacts: ["/release/dataset_card.yaml","/release/pipeline_card.yaml","/release/env_lock.yaml","/release/audit_manifest.yaml"]

hashes: {dataset_card:"sha256:...", pipeline_card:"sha256:...", env_lock:"sha256:...", audit:"sha256:..."}

equivalence_gates: {eps_Z_gate:0.03, eps_phi_gate:0.05, eps_w_gate:0.10}


XIV. Release Hard Gates (Summary)

check_dim = pass; Re{Z_eft} ≥ 0; KK_consistency = pass; two-dialect T_arr agreement; dataset_card/pipeline_card/env_lock/audit_manifest present with sha256; equivalence thresholds ε_Z / ε_φ / ε_w passed.

Copyright & License (CC BY 4.0)

Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11|Current version:v5.1
License link:https://creativecommons.org/licenses/by/4.0/