HomeDocs-Technical WhitePaper46-EFT.WP.Data.Benchmarks v1.0

Chapter 13 Fairness, Ethics & Safety Stress


I. Chapter Purpose & Scope

in benchmarks: slices and disparity measures, harm samples and misuse boundaries, policies and gates, reporting and arbitration, linkage with scoring/gates/leaderboard governance; ensure consistency with task definitions, metric system, evaluation protocol, privacy & compliance, metrology, and citation anchors.safety stress, and ethics, fairnessFix specifications for

II. Terminology & Dependencies

  1. Terms: slices, gap_metric (abs_diff|ratio|stat_parity|eq_opp), harm_suite (harm sample set), policy (allow/ban/restrict), gating (release gates), red_team (stress testing), incident, appeal.
  2. Dependencies: privacy, security & compliance (Pipeline v1.0, Ch.14); evaluation protocol (ModelCards v1.0, Ch.11); metrics & units (this volume, Ch.6); robustness & adversarial (this volume, Ch.12); unit & dimension checks (Core.Metrology v1.0:check_dim).
  3. Math & symbols: wrap inline symbols; any division/integral/composite operator must use parentheses; for T_arr use
    • T_arr = ( 1 / c_ref ) * ( ∫ n_eff d ell ), or
    • T_arr = ( ∫ ( n_eff / c_ref ) d ell ),
      declaring gamma(ell) and d ell. No Chinese in formulas/symbols/definitions.

III. Fields & Structure (Normative)

fairness_ethics:

slices:

- {axis:"locale", buckets:["en","zh","es"]}

- {axis:"gender*", buckets:["f","m","other"], note:"if legally permissible and de-identified"}

- {axis:"device", buckets:["mobile","desktop"]}

gap_metric: "abs_diff|ratio|stat_parity|eq_opp"

thresholds:

fairness_warn: 0.03

fairness_block: 0.05

harms:

harm_suite_ref: "safety/harm_suite@vX.Y"

categories: ["toxicity","self-harm","privacy","misinfo","bias"]

scoring: ["toxicity@prob","privacy_leak@binary","prompt_injection@binary"]

policies:

allowed_use: ["academic","benchmark"]

prohibited_use:["surveillance","unlawful_discrimination"]

restricted_use:["medical_advice","financial_advice"]

red_team:

enabled: true

playbooks: ["redteam/prompt_injection.md","redteam/toxicity.md"]

exposure: {shadow:true, canary:0.02}

reporting:

table_axes: ["axis","bucket","metric"]

include_ci: true

significance: {method:"bootstrap", B:10000, alpha:0.05, correction:"Holm-Bonferroni"}

disclosures:

sensitive_attributes: "de-identified|N/A"

human_in_the_loop: true

governance:

gating: {require_ci:true, min_runs:3}

incident:

notify: "security@org.example"

sla_hours: 72

appeal_window_days: 14


IV. Fairness Slices & Disparity Measures

  1. Slices: choose task-relevant and legally compliant axes (e.g., locale/device/region); for potentially sensitive axes, ensure de-identification and legality review.
  2. Disparities:
    • abs_diff = ( metric_ref - metric_grp );
    • ratio = ( metric_grp / metric_ref );
    • stat_parity/eq_opp follow positive-class/threshold definitions in the protocol.
      Report Δ and CI_95, apply multiple-comparison correction.

V. Harm Datasets & Safety Stress


VI. Gates & Governance


VII. Statistics & Reporting


VIII. Metrology & Units (SI)


IX. Machine-Readable Fragment (Drop-in)

fairness_ethics:

slices:

- {axis:"locale", buckets:["en","zh","es"]}

- {axis:"device", buckets:["mobile","desktop"]}

gap_metric: "abs_diff"

thresholds: {fairness_warn:0.03, fairness_block:0.05}

harms:

harm_suite_ref: "safety/harm_suite@v1.1"

categories: ["toxicity","privacy","prompt_injection"]

scoring: ["toxicity@prob","privacy_leak@binary","prompt_injection@binary"]

policies:

allowed_use: ["academic","benchmark"]

prohibited_use: ["surveillance"]

red_team:

enabled: true

playbooks: ["redteam/prompt_injection.md"]

exposure: {shadow:true, canary:0.02}

reporting:

table_axes: ["axis","bucket","metric"]

include_ci: true

significance: {method:"bootstrap", B:10000, alpha:0.05, correction:"Holm-Bonferroni"}

disclosures: {sensitive_attributes:"de-identified", human_in_the_loop:true}

governance:

gating: {require_ci:true, min_runs:3}

incident: {notify:"security@org.example", sla_hours:72, appeal_window_days:14}

metrology: {units:"SI", check_dim:true}


X. Lint Rules (Excerpt, Normative)

lint_rules:

- id: FAIR.SLICES_DEFINED

when: "$.fairness_ethics.slices"

assert: "len(value) >= 1 and all(has_keys(_, 'axis','buckets') for _ in value)"

level: error

- id: FAIR.GAP_METRIC_ALLOWED

when: "$.fairness_ethics.gap_metric"

assert: "value in ['abs_diff','ratio','stat_parity','eq_opp']"

level: error

- id: HARM.SUITE_REF_REQUIRED

when: "$.fairness_ethics.harms"

assert: "has_key(value, 'harm_suite_ref')"

level: error

- id: GOVERN.GATING_PARAMS

when: "$.fairness_ethics.governance.gating"

assert: "has_keys(require_ci, min_runs)"

level: error

- id: REPORT.SIGNIFICANCE_PARAMS

when: "$.fairness_ethics.reporting.significance"

assert: "has_keys(method, B, alpha)"

level: error

- id: METROLOGY.SI_AND_CHECKDIM

when: "$.metrology"

assert: "units == 'SI' and check_dim == true"

level: error


XI. Cross-Reference Anchors


XII. Chapter Compliance Checklist


Copyright & License (CC BY 4.0)

Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11|Current version:v5.1
License link:https://creativecommons.org/licenses/by/4.0/