EFT.WP.Methods.SynthData v1.0

Chapter 7 — Conditional & Controllable Generation (Prompt/CFG/Rules)


I. Scope & Targets

  1. Goals
    • Define a unified convention for conditional and controllable generation: x ~ p_model(x | c; theta), where c may be a text prompt, structured conditions, numeric bounds, or a set of rules.
    • Specify cooperative mechanisms among CFG (classifier-free guidance), hard constraints, and soft penalties; achieve interpretable control without violating units/dimensions or physical constraints.
    • Bring condition alignment, rule satisfaction, and downstream utility under contracts and manifests: manifest.synth.cond.*.
  2. Inputs
    Generation engine engine(theta) (see Chapter 5), condition set Cset, rules & constraints Rules = { g_j(x,c) ≤ 0 }, a reference distribution or panel ref, time-base and arrival-time conventions, SLOs and thresholds.
  3. Outputs
    Conditional samples D_syn(c), alignment reports and acceptance rate acc_rate(c), control strengths and schedules w_cfg(t), contract evaluation report, and manifest.synth.cond.
  4. Applicability
    Applies to tabular, time series, image/audio/text, and multimodal settings; when physical chains are involved, execute jointly with Chapter 6 (record T_arr, delta_form).

II. Terms & Symbols


III. Axioms P407-*


IV. Minimal Equations S407-*

  1. S407-1 (Conditional Generation Base Form)
    x ~ p_model( x | c; theta ), with the objective of minimizing D( p_model(x|c) || p_ref(x|c) ), where D ∈ {W1, KL, MMD}.
  2. S407-2 (Generic CFG Form)
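    With the conditional field s_cond(z,t) = s_theta(z,t|c) and the unconditional field s_uncond(z,t) = s_theta(z,t|∅), define the guided field
    s_guided(z,t) = s_uncond(z,t) + w_cfg(t) * ( s_cond(z,t) - s_uncond(z,t) ).
    Setting w_cfg(t) = 0 recovers s_uncond and w_cfg(t) = 1 recovers s_cond.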
  3. S407-3 (Soft Penalties with Lagrange Multipliers)
    min_theta E_{c} E_{x~p_theta(•|c)} [ L_fid(x,c) + ( ∑_j lambda_j * g_j^+(x,c) ) ], where g_j^+(x,c) = max( 0, g_j(x,c) ).
  4. S407-4 (Hard-Constraint Accept/Reject or Projection)
    • Accept/reject: m_acc(x,c) = 1 if g_j(x,c) ≤ 0 for all j, otherwise 0; the acceptance rate over N generated samples is acc_rate = ( ∑ m_acc ) / N.
    • Constraint projection: x' = Pi_C(x) = argmin_{z ∈ C} d(z,x) with residual res_cons = d(x',x).
  5. S407-5 (Alignment Thresholds & Utility)
    sim_embed(x,c) ≥ sim_min and util(x,c) ≥ util_min; a sample that misses either threshold triggers resampling or gain control.
  6. S407-6 (Sequential KL Regularization)
    max_theta E_{x~p_theta(•|c)}[ R(x,c) ] - beta * KL( p_theta(•|c) || p_ref(•|c) ), with beta ≥ 0.
  7. S407-7 (Dual Arrival Forms)
    T_arr = ( 1 / c_ref ) * ( ∫ n_eff d ell ) and T_arr = ( ∫ ( n_eff / c_ref ) d ell ),
    delta_form = | ( 1 / c_ref ) * ( ∫ n_eff d ell ) - ( ∫ ( n_eff / c_ref ) d ell ) |.
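The guidance, penalty, and constraint equations above (S407-2, S407-3, S407-4) can be illustrated with a minimal Python sketch. It is not the chapter's reference implementation. It assumes samples and fields are plain lists of floats, that each rule is a callable g_j(x, c) returning a value that is ≤ 0 when satisfied, and that the constraint set C is an axis-aligned box with Euclidean distance d, a special case in which the projection Pi_C reduces to clamping. The function names are hypothetical and are not part of the I40-7* interfaces.

```python
import math
from typing import Any, Callable, Sequence

# Hypothetical rule type: g_j(x, c) <= 0 means the rule is satisfied.
Rule = Callable[[Sequence[float], Any], float]


def guided_field(s_uncond: Sequence[float], s_cond: Sequence[float],
                 w_cfg: float) -> list[float]:
    """S407-2: s_guided = s_uncond + w_cfg * (s_cond - s_uncond), elementwise."""
    return [u + w_cfg * (c - u) for u, c in zip(s_uncond, s_cond)]


def soft_penalty(x: Sequence[float], c: Any, rules: Sequence[Rule],
                 lambdas: Sequence[float]) -> float:
    """S407-3 penalty term: sum_j lambda_j * max(0, g_j(x, c))."""
    return sum(lam * max(0.0, g(x, c)) for g, lam in zip(rules, lambdas))


def accept_mask(x: Sequence[float], c: Any, rules: Sequence[Rule]) -> int:
    """S407-4: m_acc = 1 if every g_j(x, c) <= 0, otherwise 0."""
    return 1 if all(g(x, c) <= 0.0 for g in rules) else 0


def acceptance_rate(samples: Sequence[Sequence[float]], c: Any,
                    rules: Sequence[Rule]) -> float:
    """S407-4: acc_rate = (sum of m_acc) / N."""
    return sum(accept_mask(x, c, rules) for x in samples) / len(samples)


def project_to_box(x: Sequence[float], lower: Sequence[float],
                   upper: Sequence[float]) -> tuple[list[float], float]:
    """S407-4 projection, assuming C is a box and d is Euclidean distance.

    Returns the projected point x' and the residual res_cons = d(x', x).
    """
    x_proj = [min(max(xi, lo), hi) for xi, lo, hi in zip(x, lower, upper)]
    res_cons = math.sqrt(sum((a - b) ** 2 for a, b in zip(x_proj, x)))
    return x_proj, res_cons


if __name__ == "__main__":
    # Illustrative condition and rules: keep x[0] inside [lower, upper].
    cond = {"lower": 0.0, "upper": 1.0}
    rules = [lambda x, c: x[0] - c["upper"], lambda x, c: c["lower"] - x[0]]
    samples = [[0.5], [1.5], [-0.25], [0.75]]

    print("guided:", guided_field([0.0, 1.0], [1.0, 3.0], w_cfg=2.0))
    print("acc_rate:", acceptance_rate(samples, cond, rules))
    print("penalty:", soft_penalty([1.5], cond, rules, [2.0, 2.0]))
    print("projection:", project_to_box([1.5, 0.5], [0.0, 0.0], [1.0, 1.0]))
```

The penalty and the accept mask are kept as separate functions because they play different roles: the penalty enters the training objective, while the mask and the projection act on generated samples after the fact.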

V. Metrology Flow M40-7 (Conditional & Controllable Loop)


VI. Contracts & Assertions C40-7xx


VII. Implementation Bindings I40-7*


VIII. Cross-References


IX. Quality Metrics & Risk Control

  1. Core SLIs
    sim_embed_p50/p05/p95, acc_rate, res_cons, n_eff, latency_ms_p99, oom_rate, fairness_gap, delta_form, J.
  2. Common risks & mitigations
    • Overlarge CFG causing mode collapse → use increasing w_cfg(t) schedules and early stop; introduce KL regularization.
    • Overtight rules causing low acceptance → relax to soft penalties, switch to projection, or stage constraints.
    • Condition–sample mismatch → improve encoders or embeddings; apply adaptive sim_min.
    • Unit/dimension violations → run check_dim and range clamping in both encoding and post-processing.
    • Subgroup bias → enable reweighting or mapping alignment; publish gaps and corrective evidence.
    • Time/arrival drift → re-run annotate_time_arrival; audit delta_form and J.
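For the delta_form audit mentioned in the last mitigation, the following Python sketch evaluates the two arrival forms of S407-7 on a sampled path and reports their absolute difference. It assumes a constant c_ref, a path discretized at points ell with n_eff sampled at the same points, and trapezoidal integration; the function names are hypothetical, and this is not the annotate_time_arrival interface. Under these assumptions the two forms are mathematically equal, so a nonzero delta_form reflects only floating-point rounding or an inconsistency in how the inputs were prepared.

```python
import math
from typing import Sequence


def trapezoid(y: Sequence[float], x: Sequence[float]) -> float:
    """Trapezoidal approximation of the integral of y over x."""
    if len(y) != len(x) or len(x) < 2:
        raise ValueError("x and y need equal length and at least two points")
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i])
               for i in range(len(x) - 1))


def arrival_forms(n_eff: Sequence[float], ell: Sequence[float],
                  c_ref: float) -> tuple[float, float, float]:
    """Evaluate both S407-7 forms of T_arr and their difference delta_form.

    Form 1 factors 1/c_ref out of the integral; form 2 keeps it inside.
    """
    t_factored = (1.0 / c_ref) * trapezoid(n_eff, ell)
    t_inside = trapezoid([n / c_ref for n in n_eff], ell)
    delta_form = abs(t_factored - t_inside)
    return t_factored, t_inside, delta_form


if __name__ == "__main__":
    # Illustrative inputs only: a uniform n_eff over a path of length 2.
    t1, t2, delta = arrival_forms([1.0, 1.0, 1.0], [0.0, 1.0, 2.0], c_ref=2.0)
    print("T_arr (factored):", t1)
    print("T_arr (inside):  ", t2)
    print("delta_form:      ", delta)
```

An audit along these lines would compare delta_form against a tolerance taken from the applicable contract; the tolerance value itself is outside what this sketch assumes.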

Summary

This chapter establishes an executable specification for conditional and controllable generation: axioms P407-* for explicit conditions, auditable control, and minimal distortion; equations S407-* for CFG, Lagrangian penalties, and accept–project mechanics; the process flow M40-7 to close the loop across encoding, guidance, checking, alignment, and manifest publication; and contracts C40-7xx plus interfaces I40-7* to ensure engineering implementation and cross-volume consistency. Deliverables and metrics populate manifest.synth.cond, supporting downstream evaluation and release freeze.

Copyright & License (CC BY 4.0)

Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11 | Current version: v5.1
License link: https://creativecommons.org/licenses/by/4.0/