HomeDocs-Technical WhitePaper18-EFT.WP.Methods.CrossStats v1.0

Chapter 4 — Estimation and Intervals (Frequentist/Bayesian Harmonization)


One-Line Objective

Harmonize frequentist and Bayesian conventions for point estimates, intervals, and uncertainty; clarify release rules under weights, units, and a unified timebase.

I. Scope and Objects

  1. Scope
    • Applies to point estimation, interval estimation, and posterior summaries for weighted samples, complex designs, and online streams.
    • Covers uncertainty propagation for means/proportions/rates, regression (GLM), ratios, and functional metrics g( theta ).
  2. Objects
    • Inputs: data D = { (y_i, x_i, w_i, t_i) }, sampling info pi(i) or replicate weights, window Delta_t, arrival-time fields T_arr.
    • Outputs: hat{theta}, SE(hat{theta}), CI_{1-alpha} or posterior intervals, U = k * u_c, and manifest.stats.estim.*.
    • Constraints: consistent units and dimensions; sum(w_i)/N_hat ≈ 1; compute on tau_mono, publish on ts.

II. Terms and Variables

  1. Basics
    • theta (parameter vector), hat{theta} (estimator), SE (standard error), V (covariance), CI_{1-alpha} (interval).
    • Weighted mean: hat{mu}_w = ( ∑ w_i y_i ) / ( ∑ w_i ).
    • Ratio: R = ( ∑ w_i a_i ) / ( ∑ w_i b_i ).
  2. GLM and robust variance
    • Score equations: U( theta ) = ∑ x_i * ( y_i - mu_i( theta ) ) / v_i( theta ) = 0.
    • Sandwich variance: V_hat = ( A^{-1} ) * B * ( A^{-1} )^T, with A = - ∂U/∂theta, B = ∑ u_i u_i^T.
  3. Bayesian elements
    p(theta), L(theta; D), posterior p(theta | D) ∝ L * p(theta), posterior predictive p(y_new | D) = ( ∫ p(y_new | theta) p(theta | D) d theta ).
  4. Metrology and units
    unit(hat{theta}) = unit(theta), dim(hat{theta}) = dim(theta); run check_dim( y - f(x) ) prior to release.
  5. Time and arrival time
    Statistical window: window( t; Delta_t, tau_mono ); record both T_arr conventions and delta_form in parallel.

III. Axioms P304-*


IV. Minimal Equations S304-*

  1. S304-1 (Weighted mean and variance)
    • hat{mu}_w = ( ∑ w_i y_i ) / ( ∑ w_i ).
    • Linearization variance (SRS approximation; use replication for complex designs):
      Var( hat{mu}_w ) ≈ ( ∑ w_i^2 ( y_i - hat{mu}_w )^2 ) / ( ( ∑ w_i )^2 ).
  2. S304-2 (Proportion/rate intervals)
    • Wilson proportion interval:
      p_w = ( y + z^2 / 2 ) / ( n + z^2 ),
      half = z * sqrt( ( p_hat ( 1 - p_hat ) + z^2 / ( 4 n ) ) / ( n + z^2 ) ),
      CI = [ p_w - half , p_w + half ].
    • Poisson rate (exposure E): lambda_hat = ( k / E ), normal-approx interval lambda_hat ± z * sqrt( k ) / E (use exact or Byar for small samples).
  3. S304-3 (Delta method)
    • Scalar: Var( g( hat{theta} ) ) ≈ ( g'( theta ) )^2 Var( hat{theta} );
    • Vector: Var( g( hat{theta} ) ) ≈ G V G^T, with G = ∂g/∂theta |_{hat{theta}}.
  4. S304-4 (Ratio estimator via Delta)
    With R = A / B,
    Var( R ) ≈ ( 1 / B^2 ) Var( A ) + ( A^2 / B^4 ) Var( B ) - ( 2 A / B^3 ) Cov( A, B ).
  5. S304-5 (GLM normal-approx intervals)
    CI_{1-alpha}( theta_j ) = hat{theta}_j ± z_{1-alpha/2} * SE( hat{theta}_j ); use t_{df} for small samples.
  6. S304-6 (Bootstrap intervals)
    Percentile: CI = [ q_{alpha/2}( theta^* ), q_{1-alpha/2}( theta^* ) ]; BCa as the default robust option.
  7. S304-7 (Bayesian intervals and coverage factor)
    • Central or HPD: CI = [ q_{alpha/2}( p(theta|D) ), q_{1-alpha/2} ];
    • Metrology mapping: align U = k * u_c with frequentist intervals; under normal approximation, k ≈ z_{1-alpha/2}.
  8. S304-8 (Posterior predictive checks)
    ppc = P( T( y_rep ) ≥ T( y_obs ) | D ); publish the chosen statistic T(•) and the ppc value.
  9. S304-9 (Two T_arr conventions discrepancy)
    delta_form = | ( 1 / c_ref ) * ( ∫ n_eff d ell ) - ( ∫ ( n_eff / c_ref ) d ell ) |, with delta_form ≤ tol_Tarr asserted.
  10. S304-10 (Placeholder for multiple comparisons)
    Family-wise error control is executed in Chapter 6; here, interval widths are computed under the given alpha_budget, with alpha_used ≤ alpha_budget.

V. Statistical Process M30-4 (Ready → Estimate → Intervals → Diagnostics → Release)


VI. Contracts and Assertions (C30-4xx)


VII. Implementation Bindings I30-*


VIII. Cross-References


IX. Quality and Risk Control

  1. SLI/SLO
    Coverage error | cov_hat - ( 1 - alpha ) |, interval width width_p50/p90, var_gap, mcse_p95, latency_ms_p99.
  2. Risk control and rollback
    • Triggers: failures of C30-402/403/405 or rhat > cap_rhat.
    • Actions: switch between robust and replicate variance; downgrade the release to “experimental”; revert to the previous signed manifest and alert.

Summary

This chapter enforces unified conventions for estimation and intervals via P304-*, provides general formulas S304-*, operationalizes the release pipeline M30-4 with quality gates C30-4xx, and delivers consistent frequentist/Bayesian outputs through I30-*. It forms a stable base for subsequent chapters on multiple comparisons, drift, and causal assessment.

Copyright & License (CC BY 4.0)

Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11|Current version:v5.1
License link:https://creativecommons.org/licenses/by/4.0/