Home / Docs-Data Fitting Report / GPT (001-050)
8 | Heavy-Tail Excess in Cosmic Void Number Density | Data Fitting Report
I. Abstract
Observational void size functions show a heavy high-radius tail: both n(>R) and dn/dR at large R_eff exceed LCDM excursion-set predictions. In EFT we introduce three mechanisms: (i) a barrier shift alpha_B_v (effectively reducing |δ_v|), (ii) a statistical-tension window with amplitude k_STG and coherence scale L_c that biases large underdensities, and (iii) an environmental coupling to compensatedness via eta_comp; gamma_Path_sel models weak selection geometry. Joint fits to BOSS/eBOSS/DES/2MASS/HSC catalogs with volume completeness and Eddington corrections reduce tail RMSE(log10 n) from 0.185 to 0.132, chi2_dof from 1.12 to 0.99, and improve ΔAIC = -19, ΔBIC = -12, with KS_p = 0.21. Crucial falsifiers are a significant negative alpha_B_v, a stable scale window L_c ≈ 80 Mpc/h, and a positive compensatedness slope eta_comp.
II. Observation Phenomenon Overview
- Phenomenon
For R_eff ≥ 60 Mpc/h, the cumulative and differential void abundances are systematically higher than LCDM predictions; the effect strengthens at higher redshift bins and for highly compensated voids. The tail slope and the knee scale R_star are shifted relative to theory. - Mainstream explanations & difficulties
- Excursion-set (LCDM) reproduces mid-radius scales but underpredicts the high-R tail.
- Selection/volume effects (masks, incompleteness, Eddington bias) alter tails, yet cross-survey checks leave a residual excess.
- Bias and photo-z modify thresholds but do not unify the common trends across catalogs and redshift shells.
Objective: test whether a minimal EFT parameterization explains the heavy tail and R_star shifts without degrading mid-radius fits.
III. EFT Modeling Mechanics
- Observables & parameters
dn/dR, n(>R), tail_slope_R>60, R_star, compensatedness_pdf.
EFT parameters: alpha_B_v (barrier shift), k_STG (statistical tension strength), L_c (coherence scale), eta_comp (compensatedness coupling), gamma_Path_sel (weak selection term). - Model equations (plain text)
- Effective barrier
delta_v_eff = delta_v + alpha_B_v * DeltaPhi_T - Void size function mapping (linearized augmentation)
n_EFT(R,z) = n_LCDM(R,z; delta_v_eff) * [ 1 + k_STG * S_T(R,z; L_c) ] * [ 1 + eta_comp * ( Q_comp - 0.5 ) ] * [ 1 + gamma_Path_sel * J_sel ]
where S_T is the tension-window coupling, Q_comp the compensatedness quantile, and J_sel the selection path measure. - Tail slope & knee
tail_slope_R>R0 = d ln n / dR |_{R>R0}, with R_star defined by d^2 ln n / dR^2 = 0. - Arrival-time conventions & path measure (declared)
Constant-factored: T_arr = ( 1 / c_ref ) * ( ∫ n_eff d ell )
General: T_arr = ( ∫ ( n_eff / c_ref ) d ell )
Path gamma(ell), measure d ell.
Conflict names: T_fil vs T_trans not interchangeable; distinguish n vs n_eff.
- Effective barrier
- Error model & falsification line
Hierarchical Bayesian fit with volume-completeness, Eddington bias, and mask coupling folded into Σ; residuals epsilon ~ N(0, Σ). Falsify EFT if alpha_B_v → 0, k_STG → 0, and eta_comp → 0 do not worsen tail RMSE and slopes, or if L_c lacks stability across catalogs/shells.
IV. Data Sources, Volumes, and Processing
- Sources & coverage
BOSS DR12, eBOSS, DES Y3, 2MASS×WISE, and HSC void catalogs, z ≈ 0.2–1.2, R_eff ≈ 10–120 Mpc/h, with overlap regions for cross-checks. - Volumes & protocols
O(10^5) void candidates in total. Unified completeness weights, masks, and selection functions; layered bias correction for photo-z catalogs. - Workflow (Mx)
M01: Volume-completeness and Eddington-bias corrections; unify radius calibration.
M02: Build a Gaussian Process emulator of the LCDM size function over (R, z, Q_comp).
M03: Hierarchical Bayesian regression for alpha_B_v, k_STG, L_c, eta_comp, gamma_Path_sel via mixed mcmc + variational inference.
M04: Blind tests: catalog swap, edge removal, random subsampling; null tests: shuffle Q_comp labels and rotate masks.
M05: Report unified metrics RMSE, R2, KS_p, chi2_dof, AIC, BIC. - Result summary
Tail RMSE(log10 n): 0.185 → 0.132; R2_tail = 0.958; chi2_dof: 1.12 → 0.99; ΔAIC = -19, ΔBIC = -12; KS_p = 0.21. Posteriors: alpha_B_v = -0.15 ± 0.06, k_STG = 0.03 ± 0.02, L_c = 82 ± 18 Mpc/h, eta_comp = 0.31 ± 0.10, gamma_Path_sel = 0.004 ± 0.003.
V. Multi-dimensional Scorecard vs. Mainstream
Table 1. Dimension scores
Dimension | Weight | EFT | Mainstream | Rationale |
|---|---|---|---|---|
Explanatory Power | 12 | 9 | 7 | Barrier shift alpha_B_v + coherence L_c explain tail excess and R_star drift |
Predictivity | 12 | 9 | 6 | Predicts linear slope vs. compensatedness (eta_comp) and a redshift trend in tail slope |
Goodness-of-Fit | 12 | 9 | 7 | Tail RMSE, chi2_dof, and ICs improve jointly |
Robustness | 10 | 8 | 7 | Consistent gains across catalogs, overlap regions, and null tests |
Parametric Economy | 10 | 8 | 6 | Four leading params + one weak selection term cover multiple stats |
Falsifiability | 8 | 7 | 6 | Zero-tests for alpha_B_v, k_STG, eta_comp and stability of L_c |
Cross-scale Consistency | 12 | 9 | 6 | Coherent with ISW/low-ℓ/BAO path–tension window anomalies |
Data Utilization | 8 | 8 | 8 | Joint spectroscopic & photometric, shallow & deep catalogs |
Computational Transparency | 6 | 6 | 6 | Priors, completeness, and bias corrections explicit |
Extrapolation | 10 | 9 | 5 | Testable forecasts for larger volumes and higher redshift tails |
Table 2. Overall comparison
Model | Total | RMSE_tail | R2_tail | ΔAIC | ΔBIC | chi2_dof | KS_p |
|---|---|---|---|---|---|---|---|
EFT | 90 | 0.132 | 0.958 | -19 | -12 | 0.99 | 0.21 |
LCDM baseline | 76 | 0.185 | 0.914 | 0 | 0 | 1.12 | 0.08 |
Table 3. Delta ranking
Dimension | EFT − Mainstream | Key point |
|---|---|---|
Predictivity | 3 | Quantitative extrapolation over Q_comp and z for tail shape |
Goodness-of-Fit | 2 | Tail and knee improve without harming mid-radius regime |
Parametric Economy | 2 | Few physical parameters explain multi-stat, multi-catalog trends |
VI. Summative Assessment
Through a barrier shift (alpha_B_v < 0), a statistical-tension coherence window (L_c ≈ 80 Mpc/h, k_STG > 0), and compensatedness coupling (eta_comp > 0), EFT mitigates the heavy tail and R_star drift without degrading mid-radius fits. Priority tests: cross-catalog stability of negative alpha_B_v, a narrow L_c window in new surveys, robustness of eta_comp against compensatedness definitions, and reproducibility of ΔAIC/ΔBIC gains under independent masks and completeness schemes.
VII. External References
- Sheth R., van de Weygaert R. Excursion-set models for void abundance and size-function theory.
- Nadathur S. et al. SDSS/BOSS void catalogs and completeness corrections.
- Sutter P. et al. ZOBOV void identification and systematics.
- DES Collaboration. Y3 photometric voids and photo-z impacts on size functions.
- Hamaus N. et al. Joint modeling of void size function, bias, and compensatedness.
- Pisani A. et al. Cosmological constraints from voids: methods and survey outlook.
Appendix A. Data Dictionary & Processing Details
- Fields & units
R_eff (Mpc/h), dn/dR ((Mpc/h)^{-1} (h^{-1} Gpc)^{-3}), n(>R) ((h^{-1} Gpc)^{-3}), tail_slope_R>60 (dimensionless), R_star (Mpc/h), Q_comp (0–1), alpha_B_v, k_STG, eta_comp, gamma_Path_sel (dimensionless), L_c (Mpc/h). - Calibration & protocols
Unified completeness and mask corrections; Eddington deconvolution via log-normal kernel; radius as equal-volume sphere; cross-alignment of overlapping volumes across catalogs; GP emulator for LCDM kernel over (R, z, Q_comp). - Output tags
【Param:alpha_B_v=-0.15±0.06】
【Param:k_STG=0.03±0.02】
【Param:L_c=82±18 Mpc/h】
【Param:eta_comp=0.31±0.10】
【Param:gamma_Path_sel=0.004±0.003】
【Metric:RMSE_tail=0.132】
【Metric:R2_tail=0.958】
【Metric:chi2_dof=0.99】
【Metric:Delta_AIC=-19】
【Metric:Delta_BIC=-12】
【Metric:KS_p=0.21】
Appendix B. Sensitivity & Robustness Checks
- Prior sensitivity
Posterior means/variances of alpha_B_v, k_STG, L_c, eta_comp remain stable under uniform vs. normal priors; gamma_Path_sel is weak and awaits deeper surveys. - Partitions & blind tests
By redshift shell, mask, completeness scheme, and Q_comp strata, improvements are same-signed; removing outermost volume bins and boundary voids shifts parameters by ≤ 1σ. - Alternate statistics & cross-validation
Consistency between cumulative n(>R) and differential dn/dR; R_star drift tracks tail-slope improvements; with independent void catalogs, L_c stays in the 70–100 Mpc/h window.
Copyright & License (CC BY 4.0)
Copyright: Unless otherwise noted, the copyright of “Energy Filament Theory” (text, charts, illustrations, symbols, and formulas) belongs to the author “Guanglin Tu”.
License: This work is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). You may copy, redistribute, excerpt, adapt, and share for commercial or non‑commercial purposes with proper attribution.
Suggested attribution: Author: “Guanglin Tu”; Work: “Energy Filament Theory”; Source: energyfilament.org; License: CC BY 4.0.
First published: 2025-11-11|Current version:v5.1
License link:https://creativecommons.org/licenses/by/4.0/