Extended dark energy analysis using DESI DR2 BAO measurements

K. Lodha, R. Calderón, William L. Matthewson, Arman Shafieloo, M. Ishak, Jian Pan, C. García-Quintero, D. Huterer, Georgios Valogiannis, Luís Alfonso Ureña López, +90 more

Physical Review D · 2025 · Physics and Astronomy · 89 citations

TL;DR

DESI DR2 BAO combined with Planck CMB and SNe Ia yields a multi-sigma preference for dynamical dark energy over ΛCDM, with reported deviations of $\sim 2.8$–$4.2\sigma$ depending on the dataset combination.

Briefing

This paper asks whether the late-time acceleration of the Universe is consistent with a cosmological constant (ΛCDM) or instead requires dynamical dark energy with a time-varying equation of state. The question matters because ΛCDM is extremely successful but conceptually incomplete: it treats dark energy as a constant vacuum term, while many theoretical ideas motivate evolving dark sectors. The authors focus on new, high-precision baryon acoustic oscillation (BAO) measurements from DESI Data Release 2 (DR2), and they test whether the inferred dark-energy behavior is robust to modeling choices—both parametric (fixed functional forms for the equation of state) and non-parametric (reconstructions with more flexibility).

The study is a combined-probe Bayesian inference analysis. The primary dataset is DESI DR2 BAO, spanning redshifts from approximately 0.1 to 4.2 and organized into multiple tracer samples: BGS (using isotropic $D_V/r_d$ constraints at 0.1 < z < 0.4), LRG bins (0.4 < z < 0.6 and 0.6 < z < 0.8), LRG+ELG (0.8 < z < 1.1), ELG (1.1 < z < 1.6), and QSO (0.8 < z < 2.1), plus Ly$\alpha$ BAO (1.8 < z < 4.2). To anchor early-universe physics and break degeneracies, they include Planck CMB information using a full likelihood setup (high-$\ell$ TTTEEE plus low-$\ell$ TT and low-$\ell$ EE, plus lensing from Planck NPIPE PR4 and ACT DR6) implemented via Cobaya. For some non-parametric reconstructions that can struggle with negative dark-energy density, they also use a compressed CMB prior on $(\theta_*, \omega_b, \omega_{bc})_{\rm CMB}$, which captures key geometric and sound-horizon information while marginalizing over late-time effects.

For late-time distance calibration, the analysis combines DESI with Type Ia supernovae (SNe Ia) from three compilations: PantheonPlus (1550 spectroscopically confirmed SNe, 0.001 < z < 2.26), Union3 (2087 SNe, 0.01 < z < 2.26), and DESY5 (1635 photometric SNe, 0.1 < z < 1.13, plus 194 historical low-z SNe). The paper emphasizes Union3 in figures as a conservative choice due to larger uncertainties, but it reports how conclusions vary across the three SNe datasets.

Methodologically, the authors run Markov Chain Monte Carlo sampling (Metropolis-Hastings) in Cobaya, using CAMB with modifications to allow generalized dark-energy equations of state. They assume spatial flatness (motivated by prior DESI evidence that curvature is not significant). For perturbations and the possibility of crossing the phantom divide, they use the parametrized post-Friedmann (PPF) framework, which permits transitions across $w = - 1$ . They adopt priors summarized in their Table I, including a baseline $w_{0}$ and $w_{a}$ prior structure and additional priors for non-parametric expansions.

The key baseline parameterization is the Chevallier-Polarski-Linder (CPL) form, $w(z) = w_0 + w_a \frac{z}{1+z}$ (equivalently $w(a) = w_0 + w_a(1-a)$). In the DESI DR2 BAO + CMB + Union3 combination, the authors report a preference for a dynamical equation of state away from ΛCDM: the data favor the quadrant $w_0 > -1$ and $w_a < 0$, implying that $w(z)$ was phantom-like ($w(z) < -1$) in the past and evolves to $w(z) > -1$ today. They quantify the deviation from ΛCDM as an improvement in fit corresponding to roughly $2.8\sigma$ to $4.2\sigma$ depending on the dataset combination; they also state that DESI DR2 increases the statistical significance relative to DR1, with improvements in fit in the range $-21.0 \leq \Delta\chi^2 \leq -10.7$ (where $\Delta\chi^2$ is defined relative to the ΛCDM best fit). They further note that DESI+CMB alone already suggests a deviation at about $3\sigma$, independent of any SNe compilation.
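As a minimal sketch of the CPL form above, the following Python evaluates $w(z)$, the corresponding normalized density $f_{\rm DE}(z) = \rho_{\rm DE}(z)/\rho_{{\rm DE},0}$ (the standard closed form obtained by integrating $3(1+w)/(1+z)$), and the redshift at which $w = -1$. The values of `w0` and `wa` are hypothetical, chosen only to sit in the $w_0 > -1$, $w_a < 0$ quadrant; they are not the paper's best-fit values.

```python
import math

def w_cpl(z, w0, wa):
    """CPL equation of state w(z) = w0 + wa * z / (1 + z)."""
    return w0 + wa * z / (1.0 + z)

def f_de_cpl(z, w0, wa):
    """Normalized dark-energy density rho_DE(z) / rho_DE(0) for CPL.

    Closed form of exp(3 * integral_0^z (1 + w(z')) / (1 + z') dz').
    """
    power_law = (1.0 + z) ** (3.0 * (1.0 + w0 + wa))
    return power_law * math.exp(-3.0 * wa * z / (1.0 + z))

def crossing_redshift(w0, wa):
    """Redshift where w(z) = -1, or None if there is no crossing at z > 0."""
    if wa == 0.0:
        return None
    x = -(1.0 + w0) / wa  # x = z / (1 + z) = 1 - a at the crossing
    if not 0.0 < x < 1.0:
        return None
    return x / (1.0 - x)

# Hypothetical illustrative values (NOT taken from the paper).
w0, wa = -0.7, -1.0
z_cross = crossing_redshift(w0, wa)
print(f"w(z=0) = {w_cpl(0.0, w0, wa):.2f}")
print(f"w = -1 crossing at z = {z_cross:.3f}")
print(f"f_DE at the crossing = {f_de_cpl(z_cross, w0, wa):.3f}")
```

Because $d\ln f_{\rm DE}/dz = 3(1+w)/(1+z)$, the density in this sketch peaks exactly where $w$ crosses $-1$, which is one way to read the "crossing/turnover" language used throughout the summary.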

To test robustness, the paper explores multiple alternative two-parameter $w(z)$ forms (BA, EXP, LOG, and JBP) and finds that, except for JBP, these parameterizations yield similar low-redshift phantom-crossing behavior and comparable fit quality. They report explicit $\Delta\chi^2_{\rm MAP}$ values relative to ΛCDM for DESI+CMB+Union3: BA $-17.3$, EXP $-17.5$, LOG $-17.6$, JBP $-13.6$, and CPL $-17.4$. This indicates that the data do not strongly prefer one functional form over another; rather, they constrain a general trend.
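To make the comparison of functional forms concrete, here is a rough sketch of the five parameterizations written in terms of the scale factor $a = 1/(1+z)$. The expressions are the forms commonly quoted in the literature under these names and are an assumption here; the paper's exact conventions may differ. The `w0` and `wa` values are hypothetical.

```python
import math

# Commonly quoted literature forms (assumed for illustration only).
def w_cpl(a, w0, wa):
    return w0 + wa * (1.0 - a)

def w_ba(a, w0, wa):
    return w0 + wa * (1.0 - a) / (a * a + (1.0 - a) ** 2)

def w_exp(a, w0, wa):
    return (w0 - wa) + wa * math.exp(1.0 - a)

def w_log(a, w0, wa):
    return w0 - wa * math.log(a)

def w_jbp(a, w0, wa):
    return w0 + wa * a * (1.0 - a)

FORMS = {"CPL": w_cpl, "BA": w_ba, "EXP": w_exp, "LOG": w_log, "JBP": w_jbp}

# Hypothetical illustrative values (NOT taken from the paper).
w0, wa = -0.7, -1.0
for z in (0.0, 0.1, 0.5, 2.0):
    a = 1.0 / (1.0 + z)
    row = ", ".join(f"{name}={f(a, w0, wa):+.3f}" for name, f in FORMS.items())
    print(f"z={z:3.1f}: {row}")
```

In these assumed forms, all five share the same value $w_0$ and the same slope $-w_a$ at $a = 1$, so they agree at low redshift and differ only in how they extrapolate. JBP is the one that returns to $w_0$ at early times, which may be part of why it behaves differently from the others.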

A central theme is whether the apparent “phantom crossing” is genuine or an artifact of parameterization. The authors introduce crossing statistics using Chebyshev polynomial expansions of $w (z)$ and of the normalized dark-energy density $f_{DE} (z) = ρ_{DE} (z) / ρ_{DE, 0}$ . They emphasize that expanding $f_{DE} (z)$ allows the effective energy density to change sign, broadening the model space (and capturing behaviors that $w (z)$ -only expansions may miss). Using a low-order Chebyshev expansion (they show results for $N = 3$ ), they find reconstructed behaviors that agree well with the CPL $w_{0} w_{a}$ trends, including a smooth evolution that is consistent with a turnover and a crossing near $z \sim 0.5$ (with the exact crossing redshift depending on the chosen parameterization).
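A Chebyshev expansion of this kind can be sketched as follows. The multiplicative form $w(z) = w_{\rm fid}\sum_i C_i T_i(x)$ with $x = 2z/z_{\rm max} - 1$, the choice of `z_max`, and the coefficient values are all assumptions made for illustration; the paper's exact definition may differ.

```python
def chebyshev_T(n, x):
    """Chebyshev polynomial of the first kind, T_n(x), via the recurrence
    T_{k+1}(x) = 2 x T_k(x) - T_{k-1}(x)."""
    t_prev, t_curr = 1.0, x
    if n == 0:
        return t_prev
    for _ in range(n - 1):
        t_prev, t_curr = t_curr, 2.0 * x * t_curr - t_prev
    return t_curr

def w_chebyshev(z, coeffs, z_max, w_fid=-1.0):
    """Assumed form: w(z) = w_fid * sum_i C_i T_i(x), with z mapped to x in [-1, 1]."""
    x = 2.0 * z / z_max - 1.0
    return w_fid * sum(c * chebyshev_T(i, x) for i, c in enumerate(coeffs))

# Hypothetical setup (NOT taken from the paper): terms up to T_3.
z_max = 4.2
lcdm_coeffs = [1.0, 0.0, 0.0, 0.0]      # recovers w = -1 at every redshift
perturbed = [1.0, 0.1, -0.05, 0.0]      # small illustrative deviation

for z in (0.0, 0.5, 2.0):
    print(z, w_chebyshev(z, lcdm_coeffs, z_max), w_chebyshev(z, perturbed, z_max))
```

With this convention ΛCDM corresponds to $C_0 = 1$ and all higher coefficients equal to zero, so non-zero higher-order coefficients directly measure departures from a constant $w = -1$.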

They also perform non-parametric reconstructions via two techniques. First, they use binning with smooth transitions controlled by a hyperbolic tangent smoothing scale (they set $s = 0.02$ for bin edges). They test multiple binning schemes and find the tightest constraints at low redshift, where the data prefer $w (z)$ values more than $3 σ$ away from $- 1$ in the lowest redshift bin. At higher redshift bins, deviations are within $2 σ$ of ΛCDM. For $f_{DE} (z)$ binning (using compressed CMB priors to avoid computational issues with $f_{DE} < 0$ ), they observe a turnover in the range $0.5 < z < 1.0$ with $f_{DE} (z) > 0$ at around $2 σ$ for most bins. They further apply PCA to decorrelate binned parameters, finding that the most informative principal components are localized at low redshift (the first component is localized in $0.1 < z \leq 0.3$ and has an uncertainty at least $20 \times$ smaller than later components).
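The smoothed binning described above can be illustrated with a short sketch. The functional form below (a sum of tanh steps at each bin edge) is one common way to write such a scheme and is an assumption, as are the bin edges and bin values; only the smoothing scale $s = 0.02$ comes from the text.

```python
import math

def w_binned(z, edges, values, s=0.02):
    """Piecewise-constant w(z) with tanh-smoothed transitions at each bin edge.

    `values` holds one w value per bin, so it has one more entry than `edges`.
    """
    if len(values) != len(edges) + 1:
        raise ValueError("need exactly one more bin value than bin edges")
    w = values[0]
    for z_edge, w_lo, w_hi in zip(edges, values, values[1:]):
        # Each edge contributes a step of height (w_hi - w_lo) with width ~ s.
        w += 0.5 * (w_hi - w_lo) * (1.0 + math.tanh((z - z_edge) / s))
    return w

# Hypothetical bin edges and values (NOT taken from the paper).
edges = [0.3, 0.6]
values = [-0.8, -1.0, -1.1]

for z in (0.1, 0.3, 0.45, 1.0):
    print(f"z = {z:4.2f} -> w = {w_binned(z, edges, values):+.4f}")
```

Away from the edges the function returns the bin value, and exactly at an edge it returns the midpoint of the two neighboring bins. A small $s$ keeps the bins close to independent while leaving $w(z)$ differentiable for the Boltzmann solver.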

Second, they use Gaussian process (GP) regression to reconstruct $w(z)$ and $f_{\rm DE}(z)$ as smooth functions with minimal assumptions. They impose $w(z \geq z_{\rm max}) = -1$ with $z_{\rm max} = 10$ and use a squared-exponential kernel with the GP mean set to $w = -1$ and hyperparameters controlling smoothness. The GP reconstructions align closely with the CPL $w_0w_a$ posterior predictions and indicate a phantom-like deviation at low redshift with hints of crossing near $z \approx 0.5$. They also show that including CMB tightens constraints (especially on $\Omega_m$), while DESI+SNe alone allows a broader range of $w(z)$ shapes.
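To show what a squared-exponential GP with mean $w = -1$ means in practice, the sketch below draws one random $w(z)$ curve from such a prior at a few redshift nodes, using only the Python standard library. The node positions, the amplitude `sigma_f`, and the length scale `ell` are hypothetical, and the sketch ignores the $w(z \geq z_{\rm max}) = -1$ condition; it illustrates the prior only, not the authors' inference pipeline.

```python
import math
import random

def sq_exp_kernel(z1, z2, sigma_f, ell):
    """Squared-exponential covariance between w(z1) and w(z2)."""
    return sigma_f ** 2 * math.exp(-0.5 * ((z1 - z2) / ell) ** 2)

def cholesky(matrix):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(matrix)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(matrix[i][i] - s)
            else:
                low[i][j] = (matrix[i][j] - s) / low[j][j]
    return low

def sample_w_prior(z_nodes, sigma_f, ell, rng, jitter=1e-10):
    """Draw w at the nodes from a GP prior with constant mean -1."""
    n = len(z_nodes)
    cov = [[sq_exp_kernel(z_nodes[i], z_nodes[j], sigma_f, ell)
            + (jitter if i == j else 0.0) for j in range(n)] for i in range(n)]
    low = cholesky(cov)
    xi = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return [-1.0 + sum(low[i][k] * xi[k] for k in range(i + 1)) for i in range(n)]

# Hypothetical nodes and hyperparameters (NOT taken from the paper).
z_nodes = [0.0, 0.5, 1.0, 1.5, 2.0]
rng = random.Random(0)
draw = sample_w_prior(z_nodes, sigma_f=0.5, ell=0.5, rng=rng)
print([round(w, 3) for w in draw])
```

A larger `ell` correlates distant redshifts more strongly and yields smoother curves, while `sigma_f` sets how far a draw typically strays from $w = -1$.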

To interpret the deviations physically, the authors examine three dark-energy dynamical classes: (i) thawing quintessence (minimally coupled scalar fields with $w \geq - 1$ ), (ii) emergent dark energy (dark energy negligible for most of cosmic history and emerging recently), and (iii) mirage dark energy (a phenomenological class aligned with a specific degeneracy direction in the $w_{0} w_{a}$ plane). They report that thawing and emergent classes are not strongly favored, while mirage performs remarkably well, capturing the data with one additional degree of freedom.

They quantify model preference using both $\Delta\chi^2_{\rm MAP}$ and the deviance information criterion (DIC). For DESI+CMB+Union3, the $w_0w_a$ model yields $\Delta\chi^2_{\rm MAP} = -17.4$ and $\Delta{\rm DIC} = -17.2$ relative to ΛCDM. The calibrated thawing class gives only mild improvements (e.g., $\Delta\chi^2$ around $-2.5$ for PantheonPlus and $-7.1$ for DESY5, with DIC values less favorable than $w_0w_a$), while algebraic thawing improves more ($\Delta\chi^2 = -2.9$ for Union3 and $-13.2$ for DESY5). The emergent class shows essentially no improvement for Union3 ($\Delta\chi^2 \approx -0.1$) and small negative DIC changes. In contrast, the mirage class achieves strong improvements comparable to $w_0w_a$: for Union3, $\Delta\chi^2 = -16.2$ and $\Delta{\rm DIC} = -18.7$.
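The link between a $\Delta\chi^2$ value and a quoted "$\sigma$" can be sketched as below. It assumes the fit improvement follows a chi-square distribution with two degrees of freedom (Wilks' theorem, for the two extra parameters $w_0$ and $w_a$) and converts the resulting p-value to a two-sided Gaussian equivalent. This is a standard convention and an assumption here, not a statement of the authors' exact procedure.

```python
import math
from statistics import NormalDist

def sigma_from_delta_chi2_2dof(delta_chi2):
    """Gaussian-equivalent significance for a fit improvement with 2 extra parameters.

    For 2 degrees of freedom the chi-square survival function is exp(-x / 2).
    """
    p_value = math.exp(-abs(delta_chi2) / 2.0)
    # Two-sided Gaussian equivalent: p = 2 * (1 - Phi(n_sigma)).
    return NormalDist().inv_cdf(1.0 - p_value / 2.0)

# Delta chi^2 values quoted in the text for w0wa relative to LCDM.
for delta_chi2 in (-10.7, -17.4, -21.0):
    n_sigma = sigma_from_delta_chi2_2dof(delta_chi2)
    print(f"delta_chi2 = {delta_chi2:6.1f} -> about {n_sigma:.1f} sigma")
```

Under these assumptions the two ends of the quoted range, $\Delta\chi^2 = -10.7$ and $-21.0$, should map to roughly $2.8\sigma$ and $4.2\sigma$, consistent with the significance range reported in the summary.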

A key limitation acknowledged by the authors is that “phantom crossing” inferred from $w_{0} w_{a}$ or other flexible reconstructions may be spurious: the parameterization may mimic observables without reflecting a true underlying $w (z)$ crossing. They address this by comparing to a phantom-restricted thawing model (algebraic thawing) that enforces $w (z) \geq - 1$ . Although the thawing model can fit somewhat better than ΛCDM, it is substantially less favored than w0wa, suggesting that the data prefer a sharp evolution feature (a rapid increase and then decrease in dark-energy density) that non-crossing models struggle to reproduce without fine-tuning.

The paper also includes validation on mock datasets. In two mocks (one generated from ΛCDM and one from a w0waCDM model), both binning and GP reconstructions recover the true $w (z)$ within $1 σ$ in most cases. They also test an extreme thawing-like mock and show that their non-parametric implementation may fail to recover some extreme behaviors, indicating that priors and reconstruction choices can limit performance in unusual regions of model space.

Practically, the results imply that if the observed deviations are not due to unmodeled systematics, then dark energy likely evolves at low redshift ( $z ≲ 0.3$ ) and may exhibit an effective phantom-like behavior around $z \sim 0.5$ . This matters for theorists building dark-sector models and for survey analysts planning next-generation cross-checks. The authors emphasize that decisive tests will require complementary probes beyond BAO and background distances, including redshift-space distortions and peculiar velocities (growth information), improved low-redshift supernova measurements, and future CMB experiments to tighten early-universe constraints and break degeneracies.

Overall, the paper’s core contribution is a robustness study: across multiple parametric forms and two non-parametric reconstruction methods, the inferred dark-energy evolution is stable and consistent with a dynamical deviation from ΛCDM, with the strongest and most localized evidence at low redshift and an apparent crossing/turnover feature near $z \approx 0.5$ . While the phantom-crossing interpretation remains theoretically challenging and potentially model-dependent, the authors conclude that the evidence for dynamical dark energy is robust under modeling choices and that ΛCDM is disfavored at the several-sigma level with DESI DR2 BAO combined with CMB and SNe.

Cornell Notes

Using DESI DR2 BAO distances combined with Planck CMB and Type Ia supernovae, the authors test whether dark energy is consistent with ΛCDM or requires evolution. They find that multiple parametric and non-parametric reconstructions agree on a low-redshift deviation from $w = - 1$ , with an apparent phantom-like crossing/turnover near $z \sim 0.5$ , and that extending ΛCDM to a two-parameter $w (z)$ model captures the data trends.

What is the main research question of the paper?

Whether DESI DR2 BAO measurements (combined with CMB and SNe) indicate dynamical dark energy—i.e., a time-varying equation of state—rather than a cosmological constant.

What datasets are used, and what redshift range do they cover?

The primary dataset is DESI DR2 BAO from $z \approx 0.1$ to $z \approx 4.2$ (BGS, LRG, ELG, QSO, and Ly$\alpha$ samples). They combine this with Planck CMB (full likelihood plus lensing) and with one of three SNe Ia compilations: PantheonPlus, Union3, or DESY5.

What is the baseline dark-energy model used for comparison?

The CPL $w_{0} w_{a}$ parameterization, $w (z) = w_{0} + w_{a} \frac{z}{1 + z}$ , with $w = - 1$ corresponding to ΛCDM.

How do the authors quantify the preference for dynamical dark energy over ΛCDM?

They use improvements in fit relative to ΛCDM, reported as $Δ χ_{MAP}^{2}$ and interpreted as $\sim 2.8$ to $4.2 σ$ depending on the dataset combination; they also use Bayesian evidence (in an appendix) and DIC for model comparison.

What is the reported statistical strength of the deviation from ΛCDM?

For DESI DR2 BAO combined with CMB and SNe, the deviation is reported as $2.8$ – $4.2 σ$ , with fit improvements of $- 21.0 \leq Δ χ^{2} \leq - 10.7$ relative to ΛCDM.

What do the reconstructions suggest about the equation of state $w (z)$ at low redshift?

They find $w (z) > - 1$ today but phantom-like behavior in the past, implying an effective crossing/turnover near $z \sim 0.5$ . Non-parametric methods show the strongest deviation at low redshift ( $z ≲ 0.3$ ).

How do alternative two-parameter $w (z)$ forms affect the conclusions?

Alternative parameterizations (BA, EXP, LOG, JBP) yield broadly similar low-redshift phantom-crossing behavior and comparable $Δ χ^{2}$ improvements; JBP fits slightly worse (e.g., $Δ χ^{2} = - 13.6$ vs $\sim - 17$ for others).

What non-parametric methods are used, and what do they find?

They use binning (including binning in $f_{DE} (z)$ ) and Gaussian process regression. Both reproduce the main trend and localize the crossing/turnover around $z \approx 0.5$ , with the lowest-redshift bin showing $> 3 σ$ deviation from $w = - 1$ in the binning analysis.

How do physically motivated dark-energy classes compare in model selection?

Thawing and emergent classes are not strongly favored, while mirage dark energy performs remarkably well, achieving $Δ χ^{2}$ and $Δ DIC$ improvements comparable to the $w_{0} w_{a}$ model.

What is the main limitation regarding the phantom-crossing interpretation?

The inferred crossing may be spurious due to the flexibility or bias of $w (z)$ parameterizations; the authors test this by comparing to a phantom-restricted thawing model (enforcing $w \geq - 1$ ), which fits less well than the $w_{0} w_{a}$ model.

Review Questions

Why does the paper use both parametric and non-parametric reconstructions, and what specific robustness check does each provide?
What observational feature near $z \sim 0.5$ is repeatedly found across methods, and how is it quantified (e.g., via $Δ χ^{2}$ , binning significance, or GP behavior)?
How do the authors test whether phantom crossing is genuine rather than an artifact of the $w_{0} w_{a}$ parameterization?
What does the PCA of binned $w (z)$ reveal about which redshift ranges carry most constraining power?
Compare the model-comparison outcomes ( $Δ χ^{2}$ , DIC) for thawing, emergent, and mirage classes—what does mirage’s success imply?

Key Points

1. DESI DR2 BAO combined with Planck CMB and SNe Ia yields a multi-sigma preference for dynamical dark energy over ΛCDM, with reported deviations of $\sim 2.8$–$4.2\sigma$ depending on the dataset combination.
2. Across multiple parametric $w(z)$ forms (CPL and alternatives), the data favor $w_0 > -1$ and $w_a < 0$, corresponding to phantom-like behavior in the past and $w(z) > -1$ today.
3. Non-parametric reconstructions (binning and Gaussian process regression) reproduce the same qualitative trend and localize a turnover/crossing feature near $z \approx 0.5$, with the strongest deviation from $w = -1$ at low redshift ($z \lesssim 0.3$).
4. Model comparison using DIC and $\Delta\chi^2$ shows that the mirage dark-energy class fits nearly as well as the $w_0w_a$ model, while thawing and emergent classes are less favored.
5. The paper emphasizes that phantom crossing inferred from flexible parameterizations could be spurious; a phantom-restricted thawing model (enforcing $w \geq -1$) fits worse than $w_0w_a$.
6. Validation on mock datasets supports the reconstruction methods for typical cases, but an extreme thawing-like mock demonstrates that some unusual behaviors may not be recovered well.

Highlights

“Even with the additional flexibility introduced by non-parametric approaches, such as binning and Gaussian Processes, we find that extending ΛCDM to include a two parameter w(z) is sufficient to capture the trends present in the data.”

“DESI DR2 BAO data show that the mean posterior distributions have shifted slightly toward the ΛCDM-expected values, while the reduced uncertainties have marginally increased the statistical significance of the deviations from ΛCDM to 2.8–4.2σ.”

“The current data indicate a clear preference for models that feature a phantom crossing; although alternatives lacking this feature are disfavored, they cannot yet be ruled out.”

“Gaussian process regression is better able to localize the redshift where the crossing should occur, around z∼0.5.”

“In contrast, the mirage class performs remarkably well, capturing DE phenomenology with just one additional degree of freedom.”

Topics

Cosmology
Dark energy phenomenology
Large-scale structure and BAO
Bayesian model comparison
Gaussian process regression in cosmology
Non-parametric reconstruction of $w(z)$
Phantom divide crossing and $w=-1$ stability
Supernova cosmology (SNe Ia)
CMB compression and late-time parameter inference

Mentioned

DESI (Dark Energy Spectroscopic Instrument)
Planck
ACT (Atacama Cosmology Telescope)
Cobaya
CAMB
CLASS (cited for context)
PPF framework (parametrized post-Friedmann)
iminuit
PolyChord
anesthetic
Zenodo
MCMC (Metropolis-Hastings)
K. Lodha
R. Calderón
William L. Matthewson
Arman Shafieloo
M. Ishak
D. Huterer
G. Valogiannis
L. A. Ureña-López
N. V. Kamble
D. Parkinson
A. G. Kim
G. B. Zhao
J. L. Cervantes-Cota
J. Rohlf
F. Lozano-Rodríguez
O. Lahav
D. J. Eisenstein
E. Gaztañaga
A. J. Ross
DESI Collaboration (collective authorship)
BAO - Baryon Acoustic Oscillations
DESI - Dark Energy Spectroscopic Instrument
DR1/DR2 - Data Release 1/Data Release 2
CMB - Cosmic Microwave Background
SNe Ia - Type Ia Supernovae
CPL - Chevallier-Polarski-Linder ($w_0w_a$ parameterization)
GP - Gaussian Process
DIC - Deviance Information Criterion
PPF - Parametrized Post-Friedmann framework
MCMC - Markov Chain Monte Carlo
MAP - Maximum a posteriori
PCA - Principal Component Analysis
DV/rd, DM/rd, DH/rd - BAO distance ratios to the sound horizon
NEC - Null Energy Condition
LRG/ELG/QSO/BGS - DESI tracer categories (Luminous Red Galaxies, Emission Line Galaxies, Quasars, Bright Galaxy Survey)