KiDS-Legacy: Cosmological constraints from cosmic shear with the complete Kilo-Degree Survey

Q: What is the primary dataset and survey footprint used for the cosmic shear analysis?

The complete KiDS survey (KiDS-dr5) covering 1347 deg$^2$ of deep nine-band imaging, plus 23 deg$^2$ of KiDS-like spectroscopic calibration observations (KiDZ fields). The analysis uses six tomographic bins and reaches photometric redshift $z_B\le2.0$.

Q: Which summary statistics are used for the fiducial cosmological inference, and why not $\xi_\pm$?

Fiducial inference uses COSEBIs $E_n$ and COSEBIs-derived band powers $C_{\rm E}$, measured over $\theta\in[2,300]$ arcmin. The authors do not use $\xi_\pm$ as primary because of scale-mixing, E/B-mode leakage issues with masks, and evidence that $\xi_\pm$ likelihood becomes non-Gaussian on large scales.

Q: What is the fiducial cosmological constraint on the amplitude of structure growth?

Using COSEBIs $E_n$, the paper reports $S_8=0.815^{+0.016}_{-0.021}$. In the optimized degeneracy parametrization $\Sigma_8=\sigma_8(\Omega_m/0.3)^\alpha$ with $\alpha=0.58$, it finds $\Sigma_8=0.821^{+0.014}_{-0.016}$.

Q: What modeling choices are central to controlling systematics?

A new physically motivated intrinsic-alignment model (NLA-$M$) depending on early-type fraction and host halo mass; marginalization over baryonic feedback via HMCode2020’s AGN feedback parameter; correlated $\delta z$ priors for redshift distribution biases; and inclusion of multiplicative shear calibration uncertainty.

Q: How robust are the constraints to key modeling variations (e.g., power spectrum, covariance, IA)?

Switching emulator to direct camb changes $\Sigma_8$ by at most $0.03\sigma$. Using only Gaussian covariance reduces uncertainty by $12\%$–$19\%$ with negligible shift ($\Delta\Sigma_8\approx0.02\sigma$). Changing IA models yields mild shifts (max $\sim0.17\sigma$ for $C_{\rm E}$, up to $\sim0.54\sigma$ for some $E_n$ comparisons) and can reduce constraining power by $\sim12\%$–$13\%$ for NLA-$z$.

Angus H. Wright, Benjamin Stölzner, Marika Asgari, Maciej Bilicki, Benjamin Giblin, Catherine Heymans, H. Hildebrandt, Henk Hoekstra, Benjamin Joachimi, Konrad Kuijken, +29 more

Astronomy and Astrophysics·2025·Physics and Astronomy·49 citations

8 min read

Read the full paper at DOI or on arxiv

TL;DR

KiDS-Legacy (complete KiDS-dr5) uses 1347 deg $^{2}$ of nine-band imaging and six tomographic bins to $z_{B} \leq 2.0$ , with improved redshift calibration and enhanced simulations.

Briefing Cornell Notes

Briefing

This paper asks how well the completed Kilo-Degree Survey (KiDS) can constrain late-time structure growth and cosmology using cosmic shear, and—crucially—whether the resulting constraints remain consistent with the high-redshift expectations from the Planck CMB. This matters because weak-lensing measurements of the clustering amplitude have, for about a decade, tended to yield lower values of the commonly used amplitude combination $S_{8} \equiv σ_{8} Ω_{m} /0.3$ than Planck, contributing to the so-called $S_{8}$ tension. Resolving whether this is a statistical fluctuation, a modeling/systematic issue (e.g., photometric redshift calibration, intrinsic alignments, baryonic feedback, or shear calibration), or a genuine cosmological discrepancy is central to the field’s transition from “stage-III” to “stage-IV” lensing surveys.

The authors present the KiDS-Legacy cosmic shear analysis using the final KiDS data release (KiDS-dr5). The completed KiDS footprint covers 1347 deg $^{2}$ of deep nine-band imaging (optical plus NIR) and includes an additional 23 deg $^{2}$ of KiDS-like calibration observations from deep spectroscopic surveys. The analysis extends the lensed galaxy sample to photometric redshift $z_{B} \leq 2.0$ by improving the redshift distribution estimation methodology and by using enhanced calibration data and multi-band image simulations.

Methodologically, the study follows a standard catalogs-to-cosmology pipeline implemented in the CosmoPipe framework. The core observable is the cosmic shear two-point signal, but the paper emphasizes that different summary statistics weight angular scales differently and can therefore respond differently to systematics. The authors measure two-point correlation functions $ξ_{\pm} (θ)$ but do not use them as the primary cosmological data vector due to concerns about scale-mixing and E/B-mode leakage in the presence of masks. Instead, the fiducial cosmological inference uses COSEBIs $E_{n}$ and band powers $C_{E}$ , both constructed from linear combinations of finely binned $ξ_{\pm}$ over $θ \in [2, 300]$ arcmin. They use six tomographic bins (with effective number density $n_{eff} = 8.79$ arcmin $^{- 2}$ after calibration and masking) and include multiplicative shear calibration uncertainty via marginalization. The analysis uses analytic covariance modeling with the OneCovariance code, including Gaussian sample-variance and shot-noise terms, non-Gaussian contributions, and super-sample covariance; covariance predictions are validated against GLASS mocks.

For theoretical modeling, the paper computes shear power spectra using Limber-approximated projections of the non-linear matter power spectrum. The non-linear matter power spectrum is modeled with HMCode2020 (calibrated against simulations) rather than Halofit, and the analysis marginalizes over uncertainty in baryonic feedback through the HMCode2020 AGN feedback parameter $lo g_{10} (T_{AGN} / K)$ . Intrinsic alignments (IA) are treated with a new physically motivated model: the fiducial IA model (NLA- $M$ ) makes the alignment amplitude depend on both the fraction of early-type (“red”) galaxies and their host halo mass distribution. The paper also tests alternative IA models (NLA baseline, NLA- $z$ with redshift evolution, and a restricted TATT scale-dependent extension NLA- $k$ ).

Key results are reported primarily in terms of $Σ_{8} \equiv σ_{8} (Ω_{m} /0.3)^{α}$ , where $α$ is optimized to match the degeneracy direction of the KiDS dataset. The fiducial COSEBIs-based constraint is $Σ_{8} = 0.82 1_{- 0.016}^{+ 0.014}$ (with $α = 0.58$ ). In the more traditional $S_{8}$ parametrization, the fiducial COSEBIs result is $S_{8} = 0.81 5_{- 0.021}^{+ 0.016}$ (also reported as $S_{8} = 0.81 5_{- 0.021}^{+ 0.016}$ for the marginal mode and HPDI summary). The paper states that this is in agreement with Planck Legacy at the $0.73 σ$ level in $Σ_{8}$ (and discusses Hellinger-distance-based consistency metrics). Compared to previous KiDS analyses, the authors claim an improvement in constraining power of about $\sim 32%$ in $Σ_{8}$ , driven by increased survey area, deeper redshift reach, improved redshift distribution estimation, and enhanced calibration/image reduction.

The paper also quantifies robustness to modeling choices. Replacing the emulator-based non-linear power spectrum predictions with direct camb calculations changes the recovered $Σ_{8}$ by at most $0.03 σ$ . Switching off non-Gaussian covariance terms reduces the uncertainty by $12%$ to $19%$ but does not significantly shift the central value ( $Δ Σ_{8} \approx 0.02 σ$ ), indicating that non-Gaussian scales contribute meaningfully to the constraining power. Varying intrinsic alignment models produces mild shifts in $S_{8}$ and $Σ_{8}$ (maximal differences up to $0.17 σ$ for $C_{E}$ , and up to $0.54 σ$ for $E_{n}$ in some comparisons), while changing IA model choice can reduce constraining power by $\sim 12%$ to $13%$ for NLA- $z$ relative to NLA- $M$ . Redshift calibration variations generally keep $Σ_{8}$ consistent (typically $Δ Σ_{8} \leq 0.17 σ$ ), while using MICE2-calibrated $N (z)$ (less realistic at high redshift) can shift $Σ_{8}$ upward by as much as $1.25 σ$ , which the authors treat as inferior to the fiducial SKiLLS-based calibration.

A notable systematic-control component is the null testing program. The authors perform PSF modeling residual tests, PSF leakage tests, and B-mode tests. They report that PSF modeling and PSF leakage contamination are negligible relative to statistical uncertainties (typically below a $10%$ benchmark). For B modes, an astrometric issue was identified during the blinded phase, leading to additional masking of $\sim 4%$ of sources (46.6 deg $^{2}$ removed from the initial footprint). After this, the B-mode p-values across all tomographic combinations pass their threshold $p > 0.01$ for the fiducial COSEBIs-based analysis.

Finally, the paper uses the agreement with Planck to infer constraints on baryonic feedback amplitude. With cosmological parameters fixed to Planck values, they place an upper limit on baryon feedback of $lo g_{10} (T_{AGN} / K) \leq 7.71$ at 68% confidence and $\leq 8.01$ at 95% confidence. In the fiducial analysis, the MAP estimate lies at the lower prior boundary, and the marginal posterior is skewed toward negligible feedback, implying little evidence for strong baryonic suppression in the scales to which KiDS is most sensitive.

Limitations are mostly implicit in the modeling and validation choices: the analysis relies on Limber approximation (argued to be adequate because IA contributes only $\sim 10%$ of the signal and because the analysis uses $ℓ ≳ 50$ ); it uses a specific non-linear power spectrum model (HMCode2020) and a particular baryonic feedback parametrization; and it uses physically motivated IA modeling with priors informed by external direct alignment measurements. The authors also acknowledge that $ξ_{\pm}$ are less reliable due to scale-mixing and non-Gaussian likelihood behavior on large scales, and therefore do not use them as the primary inference vector. Additionally, while covariance modeling is validated against mocks to within $\sim 5%$ to $10%$ depending on scale, analytic covariance remains an approximation.

Practically, the results provide a high-robustness stage-III weak-lensing benchmark: the KiDS-Legacy dataset is reported as the most internally robust KiDS sample to date (with a companion paper providing extensive internal/external consistency tests). For cosmologists and survey teams, the main takeaway is that improved redshift calibration and survey characterization can bring cosmic shear constraints into strong consistency with Planck in $S_{8}$ / $Σ_{8}$ , reducing the impetus to invoke new physics at this stage. For future stage-IV analyses (Euclid, Rubin/LSST, Roman), this work also supplies a template for how to manage redshift systematics, IA modeling, baryonic feedback marginalization, and end-to-end null testing—especially the importance of astrometric systematics and the use of physically motivated IA models.

Overall, the paper’s core contribution is a precise, carefully validated cosmic shear constraint from the complete KiDS survey, yielding $Σ_{8} \approx 0.821$ with $\sim 0.016$ uncertainty and demonstrating $≲ 1 σ$ agreement with Planck, while showing that the remaining modeling/systematic uncertainties are controlled at the level required for stage-III cosmology and for informing stage-IV expectations.

Cornell Notes

KiDS-Legacy uses the complete KiDS-dr5 cosmic shear dataset (1347 deg $^{2}$ , six tomographic bins to $z \leq 2$ ) with improved redshift calibration, enhanced simulations, and a physically motivated intrinsic-alignment model to infer late-time structure growth. The fiducial COSEBIs-based constraint is $Σ_{8} = 0.82 1_{- 0.016}^{+ 0.014}$ ( $α = 0.58$ ), consistent with Planck at the $0.73 σ$ level, and the authors show robustness to major modeling choices via extensive null tests and analysis variations.

What cosmological tension does the paper target, and why is it important?

It targets the long-standing tendency of cosmic shear measurements to prefer lower structure-growth amplitudes than Planck, often summarized by $S_{8} = σ_{8} Ω_{m} /0.3$ . Resolving whether this is due to systematics (e.g., redshift calibration, IA, baryons) or new physics is central for interpreting current and future lensing surveys.

What is the primary dataset and survey footprint used for the cosmic shear analysis?

The complete KiDS survey (KiDS-dr5) covering 1347 deg $^{2}$ of deep nine-band imaging, plus 23 deg $^{2}$ of KiDS-like spectroscopic calibration observations (KiDZ fields). The analysis uses six tomographic bins and reaches photometric redshift $z_{B} \leq 2.0$ .

What study design and analysis pipeline are used to go from images to cosmological constraints?

A blind catalogs-to-cosmology pipeline in CosmoPipe: shear catalogues (from lensfit), spectroscopic compilation for redshift calibration (KiDZ), simulated lightcones for calibration (SKiLLS and MICE2), construction of $N (z)$ with correlated $δ z$ priors, two-point statistics (COSEBIs and band powers), analytic covariance (OneCovariance), and Bayesian inference with the nautilus sampler inside Cosmosis.

Which summary statistics are used for the fiducial cosmological inference, and why not $ξ_{\pm}$ ?

Fiducial inference uses COSEBIs $E_{n}$ and COSEBIs-derived band powers $C_{E}$ , measured over $θ \in [2, 300]$ arcmin. The authors do not use $ξ_{\pm}$ as primary because of scale-mixing, E/B-mode leakage issues with masks, and evidence that $ξ_{\pm}$ likelihood becomes non-Gaussian on large scales.

What is the fiducial cosmological constraint on the amplitude of structure growth?

Using COSEBIs $E_{n}$ , the paper reports $S_{8} = 0.81 5_{- 0.021}^{+ 0.016}$ . In the optimized degeneracy parametrization $Σ_{8} = σ_{8} (Ω_{m} /0.3)^{α}$ with $α = 0.58$ , it finds $Σ_{8} = 0.82 1_{- 0.016}^{+ 0.014}$ .

How does the result compare to Planck, quantitatively?

The authors state that KiDS-Legacy is consistent with Planck Legacy at $0.73 σ$ in $Σ_{8}$ (and discuss Hellinger-distance consistency as well).

What modeling choices are central to controlling systematics?

A new physically motivated intrinsic-alignment model (NLA- $M$ ) depending on early-type fraction and host halo mass; marginalization over baryonic feedback via HMCode2020’s AGN feedback parameter; correlated $δ z$ priors for redshift distribution biases; and inclusion of multiplicative shear calibration uncertainty.

How robust are the constraints to key modeling variations (e.g., power spectrum, covariance, IA)?

Switching emulator to direct camb changes $Σ_{8}$ by at most $0.03 σ$ . Using only Gaussian covariance reduces uncertainty by $12%$ – $19%$ with negligible shift ( $Δ Σ_{8} \approx 0.02 σ$ ). Changing IA models yields mild shifts (max $\sim 0.17 σ$ for $C_{E}$ , up to $\sim 0.54 σ$ for some $E_{n}$ comparisons) and can reduce constraining power by $\sim 12%$ – $13%$ for NLA- $z$ .

What systematic null tests were performed, and what was the outcome for B modes?

They performed PSF modeling residual tests, PSF leakage tests, and B-mode tests using COSEBIs $B_{n}$ . After identifying an astrometric issue, they masked $\sim 4%$ of sources (46.6 deg $^{2}$ ) so that B-mode p-values pass the $p > 0.01$ threshold for the fiducial analysis.

Review Questions

What is the difference between $S_{8}$ and $Σ_{8}$ , and why does the paper optimize $α$ ?
Which modeling components most directly address the dominant lensing systematics (redshift calibration, IA, baryons), and how are their uncertainties propagated?
Why does the paper treat $ξ_{\pm}$ as less reliable than COSEBIs/band powers for the primary inference?
Summarize the main numerical fiducial results ( $S_{8}$ , $Σ_{8}$ , and the Planck consistency level) and identify the main drivers of the improvement over earlier KiDS analyses.
What did the B-mode null test reveal, and how did the resulting masking decision affect the final dataset used for cosmology?

Key Points

1
KiDS-Legacy (complete KiDS-dr5) uses 1347 deg $^{2}$ of nine-band imaging and six tomographic bins to $z_{B} \leq 2.0$ , with improved redshift calibration and enhanced simulations.
2
The fiducial COSEBIs-based constraint is $S_{8} = 0.81 5_{- 0.021}^{+ 0.016}$ ; in the optimized degeneracy parametrization $Σ_{8}$ it is $Σ_{8} = 0.82 1_{- 0.016}^{+ 0.014}$ with $α = 0.58$ .
3
KiDS-Legacy finds agreement with Planck Legacy at the $0.73 σ$ level in $Σ_{8}$ , reducing the $S_{8}$ tension relative to earlier KiDS releases.
4
A new physically motivated intrinsic-alignment model (NLA- $M$ )—depending on early-type fraction and host halo mass—is a key systematic-control improvement; alternative IA models change results only mildly.
5
The analysis marginalizes over baryonic feedback using HMCode2020’s AGN feedback parameter and finds little evidence for strong feedback when cosmology is fixed to Planck (upper limit $lo g_{10} (T_{AGN} / K) \leq 7.71$ at 68%).
6
Extensive null tests show negligible PSF modeling/leakage contamination; an astrometric issue was identified via B-mode tests, leading to masking $\sim 4%$ of sources to satisfy $p > 0.01$ .
7
Robustness checks (camb vs emulator, Gaussian vs full covariance, redshift calibration variations, IA variations) show that the recovered $Σ_{8}$ is stable within $O (0.1 σ)$ for most plausible changes.

Highlights

“S8​≡σ8​Ωm​/0.3​=0.815−0.021+0.016​ is found to be in agreement (0.73σ) with results from the Planck Legacy cosmic microwave background experiment.”

“Compared to previous KiDS analyses… the increased survey area and redshift depth results in a ∼32% improvement in constraining power… where Σ8​≡σ8​(Ωm​/0.3)α=0.821−0.016+0.014​, with α=0.58.”

“We adopted a new physically motivated intrinsic alignment (IA) model that jointly depends on the galaxy sample’s halo mass and spectral type distributions… informed by previous direct alignment measurements.”

“Our fiducial $E_n$ measurement constrains S8​ … 0.815−0.021+0.016​… [and] Σ8​=0.821−0.016+0.014​.”

“With cosmological parameters fixed to those reported by Planck… we were able to place an upper limit constraint of log10​(TAGN​/K)≤7.71(8.01) at one-sided 68%(95%) confidence.”

Topics

Cosmology
Weak gravitational lensing
Cosmic shear
Large-scale structure
Photometric redshift calibration
Intrinsic alignments
Baryonic feedback modeling
Statistical inference and likelihoods
Survey systematics and null tests
Cosmological parameter estimation

Mentioned

KiDS (Kilo-Degree Survey)
VIKING (VISTA Kilo-Degree Infrared Galaxy Survey)
CosmoPipe
Cosmosis
nautilus sampler
Treecorr
lensfit
OneCovariance
HMCode2020
camb
CosmoPower
SKiLLS (simulations)
MICE2 (simulations)
GLASS (mocks)
SALMO (variable-depth mocks)
SOM (self-organising maps)
MCMC/likelihood framework (Cosmosis-based)
EuclidEmulatorv2
BACCO
FLAMINGO
BAHAMAS
GLASS
Angus H. Wright
Benjamin Stölzner
Marika Asgari
Maciej Bilicki
Benjamin Giblin
Catherine Heymans
Hendrik Hildebrandt
Henk Hoekstra
Benjamin Joachimi
Konrad Kuijken
Shun-Sheng Li
Robert Reischke
Maximilian von Wietersheim-Kramsta
Jelte de Jong
Edwin Valentijn
Nick Kaiser (dedication)
KiDS - Kilo-Degree Survey
VIKING - VISTA Kilo-Degree Infrared Galaxy Survey
NIR - Near-infrared
N(z) - Redshift distribution
CC - Cross-correlation
KiDZ - Spectroscopic calibration fields used for KiDS photo-z calibration
COSEBIs - Complete Orthogonal Sets of E/B-integrals
E/B modes - Even/odd parity decomposition of shear field
2PCF - Two-point correlation function
GGL - Galaxy-galaxy lensing
IA - Intrinsic alignments
NLA - Non-linear alignment model
TATT - Tidal alignment and tidal torquing model
NLA-z - NLA with redshift dependence
NLA-k - NLA with restricted scale dependence
NLA-M - NLA with mass and galaxy-type dependence
HMCode2020 - Halo-model-based non-linear matter power spectrum code
AGN - Active galactic nucleus
SSC - Super-sample covariance
NG - Non-Gaussian covariance term
PTE - Probability to exceed
HPDI - Highest posterior density interval
MAP - Maximum a posteriori
PJ-HPD - Projected joint highest posterior density interval
S8 - $\sigma_8\sqrt{\Omega_m/0.3}$ amplitude combination
Sigma8 - $\sigma_8(\Omega_m/0.3)^\alpha$ optimized degeneracy combination
camb - Code for Anisotropies in the Microwave Background
CosmoPower - Neural-network emulator for power spectra
GLASS - Generator for Large-Scale Structure (mock generator)
SALMO - Speedy Acquisition for Lensing and Matter Observables
SOM - Self-organising map
SKiLLS - Multi-colour image simulations for KiDS
MICE2 - Simulation suite for lensing calibration
E_n/B_n - COSEBIs mode amplitudes
E_n - COSEBIs E-mode statistic
C_E - E-mode band powers
$\xi_\pm$ - Shear two-point correlation functions
$\ell$ - Angular multipole
$k$ - Wavenumber
$\delta z$ - Redshift bias parameters
$m$ - Multiplicative shear calibration bias