Get AI summaries of any video or article — Sign up free

KiDS-Legacy: Cosmological constraints from cosmic shear with the complete Kilo-Degree Survey

Angus H. Wright, Benjamin Stölzner, Marika Asgari, Maciej Bilicki, Benjamin Giblin, Catherine Heymans, H. Hildebrandt, Henk Hoekstra, Benjamin Joachimi, Konrad Kuijken, +29 more
8 min read

Read the full paper at DOI or on arxiv

TL;DR

KiDS-Legacy (complete KiDS-dr5) uses 1347 deg of nine-band imaging and six tomographic bins to , with improved redshift calibration and enhanced simulations.

Briefing

This paper asks how well the completed Kilo-Degree Survey (KiDS) can constrain late-time structure growth and cosmology using cosmic shear, and—crucially—whether the resulting constraints remain consistent with the high-redshift expectations from the Planck CMB. This matters because weak-lensing measurements of the clustering amplitude have, for about a decade, tended to yield lower values of the commonly used amplitude combination than Planck, contributing to the so-called tension. Resolving whether this is a statistical fluctuation, a modeling/systematic issue (e.g., photometric redshift calibration, intrinsic alignments, baryonic feedback, or shear calibration), or a genuine cosmological discrepancy is central to the field’s transition from “stage-III” to “stage-IV” lensing surveys.

The authors present the KiDS-Legacy cosmic shear analysis using the final KiDS data release (KiDS-dr5). The completed KiDS footprint covers 1347 deg of deep nine-band imaging (optical plus NIR) and includes an additional 23 deg of KiDS-like calibration observations from deep spectroscopic surveys. The analysis extends the lensed galaxy sample to photometric redshift by improving the redshift distribution estimation methodology and by using enhanced calibration data and multi-band image simulations.

Methodologically, the study follows a standard catalogs-to-cosmology pipeline implemented in the CosmoPipe framework. The core observable is the cosmic shear two-point signal, but the paper emphasizes that different summary statistics weight angular scales differently and can therefore respond differently to systematics. The authors measure two-point correlation functions but do not use them as the primary cosmological data vector due to concerns about scale-mixing and E/B-mode leakage in the presence of masks. Instead, the fiducial cosmological inference uses COSEBIs and band powers , both constructed from linear combinations of finely binned over arcmin. They use six tomographic bins (with effective number density arcmin after calibration and masking) and include multiplicative shear calibration uncertainty via marginalization. The analysis uses analytic covariance modeling with the OneCovariance code, including Gaussian sample-variance and shot-noise terms, non-Gaussian contributions, and super-sample covariance; covariance predictions are validated against GLASS mocks.

For theoretical modeling, the paper computes shear power spectra using Limber-approximated projections of the non-linear matter power spectrum. The non-linear matter power spectrum is modeled with HMCode2020 (calibrated against simulations) rather than Halofit, and the analysis marginalizes over uncertainty in baryonic feedback through the HMCode2020 AGN feedback parameter . Intrinsic alignments (IA) are treated with a new physically motivated model: the fiducial IA model (NLA-) makes the alignment amplitude depend on both the fraction of early-type (“red”) galaxies and their host halo mass distribution. The paper also tests alternative IA models (NLA baseline, NLA- with redshift evolution, and a restricted TATT scale-dependent extension NLA-).

Key results are reported primarily in terms of , where is optimized to match the degeneracy direction of the KiDS dataset. The fiducial COSEBIs-based constraint is (with ). In the more traditional parametrization, the fiducial COSEBIs result is (also reported as for the marginal mode and HPDI summary). The paper states that this is in agreement with Planck Legacy at the level in (and discusses Hellinger-distance-based consistency metrics). Compared to previous KiDS analyses, the authors claim an improvement in constraining power of about in , driven by increased survey area, deeper redshift reach, improved redshift distribution estimation, and enhanced calibration/image reduction.

The paper also quantifies robustness to modeling choices. Replacing the emulator-based non-linear power spectrum predictions with direct camb calculations changes the recovered by at most . Switching off non-Gaussian covariance terms reduces the uncertainty by to but does not significantly shift the central value (), indicating that non-Gaussian scales contribute meaningfully to the constraining power. Varying intrinsic alignment models produces mild shifts in and (maximal differences up to for , and up to for in some comparisons), while changing IA model choice can reduce constraining power by to for NLA- relative to NLA-. Redshift calibration variations generally keep consistent (typically ), while using MICE2-calibrated (less realistic at high redshift) can shift upward by as much as , which the authors treat as inferior to the fiducial SKiLLS-based calibration.

A notable systematic-control component is the null testing program. The authors perform PSF modeling residual tests, PSF leakage tests, and B-mode tests. They report that PSF modeling and PSF leakage contamination are negligible relative to statistical uncertainties (typically below a benchmark). For B modes, an astrometric issue was identified during the blinded phase, leading to additional masking of of sources (46.6 deg removed from the initial footprint). After this, the B-mode p-values across all tomographic combinations pass their threshold for the fiducial COSEBIs-based analysis.

Finally, the paper uses the agreement with Planck to infer constraints on baryonic feedback amplitude. With cosmological parameters fixed to Planck values, they place an upper limit on baryon feedback of at 68% confidence and at 95% confidence. In the fiducial analysis, the MAP estimate lies at the lower prior boundary, and the marginal posterior is skewed toward negligible feedback, implying little evidence for strong baryonic suppression in the scales to which KiDS is most sensitive.

Limitations are mostly implicit in the modeling and validation choices: the analysis relies on Limber approximation (argued to be adequate because IA contributes only of the signal and because the analysis uses ); it uses a specific non-linear power spectrum model (HMCode2020) and a particular baryonic feedback parametrization; and it uses physically motivated IA modeling with priors informed by external direct alignment measurements. The authors also acknowledge that are less reliable due to scale-mixing and non-Gaussian likelihood behavior on large scales, and therefore do not use them as the primary inference vector. Additionally, while covariance modeling is validated against mocks to within to depending on scale, analytic covariance remains an approximation.

Practically, the results provide a high-robustness stage-III weak-lensing benchmark: the KiDS-Legacy dataset is reported as the most internally robust KiDS sample to date (with a companion paper providing extensive internal/external consistency tests). For cosmologists and survey teams, the main takeaway is that improved redshift calibration and survey characterization can bring cosmic shear constraints into strong consistency with Planck in /, reducing the impetus to invoke new physics at this stage. For future stage-IV analyses (Euclid, Rubin/LSST, Roman), this work also supplies a template for how to manage redshift systematics, IA modeling, baryonic feedback marginalization, and end-to-end null testing—especially the importance of astrometric systematics and the use of physically motivated IA models.

Overall, the paper’s core contribution is a precise, carefully validated cosmic shear constraint from the complete KiDS survey, yielding with uncertainty and demonstrating agreement with Planck, while showing that the remaining modeling/systematic uncertainties are controlled at the level required for stage-III cosmology and for informing stage-IV expectations.

Cornell Notes

KiDS-Legacy uses the complete KiDS-dr5 cosmic shear dataset (1347 deg, six tomographic bins to ) with improved redshift calibration, enhanced simulations, and a physically motivated intrinsic-alignment model to infer late-time structure growth. The fiducial COSEBIs-based constraint is (), consistent with Planck at the level, and the authors show robustness to major modeling choices via extensive null tests and analysis variations.

What cosmological tension does the paper target, and why is it important?

It targets the long-standing tendency of cosmic shear measurements to prefer lower structure-growth amplitudes than Planck, often summarized by . Resolving whether this is due to systematics (e.g., redshift calibration, IA, baryons) or new physics is central for interpreting current and future lensing surveys.

What is the primary dataset and survey footprint used for the cosmic shear analysis?

The complete KiDS survey (KiDS-dr5) covering 1347 deg of deep nine-band imaging, plus 23 deg of KiDS-like spectroscopic calibration observations (KiDZ fields). The analysis uses six tomographic bins and reaches photometric redshift .

What study design and analysis pipeline are used to go from images to cosmological constraints?

A blind catalogs-to-cosmology pipeline in CosmoPipe: shear catalogues (from lensfit), spectroscopic compilation for redshift calibration (KiDZ), simulated lightcones for calibration (SKiLLS and MICE2), construction of with correlated priors, two-point statistics (COSEBIs and band powers), analytic covariance (OneCovariance), and Bayesian inference with the nautilus sampler inside Cosmosis.

Which summary statistics are used for the fiducial cosmological inference, and why not ?

Fiducial inference uses COSEBIs and COSEBIs-derived band powers , measured over arcmin. The authors do not use as primary because of scale-mixing, E/B-mode leakage issues with masks, and evidence that likelihood becomes non-Gaussian on large scales.

What is the fiducial cosmological constraint on the amplitude of structure growth?

Using COSEBIs , the paper reports . In the optimized degeneracy parametrization with , it finds .

How does the result compare to Planck, quantitatively?

The authors state that KiDS-Legacy is consistent with Planck Legacy at in (and discuss Hellinger-distance consistency as well).

What modeling choices are central to controlling systematics?

A new physically motivated intrinsic-alignment model (NLA-) depending on early-type fraction and host halo mass; marginalization over baryonic feedback via HMCode2020’s AGN feedback parameter; correlated priors for redshift distribution biases; and inclusion of multiplicative shear calibration uncertainty.

How robust are the constraints to key modeling variations (e.g., power spectrum, covariance, IA)?

Switching emulator to direct camb changes by at most . Using only Gaussian covariance reduces uncertainty by with negligible shift (). Changing IA models yields mild shifts (max for , up to for some comparisons) and can reduce constraining power by for NLA-.

What systematic null tests were performed, and what was the outcome for B modes?

They performed PSF modeling residual tests, PSF leakage tests, and B-mode tests using COSEBIs . After identifying an astrometric issue, they masked of sources (46.6 deg) so that B-mode p-values pass the threshold for the fiducial analysis.

Review Questions

  1. What is the difference between and , and why does the paper optimize ?

  2. Which modeling components most directly address the dominant lensing systematics (redshift calibration, IA, baryons), and how are their uncertainties propagated?

  3. Why does the paper treat as less reliable than COSEBIs/band powers for the primary inference?

  4. Summarize the main numerical fiducial results (, , and the Planck consistency level) and identify the main drivers of the improvement over earlier KiDS analyses.

  5. What did the B-mode null test reveal, and how did the resulting masking decision affect the final dataset used for cosmology?

Key Points

  1. 1

    KiDS-Legacy (complete KiDS-dr5) uses 1347 deg of nine-band imaging and six tomographic bins to , with improved redshift calibration and enhanced simulations.

  2. 2

    The fiducial COSEBIs-based constraint is ; in the optimized degeneracy parametrization it is with .

  3. 3

    KiDS-Legacy finds agreement with Planck Legacy at the level in , reducing the tension relative to earlier KiDS releases.

  4. 4

    A new physically motivated intrinsic-alignment model (NLA-)—depending on early-type fraction and host halo mass—is a key systematic-control improvement; alternative IA models change results only mildly.

  5. 5

    The analysis marginalizes over baryonic feedback using HMCode2020’s AGN feedback parameter and finds little evidence for strong feedback when cosmology is fixed to Planck (upper limit at 68%).

  6. 6

    Extensive null tests show negligible PSF modeling/leakage contamination; an astrometric issue was identified via B-mode tests, leading to masking of sources to satisfy .

  7. 7

    Robustness checks (camb vs emulator, Gaussian vs full covariance, redshift calibration variations, IA variations) show that the recovered is stable within for most plausible changes.

Highlights

is found to be in agreement () with results from the Planck Legacy cosmic microwave background experiment.”
“Compared to previous KiDS analyses… the increased survey area and redshift depth results in a improvement in constraining power… where , with .”
“We adopted a new physically motivated intrinsic alignment (IA) model that jointly depends on the galaxy sample’s halo mass and spectral type distributions… informed by previous direct alignment measurements.”
“Our fiducial $E_n$ measurement constrains … [and] .”
“With cosmological parameters fixed to those reported by Planck… we were able to place an upper limit constraint of at one-sided confidence.”

Topics

  • Cosmology
  • Weak gravitational lensing
  • Cosmic shear
  • Large-scale structure
  • Photometric redshift calibration
  • Intrinsic alignments
  • Baryonic feedback modeling
  • Statistical inference and likelihoods
  • Survey systematics and null tests
  • Cosmological parameter estimation

Mentioned

  • KiDS (Kilo-Degree Survey)
  • VIKING (VISTA Kilo-Degree Infrared Galaxy Survey)
  • CosmoPipe
  • Cosmosis
  • nautilus sampler
  • Treecorr
  • lensfit
  • OneCovariance
  • HMCode2020
  • camb
  • CosmoPower
  • SKiLLS (simulations)
  • MICE2 (simulations)
  • GLASS (mocks)
  • SALMO (variable-depth mocks)
  • SOM (self-organising maps)
  • MCMC/likelihood framework (Cosmosis-based)
  • EuclidEmulatorv2
  • BACCO
  • FLAMINGO
  • BAHAMAS
  • GLASS
  • Angus H. Wright
  • Benjamin Stölzner
  • Marika Asgari
  • Maciej Bilicki
  • Benjamin Giblin
  • Catherine Heymans
  • Hendrik Hildebrandt
  • Henk Hoekstra
  • Benjamin Joachimi
  • Konrad Kuijken
  • Shun-Sheng Li
  • Robert Reischke
  • Maximilian von Wietersheim-Kramsta
  • Jelte de Jong
  • Edwin Valentijn
  • Nick Kaiser (dedication)
  • KiDS - Kilo-Degree Survey
  • VIKING - VISTA Kilo-Degree Infrared Galaxy Survey
  • NIR - Near-infrared
  • N(z) - Redshift distribution
  • CC - Cross-correlation
  • KiDZ - Spectroscopic calibration fields used for KiDS photo-z calibration
  • COSEBIs - Complete Orthogonal Sets of E/B-integrals
  • E/B modes - Even/odd parity decomposition of shear field
  • 2PCF - Two-point correlation function
  • GGL - Galaxy-galaxy lensing
  • IA - Intrinsic alignments
  • NLA - Non-linear alignment model
  • TATT - Tidal alignment and tidal torquing model
  • NLA-z - NLA with redshift dependence
  • NLA-k - NLA with restricted scale dependence
  • NLA-M - NLA with mass and galaxy-type dependence
  • HMCode2020 - Halo-model-based non-linear matter power spectrum code
  • AGN - Active galactic nucleus
  • SSC - Super-sample covariance
  • NG - Non-Gaussian covariance term
  • PTE - Probability to exceed
  • HPDI - Highest posterior density interval
  • MAP - Maximum a posteriori
  • PJ-HPD - Projected joint highest posterior density interval
  • S8 - \(\sigma_8\sqrt{\Omega_m/0.3}\) amplitude combination
  • Sigma8 - \(\sigma_8(\Omega_m/0.3)^\alpha\) optimized degeneracy combination
  • camb - Code for Anisotropies in the Microwave Background
  • CosmoPower - Neural-network emulator for power spectra
  • GLASS - Generator for Large-Scale Structure (mock generator)
  • SALMO - Speedy Acquisition for Lensing and Matter Observables
  • SOM - Self-organising map
  • SKiLLS - Multi-colour image simulations for KiDS
  • MICE2 - Simulation suite for lensing calibration
  • E_n/B_n - COSEBIs mode amplitudes
  • E_n - COSEBIs E-mode statistic
  • C_E - E-mode band powers
  • \(\xi_\pm\) - Shear two-point correlation functions
  • \(\ell\) - Angular multipole
  • \(k\) - Wavenumber
  • \(\delta z\) - Redshift bias parameters
  • \(m\) - Multiplicative shear calibration bias