Graph Neural Networks for Binding Affinity Prediction
Based on Alex, PhD AI's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Briefing
Binding affinity—the strength of a ligand’s interaction with a target biomolecule—is a central metric in early drug discovery because it helps rank candidate “hits” and guide designs that bind tightly to the intended target while avoiding off-target binding that can trigger side effects. Experimentally measuring binding affinity (often via the equilibrium inhibition constant, Ki) is accurate but costly in time, money, and labor. That bottleneck drives virtual screening: computational methods that sift through large libraries to identify molecules most likely to bind before expensive lab testing.
Virtual screening splits into two broad camps. Ligand-based approaches start from known active compounds and build models (such as pharmacophore features) that encode where key chemical interactions occur—hydrophobic regions, hydrogen bond acceptors/donors, and molecular shape—often using static constraints like exclusion volumes. Structure-based approaches instead assume a 3D receptor model is available and search over ligand structures to maximize predicted binding. However, conventional structure-based methods can struggle to reliably separate active from inactive ligands; an example involving thrombin ligands reportedly failed to distinguish high-affinity binders (low Ki) from poor binders, suggesting that more than standard docking-style signals may be needed.
Graph neural networks (GNNs) are presented as a newer, resource-efficient form of virtual screening that can improve accuracy and prediction speed, but only after careful parameterization of both ligand and receptor. Ligands are converted into molecular graphs where atoms become nodes with feature vectors (e.g., neighbor counts, hydrogens, formal charge) and bonds become edges with categorical or numeric descriptors (aromatic, conjugated, ring membership, and which atoms they connect). Receptors—proteins or polynucleotides—are handled by building graph representations from structural information. One common route uses adjacency matrices derived from 3D coordinates, where edges can be undirected (mirrored distances), directed (non-mirrored), or weighted (encoding bond/contact strength or distance). Other approaches avoid full coordinate dependence by predicting secondary structure first, then extracting contact maps and converting them into protein graphs with featurized amino-acid attributes like charge and functional groups.
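The two parameterizations above can be sketched in a few lines. This is a toy illustration, not any specific toolkit's API: the atom and bond feature values are made up for a small chain-like molecule, and the receptor adjacency is built by connecting points whose 3D coordinates fall within an assumed 5.0 Å cutoff, giving the undirected (mirrored) case described above.

```python
import numpy as np

# Toy ligand graph: nodes are atoms carrying simple feature vectors
# [heavy-atom neighbor count, attached hydrogens, formal charge].
# Values are illustrative, not derived from a real molecule.
atom_features = {
    0: [1, 3, 0],   # e.g. a terminal methyl carbon
    1: [2, 2, 0],   # e.g. a CH2 carbon
    2: [1, 1, 0],   # e.g. a hydroxyl oxygen
}
# Edges are bonds with categorical descriptors plus the atoms they connect.
bond_features = [
    {"atoms": (0, 1), "aromatic": False, "conjugated": False, "in_ring": False},
    {"atoms": (1, 2), "aromatic": False, "conjugated": False, "in_ring": False},
]

# Receptor graph from 3D coordinates: connect nodes whose representative
# atoms lie within a distance cutoff (5.0 Å assumed here), producing a
# symmetric adjacency matrix, i.e. an undirected graph.
coords = np.array([[0.0, 0.0, 0.0],
                   [3.0, 0.0, 0.0],
                   [3.0, 4.0, 0.0],
                   [10.0, 10.0, 10.0]])
dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
adjacency = ((dist < 5.0) & (dist > 0.0)).astype(int)
```

A directed variant would drop the symmetry (keep only non-mirrored entries), and a weighted variant would store the distances or contact strengths themselves in place of the 0/1 entries.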
Once ligand and receptor are both graphs, the GNN processes node and edge features through shared hidden layers, repeatedly aggregating information from neighboring nodes and edges up to a chosen depth. This recursive message passing builds graph-level embeddings in a fixed-size vector space, enabling a downstream dense layer to predict binding affinity for a ligand–receptor pair. Architecturally, the discussion contrasts recurrent-style GNNs (same weights reused until convergence) with convolutional-style GNNs (different weights per iteration).
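A minimal sketch of that pipeline, with random toy features and weights standing in for learned parameters: each iteration averages a node's features with its neighbors' and applies that iteration's weight matrix (the convolutional style; a recurrent-style GNN would reuse one matrix until convergence), then mean-pooling yields the fixed-size graph embedding that a dense readout maps to a scalar affinity.

```python
import numpy as np

rng = np.random.default_rng(0)

def message_passing(node_feats, adjacency, weights, depth):
    """Convolutional-style message passing: at each iteration a node
    aggregates its neighbors' features (mean), combines them with its
    own, and applies that iteration's own weight matrix."""
    h = node_feats
    deg = adjacency.sum(axis=1, keepdims=True).clip(min=1)
    for W in weights[:depth]:
        neighbor_mean = adjacency @ h / deg       # aggregate from neighbors
        h = np.tanh((h + neighbor_mean) @ W)      # transform per iteration
    return h.mean(axis=0)                         # pool to a fixed-size embedding

# Toy graph: 4 nodes with 3 features each, chain connectivity 0-1-2-3.
node_feats = rng.normal(size=(4, 3))
adjacency = np.array([[0, 1, 0, 0],
                      [1, 0, 1, 0],
                      [0, 1, 0, 1],
                      [0, 0, 1, 0]])
weights = [rng.normal(size=(3, 3)) for _ in range(2)]  # one matrix per depth

embedding = message_passing(node_feats, adjacency, weights, depth=2)

# Downstream dense layer: a single linear readout to a scalar affinity.
readout_w = rng.normal(size=3)
predicted_affinity = float(embedding @ readout_w)
```

In a real model the weight matrices and readout would be trained on measured affinities (e.g. Ki values), and the ligand and receptor graphs would typically be embedded and combined before the readout; the sketch only shows the single-graph mechanics.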
The practical payoff is framed in early discovery terms: target interaction prediction is typically slow and expensive, yet computational screening can cut overall timelines and costs substantially. GNN-based affinity prediction is described as fast—milliseconds per ligand–receptor pair—and as extending docking capabilities by accepting receptor structures without coordinates, broadening what can be modeled when structural data is incomplete.
Cornell Notes
Binding affinity (often measured by Ki) determines how strongly a ligand binds to a target and therefore drives hit ranking and selective drug design. Because experimental assays are expensive, virtual screening uses computational methods to narrow candidate molecules before lab testing. Traditional ligand-based and structure-based virtual screening can fail to reliably separate active from inactive compounds, motivating graph neural networks. GNNs represent ligands and receptors as graphs: atoms and bonds become node/edge features for ligands, while proteins can be encoded via adjacency matrices from coordinates or via contact maps derived from predicted secondary structure. Message passing over these graphs produces embeddings that a neural network maps to binding affinity quickly, enabling high-throughput prediction.
- Why does Ki matter so much in binding affinity prediction?
- What distinguishes ligand-based from structure-based virtual screening?
- Why might conventional virtual screening struggle to separate active and inactive ligands?
- How are ligands parameterized for a graph neural network?
- What are common ways to parameterize receptors as graphs?
- How does a GNN turn ligand–receptor graphs into an affinity prediction?
Review Questions
- How does the choice of receptor graph construction method (coordinate-based adjacency vs contact-map-derived graph) change what structural information the model requires?
- Explain how node and edge feature design for ligands (atoms vs bonds) influences the information a GNN can learn for binding affinity.
- What architectural difference between recurrent and convolutional GNNs affects how weights are applied during message passing?
Key Points
- 1
Binding affinity is commonly quantified by Ki, where smaller Ki indicates stronger ligand–target binding.
- 2
Virtual screening reduces experimental workload by computationally selecting molecules likely to bind before lab assays.
- 3
Ligand-based virtual screening uses known actives to build pharmacophore-like interaction and shape models, while structure-based methods rely on 3D receptor geometry.
- 4
Conventional virtual screening can fail to reliably separate active from inactive ligands for certain targets, motivating newer approaches.
- 5
Graph neural networks require careful ligand and receptor parameterization into graphs with meaningful node and edge features.
- 6
Ligands are represented as molecular graphs with atom features (e.g., formal charge, hydrogens) and bond features (e.g., aromatic/conjugated/ring membership).
- 7
Receptors can be encoded via adjacency matrices from coordinates or via predicted secondary structure and contact maps, enabling predictions even without coordinates.