GraphCliff: Short-Long Range Gating for Subtle Differences but Critical Changes
2511.03170v1
cs.CE, cs.AI
2025-11-07
Авторы:
Hajung Kim, Jueon Park, Junseok Choe, Sheunheun Baek, Hyeon Hwang, Jaewoo Kang
Abstract
Quantitative structure-activity relationship assumes a smooth relationship
between molecular structure and biological activity. However, activity cliffs
defined as pairs of structurally similar compounds with large potency
differences break this continuity. Recent benchmarks targeting activity cliffs
have revealed that classical machine learning models with extended connectivity
fingerprints outperform graph neural networks. Our analysis shows that graph
embeddings fail to adequately separate structurally similar molecules in the
embedding space, making it difficult to distinguish between structurally
similar but functionally different molecules. Despite this limitation,
molecular graph structures are inherently expressive and attractive, as they
preserve molecular topology. To preserve the structural representation of
molecules as graphs, we propose a new model, GraphCliff, which integrates
short- and long-range information through a gating mechanism. Experimental
results demonstrate that GraphCliff consistently improves performance on both
non-cliff and cliff compounds. Furthermore, layer-wise node embedding analyses
reveal reduced over-smoothing and enhanced discriminative power relative to
strong baseline graph models.
Ссылки и действия
Дополнительные ресурсы: