Platonic Transformers: A Solid Choice For Equivariance

2510.03511v1 cs.CV, cs.AI, cs.LG, eess.IV 2025-10-08

Авторы:

Mohammad Mohaiminul Islam, Rishabh Anand, David R. Wessels, Friso de Kruiff, Thijs P. Kuipers, Rex Ying, Clara I. Sánchez, Sharvaree Vadgama, Georg Bökman, Erik J. Bekkers

Abstract

While widespread, Transformers lack inductive biases for geometric symmetries common in science and computer vision. Existing equivariant methods often sacrifice the efficiency and flexibility that make Transformers so effective through complex, computationally intensive designs. We introduce the Platonic Transformer to resolve this trade-off. By defining attention relative to reference frames from the Platonic solid symmetry groups, our method induces a principled weight-sharing scheme. This enables combined equivariance to continuous translations and Platonic symmetries, while preserving the exact architecture and computational cost of a standard Transformer. Furthermore, we show that this attention is formally equivalent to a dynamic group convolution, which reveals that the model learns adaptive geometric filters and enables a highly scalable, linear-time convolutional variant. Across diverse benchmarks in computer vision (CIFAR-10), 3D point clouds (ScanObjectNN), and molecular property prediction (QM9, OMol25), the Platonic Transformer achieves competitive performance by leveraging these geometric constraints at no additional cost.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Platonic Transformers: A Solid Choice For Equivariance

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Efficient Edge-Compatible CNN for Speckle-Based Material Recognition in Laser Cu...

MODEST: Multi-Optics Depth-of-Field Stereo Dataset

Cross-Domain Generalization of Multimodal LLMs for Global Photovoltaic Assessmen...

TriggerNet: A Novel Explainable AI Framework for Red Palm Mite Detection and Mul...

SPEGNet: Synergistic Perception-Guided Network for Camouflaged Object Detection

Навигация