MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention

2512.01738v1 cs.LG 2025-12-04

Авторы:

Pedro M. P. Curvo, Jan-Willem van de Meent, Maksim Zhdanov

Abstract

A key scalability challenge in neural solvers for industrial-scale physics simulations is efficiently capturing both fine-grained local interactions and long-range global dependencies across millions of spatial elements. We introduce the Multi-Scale Patch Transformer (MSPT), an architecture that combines local point attention within patches with global attention to coarse patch-level representations. To partition the input domain into spatially-coherent patches, we employ ball trees, which handle irregular geometries efficiently. This dual-scale design enables MSPT to scale to millions of points on a single GPU. We validate our method on standard PDE benchmarks (elasticity, plasticity, fluid dynamics, porous flow) and large-scale aerodynamic datasets (ShapeNet-Car, Ahmed-ML), achieving state-of-the-art accuracy with substantially lower memory footprint and computational cost.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention

Авторы:

Abstract

Ссылки и действия

Связанные статьи

QoSDiff: An Implicit Topological Embedding Learning Framework Leveraging Denoisi...

Coefficient of Variation Masking: A Volatility-Aware Strategy for EHR Foundation...

Variance Matters: Improving Domain Adaptation via Stratified Sampling

Mitigating the Antigenic Data Bottleneck: Semi-supervised Learning with Protein ...

Rethinking Tokenization for Clinical Time Series: When Less is More

Навигация