iLTM: Integrated Large Tabular Model

2511.15941v1 cs.LG, cs.AI 2025-11-21

Authors:

David Bonet, Marçal Comajoan Cara, Alvaro Calafell, Daniel Mas Montserrat, Alexander G. Ioannidis

Abstract

Tabular data underpins decisions across science, industry, and public services. Despite rapid progress, advances in deep learning have not fully carried over to the tabular domain, where gradient-boosted decision trees (GBDTs) remain a default choice in practice. We present iLTM, an integrated Large Tabular Model that unifies tree-derived embeddings, dimensionality-agnostic representations, a meta-trained hypernetwork, multilayer perceptrons (MLPs), and retrieval within a single architecture. Pretrained on more than 1,800 heterogeneous classification datasets, iLTM achieves consistently superior performance across tabular classification and regression tasks, from small datasets to large and high-dimensional tasks. After light fine-tuning, the meta-trained hypernetwork transfers to regression targets, matching or surpassing strong baselines. Extensive experiments show that iLTM outperforms well-tuned GBDTs and leading deep tabular models while requiring less task-specific tuning. By bridging the gap between tree-based and neural methods, iLTM offers a new framework for tabular foundation models for robust, adaptable, and scalable tabular learning.
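The abstract names tree-derived embeddings and retrieval as components but does not say how either is computed. As a rough illustration of the general idea only, and not of iLTM's actual method, the sketch below one-hot encodes the leaf each tree routes a row to, then retrieves the nearest training rows in that embedding space. The stumps are hand-written, hypothetical stand-ins for splits that would normally come from a fitted GBDT, and the names `leaf_embedding` and `retrieve` are likewise assumptions made for this example.

```python
from typing import List, Sequence, Tuple

# Hypothetical hand-written stumps: (feature index, threshold).
# Assumption: a real pipeline would take these splits from a fitted GBDT.
Stump = Tuple[int, float]


def leaf_embedding(row: Sequence[float], stumps: Sequence[Stump]) -> List[int]:
    """One-hot encode the leaf each stump routes `row` to (two leaves per stump)."""
    emb: List[int] = []
    for feature, threshold in stumps:
        goes_left = row[feature] <= threshold
        emb.extend([1, 0] if goes_left else [0, 1])
    return emb


def retrieve(query: Sequence[int], bank: Sequence[Sequence[int]], k: int) -> List[int]:
    """Return indices of the k bank embeddings closest to `query` in Hamming distance."""
    def hamming(a: Sequence[int], b: Sequence[int]) -> int:
        return sum(x != y for x, y in zip(a, b))

    # sorted() is stable, so ties keep their original bank order.
    order = sorted(range(len(bank)), key=lambda i: hamming(query, bank[i]))
    return order[:k]


stumps = [(0, 0.5), (1, 2.0)]
train_rows = [[0.1, 1.0], [0.9, 3.0], [0.2, 2.5]]

# Embed the training rows once, then embed a query row and look up neighbors.
bank = [leaf_embedding(r, stumps) for r in train_rows]
query = leaf_embedding([0.3, 1.5], stumps)
neighbors = retrieve(query, bank, k=2)
```

In this toy setup the retrieved neighbors could be passed to a downstream predictor as extra context; how iLTM combines retrieval with its hypernetwork-generated MLPs is not specified in the abstract.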


