Efficient Generalization via Multimodal Co-Training under Data Scarcity and Distribution Shift

2510.07509v1 cs.LG, cs.IT, math.IT 2025-10-11

Авторы:

Tianyu Bell Pan, Damon L. Woodard

Abstract

This paper explores a multimodal co-training framework designed to enhance model generalization in situations where labeled data is limited and distribution shifts occur. We thoroughly examine the theoretical foundations of this framework, deriving conditions under which the use of unlabeled data and the promotion of agreement between classifiers for different modalities lead to significant improvements in generalization. We also present a convergence analysis that confirms the effectiveness of iterative co-training in reducing classification errors. In addition, we establish a novel generalization bound that, for the first time in a multimodal co-training context, decomposes and quantifies the distinct advantages gained from leveraging unlabeled multimodal data, promoting inter-view agreement, and maintaining conditional view independence. Our findings highlight the practical benefits of multimodal co-training as a structured approach to developing data-efficient and robust AI systems that can effectively generalize in dynamic, real-world environments. The theoretical foundations are examined in dialogue with, and in advance of, established co-training principles.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Efficient Generalization via Multimodal Co-Training under Data Scarcity and Distribution Shift

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Complexity as Advantage: A Regret-Based Perspective on Emergent Structure

An Efficient Classification Model for Cyber Text

Measuring the Intrinsic Dimension of Earth Representations

Optimal Information Combining for Multi-Agent Systems Using Adaptive Bias Learni...

Transformers Provably Learn Directed Acyclic Graphs via Kernel-Guided Mutual Inf...

Навигация