Uncertainty-Resilient Multimodal Learning via Consistency-Guided Cross-Modal Transfer

2511.15741v1 cs.AI, cs.HC, cs.LG 2025-11-21
Authors:

Hyo-Jeong Jang

Abstract

Multimodal learning systems often face substantial uncertainty due to noisy data, low-quality labels, and heterogeneous modality characteristics. These issues become especially critical in human-computer interaction settings, where data quality, semantic reliability, and annotation consistency vary across users and recording conditions. This thesis tackles these challenges by exploring uncertainty-resilient multimodal learning through consistency-guided cross-modal transfer. The central idea is to use cross-modal semantic consistency as a basis for robust representation learning. By projecting heterogeneous modalities into a shared latent space, the proposed framework mitigates modality gaps and uncovers structural relations that support uncertainty estimation and stable feature learning. Building on this foundation, the thesis investigates strategies to enhance semantic robustness, improve data efficiency, and reduce the impact of noise and imperfect supervision without relying on large, high-quality annotations. Experiments on multimodal affect-recognition benchmarks demonstrate that consistency-guided cross-modal transfer significantly improves model stability, discriminative ability, and robustness to noisy or incomplete supervision. Latent space analyses further show that the framework captures reliable cross-modal structure even under challenging conditions. Overall, this thesis offers a unified perspective on resilient multimodal learning by integrating uncertainty modeling, semantic alignment, and data-efficient supervision, providing practical insights for developing reliable and adaptive brain-computer interface systems.
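The abstract does not specify how cross-modal consistency is computed, so the following is only a minimal sketch of the general idea, not the thesis's method. It assumes linear projections into a shared latent space and uses cosine similarity between the two projected views as a per-sample consistency score, which could then down-weight samples whose modalities disagree. All names (`project`, `cosine`, `consistency_weight`, `W_A`, `W_B`) and the toy data are hypothetical.

```python
import math

def project(x, weights):
    """Linear projection; `weights` holds one row per latent dimension."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]

def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    norm_a = math.sqrt(sum(v * v for v in a))
    norm_b = math.sqrt(sum(v * v for v in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(p * q for p, q in zip(a, b)) / (norm_a * norm_b)

def consistency_weight(x_a, x_b, w_a, w_b):
    """Map cross-modal agreement in the shared space to a weight in [0, 1]."""
    z_a = project(x_a, w_a)  # modality A in the shared space
    z_b = project(x_b, w_b)  # modality B in the shared space
    return 0.5 * (1.0 + cosine(z_a, z_b))

# Hypothetical toy setup (an assumption, not from the source):
# modality A has 3 features, modality B has 2, shared space has 2 dimensions.
W_A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
W_B = [[1.0, 0.0], [0.0, 1.0]]

# Views that point the same way in the shared space get a weight near 1.
agreeing = consistency_weight([1.0, 2.0, 5.0], [2.0, 4.0], W_A, W_B)
# Views that point in opposite directions get a weight near 0.
conflicting = consistency_weight([1.0, 2.0, 5.0], [-1.0, -2.0], W_A, W_B)
```

In a training loop, a weight like this could scale each sample's loss so that samples with unreliable labels or noisy modalities contribute less; in practice the projections would be learned rather than fixed.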
