BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning

2510.01428v1 q-bio.QM, cs.AI 2025-10-04

Authors:

Ching-Huei Tsou, Michal Ozery-Flato, Ella Barkan, Diwakar Mahajan, Ben Shapira

Abstract

Recent advances in large language models (LLMs) and biomedical foundation models (BioFMs) have achieved strong results in biological text reasoning, molecular modeling, and single-cell analysis, yet they remain siloed in disjoint embedding spaces, limiting cross-modal reasoning. We present BIOVERSE (Biomedical Vector Embedding Realignment for Semantic Engagement), a two-stage approach that adapts pretrained BioFMs as modality encoders and aligns them with LLMs through lightweight, modality-specific projection layers. The approach first aligns each modality to a shared LLM space through independently trained projections, allowing them to interoperate naturally, and then applies standard instruction tuning with multi-modal data to bring them together for downstream reasoning. By unifying raw biomedical data with knowledge embedded in LLMs, the approach enables zero-shot annotation, cross-modal question answering, and interactive, explainable dialogue. Across tasks spanning cell-type annotation, molecular description, and protein function reasoning, compact BIOVERSE configurations surpass larger LLM baselines while enabling richer, generative outputs than existing BioFMs, establishing a foundation for principled multi-modal biomedical reasoning.
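The data flow the abstract describes (pretrained BioFM encoder, then a modality-specific projection into a shared LLM embedding space) can be illustrated with a minimal, dependency-free sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the plain linear form of the projection, the dimensions, the random initialization, the function names, and the choice to prepend the projected vectors to the text token embeddings as "soft tokens". No training objective is shown, since the abstract does not specify one.

```python
import random

# Hypothetical sizes: each frozen BioFM encoder has its own output width,
# while the LLM expects vectors of one fixed width.
ENCODER_DIMS = {"cell": 6, "molecule": 4}
LLM_DIM = 8


def make_projection(in_dim, out_dim, seed=0):
    """Return (weights, bias) for a linear map in_dim -> out_dim (illustrative init)."""
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 0.02) for _ in range(in_dim)] for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weights, bias


def project(embedding, weights, bias):
    """Apply the linear projection to a single encoder embedding."""
    return [sum(w * x for w, x in zip(row, embedding)) + b
            for row, b in zip(weights, bias)]


def build_llm_input(modality_embeddings, projections, text_token_embeddings):
    """Project each modality with its own layer, then prepend the results to the text."""
    soft_tokens = [project(emb, *projections[name])
                   for name, emb in modality_embeddings.items()]
    return soft_tokens + text_token_embeddings


# One independent projection per modality, mirroring "modality-specific" layers.
projections = {name: make_projection(dim, LLM_DIM, seed=i)
               for i, (name, dim) in enumerate(ENCODER_DIMS.items())}

# Stand-ins for frozen encoder outputs and for three embedded prompt tokens.
modality_embeddings = {name: [1.0] * dim for name, dim in ENCODER_DIMS.items()}
text_token_embeddings = [[0.0] * LLM_DIM for _ in range(3)]

sequence = build_llm_input(modality_embeddings, projections, text_token_embeddings)
```

In this sketch, `sequence` holds two projected modality vectors followed by three text vectors, all of width `LLM_DIM`, so a language model could consume them as one input sequence. Keeping one projection per modality means each can be trained independently, which matches the first stage the abstract outlines.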

