📊 Digest statistics

Total digests: 34607 Added today: 484

Last updated: today

📄 STAR-GO: Improving Protein Function Prediction by Learning to Hierarchically Integrate Ontology-Informed Semantic Embeddings

2025-12-09

Authors:

Mehmet Efe Akça, Gökçe Uludoğan, Arzucan Özgür, İnci M. Baytaş

Annotation:

Accurate prediction of protein function is essential for elucidating molecular mechanisms and advancing biological and therapeutic discovery. Yet experimental annotation lags far behind the rapid growth of protein sequence data. Computational approaches address this gap by associating proteins with Gene Ontology (GO) terms, which encode functional knowledge through hierarchical relations and textual definitions. However, existing models often emphasize one modality over the other, limiting their...

ID: 2512.05245v1 q-bio.BM, cs.LG

arXiv PDF

📄 Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time

2025-12-04

Authors:

Daniel D. Richman, Jessica Karaguesian, Carl-Mikael Suomivuori, Ron O. Dror

Annotation:

The function of biomolecules such as proteins depends on their ability to interconvert between a wide range of structures or "conformations." Researchers have endeavored for decades to develop computational methods to predict the distribution of conformations, which is far harder to determine experimentally than a static folded structure. We present ConforMix, an inference-time algorithm that enhances sampling of conformational distributions using a combination of classifier guidance, filtering,...

ID: 2512.03312v1 q-bio.BM, cs.LG

arXiv PDF

📄 Few-shot Protein Fitness Prediction via In-context Learning and Test-time Training

2025-12-04

Authors:

Felix Teufel, Aaron W. Kollasch, Yining Huang, Ole Winther, Kevin K. Yang, Pascal Notin, Debora S. Marks

Annotation:

Accurately predicting protein fitness with minimal experimental data is a persistent challenge in protein engineering. We introduce PRIMO (PRotein In-context Mutation Oracle), a transformer-based framework that leverages in-context learning and test-time training to adapt rapidly to new proteins and assays without large task-specific datasets. By encoding sequence information, auxiliary zero-shot predictions, and sparse experimental labels from many assays as a unified token set in a pre-trainin...

ID: 2512.02315v1 q-bio.BM, cs.LG

arXiv PDF

📄 EnzyCLIP: A Cross-Attention Dual Encoder Framework with Contrastive Learning for Predicting Enzyme Kinetic Constants

2025-12-02

Authors:

Anas Aziz Khan, Md Shah Fahad, Priyanka, Ramesh Chandra, Guransh Singh

Annotation:

Accurate prediction of enzyme kinetic parameters is crucial for drug discovery, metabolic engineering, and synthetic biology applications. Current computational approaches face limitations in capturing complex enzyme-substrate interactions and often focus on single parameters while neglecting the joint prediction of catalytic turnover numbers (Kcat) and Michaelis-Menten constants (Km). We present EnzyCLIP, a novel dual-encoder framework that leverages contrastive learning and cross-attention mec...

ID: 2512.00379v1 q-bio.BM, cs.LG

arXiv PDF

📄 Compact Artificial Neural Network Models for Predicting Protein Residue -- RNA Base Binding

2025-11-15

Authors:

Stanislav Selitskiy

Annotation:

Large Artificial Neural Network (ANN) models have demonstrated success in various domains, including general text and image generation, drug discovery, and protein-RNA (ribonucleic acid) binding tasks. However, these models typically demand substantial computational resources, time, and data for effective training. Given that such extensive resources are often inaccessible to many researchers and that life sciences data sets are frequently limited, we investigated whether small ANN models could ...

ID: 2511.08648v1 q-bio.BM, cs.LG

arXiv PDF

📄 EnzyControl: Adding Functional and Substrate-Specific Control for Enzyme Backbone Generation

2025-10-31

Authors:

Chao Song, Zhiyuan Liu, Han Huang, Liang Wang, Qiong Wang, Jianyu Shi, Hui Yu, Yihang Zhou, Yang Zhang

Annotation:

Designing enzyme backbones with substrate-specific functionality is a critical challenge in computational protein engineering. Current generative models excel in protein design but face limitations in binding data, substrate-specific control, and flexibility for de novo enzyme backbone generation. To address this, we introduce EnzyBind, a dataset with 11,100 experimentally validated enzyme-substrate pairs specifically curated from PDBbind. Building on this, we propose EnzyControl, a method that ...

ID: 2510.25132v1 q-bio.BM, cs.LG

arXiv PDF

📄 Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

2025-10-14

Authors:

Siyuan Chen, Minghao Guo, Caoliwen Wang, Anka He Chen, Yikun Zhang, Jingjing Chai, Yin Yang, Wojciech Matusik, Peter Yichen Chen

Annotation:

Biomolecular interaction modeling has been substantially advanced by foundation models, yet they often produce all-atom structures that violate basic steric feasibility. We address this limitation by enforcing physical validity as a strict constraint during both training and inference with a unified module. At its core is a differentiable projection that maps the provisional atom coordinates from the diffusion model to the nearest physically valid configuration. This projection is achieved using...

ID: 2510.08946v1 q-bio.BM, cs.LG

arXiv PDF

📄 FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction

2025-10-07

Authors:

Julian Cremer, Tuan Le, Mohammad M. Ghahremanpour, Emilia Sługocka, Filipe Menezes, Djork-Arné Clevert

Annotation:

We present FLOWR:root, an equivariant flow-matching model for pocket-aware 3D ligand generation with joint binding affinity prediction and confidence estimation. The model supports de novo generation, pharmacophore-conditional sampling, fragment elaboration, and multi-endpoint affinity prediction (pIC50, pKi, pKd, pEC50). Training combines large-scale ligand libraries with mixed-fidelity protein-ligand complexes, followed by refinement on curated co-crystal datasets and parameter-efficient finet...

ID: 2510.02578v2 q-bio.BM, cs.LG

arXiv PDF

📄 GeoGraph: Geometric and Graph-based Ensemble Descriptors for Intrinsically Disordered Proteins

2025-10-04

Authors:

Eoin Quinn, Marco Carobene, Jean Quentin, Sebastien Boyer, Miguel Arbesú, Oliver Bent

Annotation:

While deep learning has revolutionized the prediction of rigid protein structures, modelling the conformational ensembles of Intrinsically Disordered Proteins (IDPs) remains a key frontier. Current AI paradigms present a trade-off: Protein Language Models (PLMs) capture evolutionary statistics but lack explicit physical grounding, while generative models trained to model full ensembles are computationally expensive. In this work we critically assess these limits and propose a path forward. We in...

ID: 2510.00774v1 q-bio.BM, cs.LG

arXiv PDF

📄 AI-based Methods for Simulating, Sampling, and Predicting Protein Ensembles

2025-09-24

Authors:

Bowen Jing, Bonnie Berger, Tommi Jaakkola

## Context

A central task in bioinformatics is understanding and modeling proteins, which often exist as ensembles, that is, as collections of structures that interact and change shape depending on conditions. Traditional approaches to modeling ensembles often require large computational resources and labor-intensive experiments. Despite breakthroughs in protein structure modeling, obtaining realistic protein ensembles remains difficult. New deep-learning models help close this gap by offering more accurate and efficient methods for modeling and simulating proteins, including their statistical properties and behavior under different conditions.

## Method

The methods are based on deep learning and include coarse-grained force fields, generative models, sequence perturbation methods, and models of ensemble descriptors. Deep neural networks are used to learn the relationship between protein ensembles and their physiological properties. Techniques include deep reinforcement learning, deep convolutional networks, language models for protein prediction, and multiple sequence alignment methods. These methods are applied to predict and simulate proteins accurately and to generate virtual protein ensembles that can be used in a range of biological studies.

## Results

Studies have shown that deep neural networks can accurately predict protein ensembles and simulate their behavior under different conditions. Using multiple sequence alignments improves the accuracy of models that predict protein dynamics. The potential of deep neural networks was also demonstrated for simulating proteins whose properties change in real time, as well as in a purely virtual setting. These approaches showed high accuracy in predicting how proteins behave under different conditions, including their ability to change shape and functional properties.

## Significance

These methods may be useful in a range of areas, including drug research, genetic engineering, and the search for new biological markers. The models can be used to predict protein ensembles, which improves understanding of their functional properties and their interactions with other proteins. This may lead to the development of new drug...

Annotation:

Advances in deep learning have opened an era of abundant and accurate predicted protein structures; however, similar progress in protein ensembles has remained elusive. This review highlights several recent research directions towards AI-based predictions of protein ensembles, including coarse-grained force fields, generative models, multiple sequence alignment perturbation methods, and modeling of ensemble descriptors. An emphasis is placed on realistic assessments of the technological maturity...

ID: 2509.17224v1 q-bio.BM, cs.LG, physics.bio-ph

arXiv PDF

Showing 1 - 10 of 17 entries