📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 0

Последнее обновление: сегодня

📄 Towards Scalable Meta-Learning of near-optimal Interpretable Models via Synthetic Model Generations

2025-11-08

Авторы:

Kyaw Hpone Myint, Zhe Wu, Alexandre G. R. Day, Giri Iyengar

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Decision trees are widely used in high-stakes fields like finance and healthcare due to their interpretability. This work introduces an efficient, scalable method for generating synthetic pre-training data to enable meta-learning of decision trees. Our approach samples near-optimal decision trees synthetically, creating large-scale, realistic datasets. Using the MetaTree transformer architecture, we demonstrate that this method achieves performance comparable to pre-training on real-world data o...

ID: 2511.04000v1 cs.LG, cs.AI, cs.CL, stat.ML

arXiv PDF

📄 Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs

2025-11-08

Авторы:

Alberto Cattaneo, Carlo Luschi, Daniel Justus

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Retrieval of information from graph-structured knowledge bases represents a promising direction for improving the factuality of LLMs. While various solutions have been proposed, a comparison of methods is difficult due to the lack of challenging QA datasets with ground-truth targets for graph retrieval. We present SynthKGQA, a framework for generating high-quality synthetic Knowledge Graph Question Answering datasets from any Knowledge Graph, providing the full set of ground-truth facts in the K...

ID: 2511.04473v1 cs.LG, cs.AI, cs.CL, cs.IR

arXiv PDF

📄 Large language models replicate and predict human cooperation across experiments in game theory

2025-11-08

Авторы:

Andrea Cera Palatsi, Samuel Martin-Gutierrez, Ana S. Cardenal, Max Pellert

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large language models (LLMs) are increasingly used both to make decisions in domains such as health, education and law, and to simulate human behavior. Yet how closely LLMs mirror actual human decision-making remains poorly understood. This gap is critical: misalignment could produce harmful outcomes in practical applications, while failure to replicate human behavior renders LLMs ineffective for social simulations. Here, we address this gap by developing a digital twin of game-theoretic experim...

ID: 2511.04500v1 cs.AI, cs.CL, cs.GT, cs.MA

arXiv PDF

📄 Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper

2025-11-08

Авторы:

Atsuyuki Miyai, Mashiro Toyooka, Takashi Otonari, Zaiying Zhao, Kiyoharu Aizawa

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Understanding the current capabilities and risks of AI Scientist systems is essential for ensuring trustworthy and sustainable AI-driven scientific progress while preserving the integrity of the academic ecosystem. To this end, we develop Jr. AI Scientist, a state-of-the-art autonomous AI scientist system that mimics the core research workflow of a novice student researcher: Given the baseline paper from the human mentor, it analyzes its limitations, formulates novel hypotheses for improvement, ...

ID: 2511.04583v1 cs.AI, cs.CL, cs.CV, cs.LG

arXiv PDF

📄 Are We Asking the Right Questions? On Ambiguity in Natural Language Queries for Tabular Data Analysis

2025-11-08

Авторы:

Daniel Gomm, Cornelius Wolff, Madelon Hulsebos

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Natural language interfaces to tabular data must handle ambiguities inherent to queries. Instead of treating ambiguity as a deficiency, we reframe it as a feature of cooperative interaction, where the responsibility of query specification is shared among the user and the system. We develop a principled framework distinguishing cooperative queries, i.e., queries that yield a resolvable interpretation, from uncooperative queries that cannot be resolved. Applying the framework to evaluations for ta...

ID: 2511.04584v1 cs.AI, cs.CL, cs.DB, cs.HC

arXiv PDF

📄 DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration

2025-11-08

Авторы:

Narjes Nourzad, Hanqing Yang, Shiyu Chen, Carlee Joe-Wong

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Cooperative multi-agent planning requires agents to make joint decisions with partial information and limited communication. Coordination at the trajectory level often fails, as small deviations in timing or movement cascade into conflicts. Symbolic planning mitigates this challenge by raising the level of abstraction and providing a minimal vocabulary of actions that enable synchronization and collective progress. We present DR. WELL, a decentralized neurosymbolic framework for cooperative mult...

ID: 2511.04646v1 cs.AI, cs.CL, cs.LG, cs.MA

arXiv PDF

📄 VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

2025-11-08

Авторы:

Yu Feng, Nathaniel Weir, Kaj Bostrom, Sam Bayless, Darion Cassel, Sapana Chaudhary, Benjamin Kiesl-Reiter, Huzefa Rangwala

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

LLMs can perform multi-step reasoning through Chain-of-Thought (CoT), but they cannot reliably verify their own logic. Even when they reach correct answers, the underlying reasoning may be flawed, undermining trust in high-stakes scenarios. To mitigate this issue, we introduce VeriCoT, a neuro-symbolic method that extracts and verifies formal logical arguments from CoT reasoning. VeriCoT formalizes each CoT reasoning step into first-order logic and identifies premises that ground the argument in...

ID: 2511.04662v1 cs.AI, cs.CL

arXiv PDF

📄 Zero-shot data citation function classification using transformer-based large language models (LLMs)

2025-11-07

Авторы:

Neil Byers, Ali Zaidi, Valerie Skye, Chris Beecroft, Kjiersten Fagnan

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Efforts have increased in recent years to identify associations between specific datasets and the scientific literature that incorporates them. Knowing that a given publication cites a given dataset, the next logical step is to explore how or why that data was used. Advances in recent years with pretrained, transformer-based large language models (LLMs) offer potential means for scaling the description of data use cases in the published literature. This avoids expensive manual labeling and the d...

ID: 2511.02936v1 cs.LG, cs.AI, cs.CL

arXiv PDF

📄 From Measurement to Expertise: Empathetic Expert Adapters for Context-Based Empathy in Conversational AI Agents

2025-11-07

Авторы:

Erfan Shayegani, Jina Suh, Andy Wilson, Nagu Rangan, Javier Hernandez

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Empathy is a critical factor in fostering positive user experiences in conversational AI. While models can display empathy, it is often generic rather than tailored to specific tasks and contexts. In this work, we introduce a novel framework for developing and evaluating context-specific empathetic large language models (LLMs). We first analyze a real-world conversational dataset consisting of 672 multi-turn conversations across 8 tasks, revealing significant differences in terms of expected and...

ID: 2511.03143v1 cs.HC, cs.AI, cs.CL, cs.CY, cs.LG

arXiv PDF

📄 Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology

2025-11-07

Авторы:

Thomas Souverain

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

To foster trustworthy Artificial Intelligence (AI) within the European Union, the AI Act requires providers to mark and detect the outputs of their general-purpose models. The Article 50 and Recital 133 call for marking methods that are ''sufficiently reliable, interoperable, effective and robust''. Yet, the rapidly evolving and heterogeneous landscape of watermarks for Large Language Models (LLMs) makes it difficult to determine how these four standards can be translated into concrete and measu...

ID: 2511.03641v1 cs.CR, cs.AI, cs.CL, cs.CY, 68T01, 68727, 68T30, 68T35, 68T37, 68T50

arXiv PDF

Показано 221 - 230 из 1292 записей