📊 Digest statistics

Total digests: 35039
Added today: 432

Last updated: today

📄 Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning

2025-12-05

Authors:

Purbesh Mitra, Sennur Ulukus

Annotation:

Long-context reasoning in large language models (LLMs) has been shown to enhance their cognitive capabilities via chain-of-thought (CoT) inference. Such models are usually trained via reinforcement learning with verifiable rewards (RLVR) on reasoning-based problems, such as math and programming. However, RLVR is limited by several bottlenecks, such as the lack of dense rewards and inadequate sample efficiency. As a result, it requires significant compute resources in the post-training phase. To...

ID: 2512.05105v1 cs.CL, cs.AI, cs.IT, cs.LG, eess.SP

arXiv PDF

📄 Know Your Limits: Entropy Estimation Modeling for Compression and Generalization

2025-11-15

Authors:

Benjamin L. Badger, Matthew Neligeorge

Annotation:

Language prediction is constrained by the informational entropy intrinsic to language, such that there is a limit to how accurate any language model can become and, equivalently, a lower bound on language compression. The most efficient language compression algorithms today are causal (next-token prediction) large language models, but using these models to form accurate estimates of language entropy is currently computationally infeasible. We introduce encoder-augmented causal decoder model a...

ID: 2511.10618v1 cs.CL, cs.AI, cs.IT, cs.LG

arXiv PDF

📄 The Geometry of Truth: Layer-wise Semantic Dynamics for Hallucination Detection in Large Language Models

2025-10-08

Authors:

Amir Hameed Mir

Annotation:

Large Language Models (LLMs) often produce fluent yet factually incorrect statements, a phenomenon known as hallucination, posing serious risks in high-stakes domains. We present Layer-wise Semantic Dynamics (LSD), a geometric framework for hallucination detection that analyzes the evolution of hidden-state semantics across transformer layers. Unlike prior methods that rely on multiple sampling passes or external verification sources, LSD operates intrinsically within the model's representational ...

ID: 2510.04933v1 cs.CL, cs.AI, cs.IT, cs.LG, cs.NE, math.IT, 68T50, 68T07, 62H30, I.2.7; I.2.6; F.2.2; H.3.3

arXiv PDF