📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Impact of Layer Norm on Memorization and Generalization in Transformers

2025-11-15

Авторы:

Rishi Singhal, Jung-Eun Kim

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Layer Normalization (LayerNorm) is one of the fundamental components in transformers that stabilizes training and improves optimization. In recent times, Pre-LayerNorm transformers have become the preferred choice over Post-LayerNorm transformers due to their stable gradient flow. However, the impact of LayerNorm on learning and memorization across these architectures remains unclear. In this work, we investigate how LayerNorm influences memorization and learning for Pre- and Post-LayerNorm tran...

ID: 2511.10566v1 cs.LG, cs.AI, cs.CL, cs.CV

arXiv PDF

📄 Towards Emotionally Intelligent and Responsible Reinforcement Learning

2025-11-15

Авторы:

Garapati Keerthana, Manik Gupta

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Personalized decision systems in healthcare and behavioral support often rely on static rule-based or engagement-maximizing heuristics that overlook users' emotional context and ethical constraints. Such approaches risk recommending insensitive or unsafe interventions, especially in domains involving serious mental illness, substance use disorders, or depression. To address this limitation, we propose a Responsible Reinforcement Learning (RRL) framework that integrates emotional and contextual u...

ID: 2511.10573v1 cs.LG, cs.AI, cs.CL, cs.HC, cs.MA

arXiv PDF

📄 APP: Accelerated Path Patching with Task-Specific Pruning

2025-11-11

Авторы:

Frauke Andersen, William Rudman, Ruochen Zhang, Carsten Eickhoff

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Circuit discovery is a key step in many mechanistic interpretability pipelines. Current methods, such as Path Patching, are computationally expensive and have limited in-depth circuit analysis for smaller models. In this study, we propose Accelerated Path Patching (APP), a hybrid approach leveraging our novel contrastive attention head pruning method to drastically reduce the search space of circuit discovery methods. Our Contrastive-FLAP pruning algorithm uses techniques from causal mediation a...

ID: 2511.05442v1 cs.LG, cs.AI, cs.CL, 68Uxx, I.2.7; I.2.6; I.2.m

arXiv PDF

📄 RLHF: A comprehensive Survey for Cultural, Multimodal and Low Latency Alignment Methods

2025-11-08

Авторы:

Raghav Sharma, Manan Mehta, Sai Tiger Raina

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Reinforcement Learning from Human Feedback (RLHF) is the standard for aligning Large Language Models (LLMs), yet recent progress has moved beyond canonical text-based methods. This survey synthesizes the new frontier of alignment research by addressing critical gaps in multi-modal alignment, cultural fairness, and low-latency optimization. To systematically explore these domains, we first review foundational algo- rithms, including PPO, DPO, and GRPO, before presenting a detailed analysis of the...

ID: 2511.03939v1 cs.LG, cs.AI, cs.CL

arXiv PDF

📄 Towards Scalable Meta-Learning of near-optimal Interpretable Models via Synthetic Model Generations

2025-11-08

Авторы:

Kyaw Hpone Myint, Zhe Wu, Alexandre G. R. Day, Giri Iyengar

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Decision trees are widely used in high-stakes fields like finance and healthcare due to their interpretability. This work introduces an efficient, scalable method for generating synthetic pre-training data to enable meta-learning of decision trees. Our approach samples near-optimal decision trees synthetically, creating large-scale, realistic datasets. Using the MetaTree transformer architecture, we demonstrate that this method achieves performance comparable to pre-training on real-world data o...

ID: 2511.04000v1 cs.LG, cs.AI, cs.CL, stat.ML

arXiv PDF

📄 Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs

2025-11-08

Авторы:

Alberto Cattaneo, Carlo Luschi, Daniel Justus

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Retrieval of information from graph-structured knowledge bases represents a promising direction for improving the factuality of LLMs. While various solutions have been proposed, a comparison of methods is difficult due to the lack of challenging QA datasets with ground-truth targets for graph retrieval. We present SynthKGQA, a framework for generating high-quality synthetic Knowledge Graph Question Answering datasets from any Knowledge Graph, providing the full set of ground-truth facts in the K...

ID: 2511.04473v1 cs.LG, cs.AI, cs.CL, cs.IR

arXiv PDF

📄 Zero-shot data citation function classification using transformer-based large language models (LLMs)

2025-11-07

Авторы:

Neil Byers, Ali Zaidi, Valerie Skye, Chris Beecroft, Kjiersten Fagnan

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Efforts have increased in recent years to identify associations between specific datasets and the scientific literature that incorporates them. Knowing that a given publication cites a given dataset, the next logical step is to explore how or why that data was used. Advances in recent years with pretrained, transformer-based large language models (LLMs) offer potential means for scaling the description of data use cases in the published literature. This avoids expensive manual labeling and the d...

ID: 2511.02936v1 cs.LG, cs.AI, cs.CL

arXiv PDF

📄 Reasoning Planning for Language Models

2025-11-06

Авторы:

Bao Nguyen, Hieu Trung Nguyen, Ruifeng She, Xiaojin Fu, Viet Anh Nguyen

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Selecting an appropriate reasoning method for a given query remains a key challenge in language model generation. Existing approaches typically generate multiple candidate responses and use an aggregation strategy to select the output answer, often assuming that more candidate answers yield higher accuracy. We revisit this assumption through a rigorous theoretical analysis, deriving accuracy bounds for standard aggregation methods under fixed generation distributions and candidate sizes. Buildin...

ID: 2511.00521v1 cs.LG, cs.AI, cs.CL

arXiv PDF

📄 Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering

2025-11-06

Авторы:

Eric Bigelow, Daniel Wurgaft, YingQiao Wang, Noah Goodman, Tomer Ullman, Hidenori Tanaka, Ekdeep Singh Lubana

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large language models (LLMs) can be controlled at inference time through prompts (in-context learning) and internal activations (activation steering). Different accounts have been proposed to explain these methods, yet their common goal of controlling model behavior raises the question of whether these seemingly disparate methodologies can be seen as specific instances of a broader framework. Motivated by this, we develop a unifying, predictive account of LLM control from a Bayesian perspective....

ID: 2511.00617v1 cs.LG, cs.AI, cs.CL, stat.ML

arXiv PDF

📄 RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks

2025-11-06

Авторы:

Mian Wu, Gavin Zhang, Sewon Min, Sergey Levine, Aviral Kumar

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Open-ended generation tasks require outputs to satisfy diverse and often implicit task-specific evaluation rubrics. The sheer number of relevant rubrics leads to prohibitively high verification costs and incomplete assessments of a response, making reinforcement learning (RL) post-training with rubric-based rewards difficult to scale. This problem is exacerbated by the fact that often the best way to combine these rubrics into one single reward is also highly prompt-specific. We propose Reinforc...

ID: 2511.01758v1 cs.LG, cs.AI, cs.CL

arXiv PDF

Показано 41 - 50 из 278 записей