📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Advancing Automated Ethical Profiling in SE: a Zero-Shot Evaluation of LLM Reasoning

2025-10-04

Авторы:

Patrizio Migliarini, Mashal Afzal Memon, Marco Autili, Paola Inverardi

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large Language Models (LLMs) are increasingly integrated into software engineering (SE) tools for tasks that extend beyond code synthesis, including judgment under uncertainty and reasoning in ethically significant contexts. We present a fully automated framework for assessing ethical reasoning capabilities across 16 LLMs in a zero-shot setting, using 30 real-world ethically charged scenarios. Each model is prompted to identify the most applicable ethical theory to an action, assess its moral ac...

ID: 2510.00881v1 cs.SE, cs.AI

arXiv PDF

📄 CodeGenLink: A Tool to Find the Likely Origin and License of Automatically Generated Code

2025-10-04

Авторы:

Daniele Bifolco, Guido Annicchiarico, Pierluigi Barbiero, Massimiliano Di Penta, Fiorella Zampetti

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large Language Models (LLMs) are widely used in software development tasks nowadays. Unlike reusing code taken from the Web, for LLMs' generated code, developers are concerned about its lack of trustworthiness and possible copyright or licensing violations, due to the lack of code provenance information. This paper proposes CodeGenLink, a GitHub CoPilot extension for Visual Studio Code aimed at (i) suggesting links containing code very similar to automatically generated code, and (ii) whenever p...

ID: 2510.01077v1 cs.SE, cs.AI

arXiv PDF

📄 Clarifying Semantics of In-Context Examples for Unit Test Generation

2025-10-04

Авторы:

Chen Yang, Lin Yang, Ziqi Wang, Dong Wang, Jianyi Zhou, Junjie Chen

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Recent advances in large language models (LLMs) have enabled promising performance in unit test generation through in-context learning (ICL). However, the quality of in-context examples significantly influences the effectiveness of generated tests-poorly structured or semantically unclear test examples often lead to suboptimal outputs. In this paper, we propose CLAST, a novel technique that systematically refines unit tests to improve their semantic clarity, thereby enhancing their utility as in...

ID: 2510.01994v1 cs.SE, cs.AI

arXiv PDF

📄 SIEVE: Towards Verifiable Certification for Code-datasets

2025-10-04

Авторы:

Fatou Ndiaye Mbodji, El-hacen Diallo, Jordan Samhi, Kui Liu, Jacques Klein, Tegawendé F. Bissyande

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Code agents and empirical software engineering rely on public code datasets, yet these datasets lack verifiable quality guarantees. Static 'dataset cards' inform, but they are neither auditable nor do they offer statistical guarantees, making it difficult to attest to dataset quality. Teams build isolated, ad-hoc cleaning pipelines. This fragments effort and raises cost. We present SIEVE, a community-driven framework. It turns per-property checks into Confidence Cards-machine-readable, verifiabl...

ID: 2510.02166v1 cs.SE, cs.AI

arXiv PDF

📄 Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development

2025-10-02

Авторы:

Yuxuan Wan, Tingshuo Liang, Jiakai Xu, Jingyu Xiao, Yintong Huo, Michael R. Lyu

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Developing full-stack web applications is complex and time-intensive, demanding proficiency across diverse technologies and frameworks. Although recent advances in multimodal large language models (MLLMs) enable automated webpage generation from visual inputs, current solutions remain limited to front-end tasks and fail to deliver fully functional applications. In this work, we introduce TDDev, the first test-driven development (TDD)-enabled LLM-agent framework for end-to-end full-stack web appl...

ID: 2509.25297v2 cs.SE, cs.AI

arXiv PDF

📄 A Cartography of Open Collaboration in Open Source AI: Mapping Practices, Motivations, and Governance in 14 Open Large Language Model Projects

2025-10-02

Авторы:

Johan Linåker, Cailean Osborne, Jennifer Ding, Ben Burtenshaw

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The proliferation of open large language models (LLMs) is fostering a vibrant ecosystem of research and innovation in artificial intelligence (AI). However, the methods of collaboration used to develop open LLMs both before and after their public release have not yet been comprehensively studied, limiting our understanding of how open LLM projects are initiated, organized, and governed as well as what opportunities there are to foster this ecosystem even further. We address this gap through an e...

ID: 2509.25397v1 cs.SE, cs.AI, cs.LG

arXiv PDF

📄 PIPer: On-Device Environment Setup via Online Reinforcement Learning

2025-10-02

Авторы:

Alexander Kovrigin, Aleksandra Eliseeva, Konstantin Grotov, Egor Bogomolov, Yaroslav Zharov

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Environment setup-the process of configuring the system to work with a specific software project-represents a persistent challenge in Software Engineering (SE). Automated environment setup methods could assist developers by providing fully configured environments for arbitrary repositories without manual effort. This also helps SE researchers to scale execution-based benchmarks. However, recent studies reveal that even state-of-the-art Large Language Models (LLMs) achieve limited success in auto...

ID: 2509.25455v1 cs.SE, cs.AI, cs.LG

arXiv PDF

📄 DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation

2025-10-02

Авторы:

Esakkivel Esakkiraja, Denis Akhiyarov, Aditya Shanmugham, Chitra Ganapathy

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Current search techniques are limited to standard RAG query-document applications. In this paper, we propose a novel technique to expand the code and index for predicting the required APIs, directly enabling high-quality, end-to-end code generation for auto-completion and agentic AI applications. We address the problem of API leaks in current code-to-code benchmark datasets by introducing a new dataset built from real-world ServiceNow Script Includes that capture the challenge of unclear API usa...

ID: 2509.25716v1 cs.SE, cs.AI, cs.IR

arXiv PDF

📄 R-Log: Incentivizing Log Analysis Capability in LLMs via Reasoning-based Reinforcement Learning

2025-10-02

Авторы:

Yilun Liu, Ziang Chen, Song Xu, Minggui He, Shimin Tao, Weibin Meng, Yuming Xie, Tao Han, Chunguang Zhao, Jingzhou Du, Daimeng Wei, Shenglin Zhang, Yongqian Sun

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The growing complexity of log data in modern software systems has prompted the use of Large Language Models (LLMs) for automated log analysis. Current approaches typically rely on direct supervised fine-tuning (SFT) on log-label pairs. However, this exacerbates the domain discrepancy between general-purpose LLMs and specialized log data, causing overfitting. Furthermore, SFT's imbalanced loss computation often allows lengthy contexts to overwhelm critical, concise details in model answers, leadi...

ID: 2509.25987v1 cs.SE, cs.AI

arXiv PDF

📄 Improving the Efficiency of LLM Agent Systems through Trajectory Reduction

2025-10-01

Авторы:

Yuan-An Xiao, Pengfei Gao, Chao Peng, Yingfei Xiong

## Контекст Large Language Models (LLMs) становятся все более популярными в сфере приложений, включая системы агентов в рамках становления интеллектуальных систем в трехмерной графической среде. Эти системы эффективны в решении задач, особенно в сфере программирования. Однако одной из главных проблем этих систем является высокая вычислительная стоимость. Это связано со сложностью траекторий, которые включают в себя многочисленные действия и слои информации. Несмотря на то, что эффективность этих систем широко изучается, внимание к оптимизации траекторий остается заумным вопросом. Этот аспект является ключевым мотивационным фактором для разработки новых методов снижения траектории. ## Метод Для улучшения эффективности траекторий LLMs-агентов мы предлагаем метод **AgentDiet**. Это подход модифицирует траекторию во время выполнения, удаляя ненужную, повторяющуюся и устаревшую информацию. Мы проводим анализ существующих траекторий, чтобы определить эти элементы и создать алгоритм, который автоматически их удаляет. Метод **AgentDiet** использует простую, но эффективную архитектуру, которая может быть легко интегрирована в существующие системы. Наша инновационная технология может быть применена к различным LLMs-агентам, уменьшая траекторию без потери качества. ## Результаты Мы провели ряд экспериментов, чтобы проверить эффективность **AgentDiet**. Мы использовали несколько LLMs-агентов, включая передовые модели, и две различные бенчмарк-коллекции. Эксперименты показали, что **AgentDiet** уменьшает траекторию на 39,9% ~ 59,7%, при этом сохраняя качество выполнения агента. Это приводит к снижению вычислительного затрат до 21,1% ~ 35,9%, что является ключевым преимуществом. Наши результаты показывают, что траектория может быть эффективно упрощена без потери качества, что является важной оптимизацией для LLMs-агентов. ## Значимость Наши результаты имеют значительное значение для развития LLMs-агентов в сферах, таких как программирование, контент-генерация и интеллектуальные системы. Метод **AgentDiet** позволяет повысить эффективность траекторий, что приводит к уменьшению вычислительных затрат и повышению скорости ответа. Это улучшение может быть применено в системах, где высокая скорость реакции и экономия ресурсов критичны. Развитие таких подходов может вести к новым возможностям в использовании LLMs в различных приложениях. ## Выводы Мы представили **AgentDiet**, метод для уменьшения траектории LLMs-агентов, который позволяет уменьшить вычислительные затраты без потери качества. Н

Annotation:

Multi-turn agent systems based on Large Language Models (LLMs) have been increasingly popular for software engineering tasks. While LLM agents show decent effectiveness, the high computational cost of input tokens due to the ever-growing trajectory remains an efficiency concern for their applications. Efficiency is largely neglected in existing studies and agent products, and this paper fills the gap by introducing an inference-time trajectory reduction approach to reduce the cost of agents. T...

ID: 2509.23586v1 cs.SE, cs.AI

arXiv PDF

Показано 191 - 200 из 341 записей