📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Agentic AI Home Energy Management System: A Large Language Model Framework for Residential Load Scheduling

2025-11-01

Авторы:

Reda El Makroum, Sebastian Zwickl-Bernhard, Lukas Kranzl

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The electricity sector transition requires substantial increases in residential demand response capacity, yet Home Energy Management Systems (HEMS) adoption remains limited by user interaction barriers requiring translation of everyday preferences into technical parameters. While large language models have been applied to energy systems as code generators and parameter extractors, no existing implementation deploys LLMs as autonomous coordinators managing the complete workflow from natural langu...

ID: 2510.26603v1 cs.AI, cs.MA, cs.SY, eess.SY

arXiv PDF

📄 Counterfactual-based Agent Influence Ranker for Agentic AI Workflows

2025-10-31

Авторы:

Amit Giloni, Chiara Picardi, Roy Betser, Shamik Bose, Aishvariya Priya Rathina Sabapathy, Roman Vainshtein

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

An Agentic AI Workflow (AAW), also known as an LLM-based multi-agent system, is an autonomous system that assembles several LLM-based agents to work collaboratively towards a shared goal. The high autonomy, widespread adoption, and growing interest in such AAWs highlight the need for a deeper understanding of their operations, from both quality and security aspects. To this day, there are no existing methods to assess the influence of each agent on the AAW's final output. Adopting techniques fro...

ID: 2510.25612v1 cs.AI, cs.MA

arXiv PDF

📄 TDFlow: Agentic Workflows for Test Driven Software Engineering

2025-10-30

Авторы:

Kevin Han, Siddharth Maddikayala, Tim Knappe, Om Patel, Austen Liao, Amir Barati Farimani

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We introduce TDFlow, a novel test-driven agentic workflow that frames repository-scale software engineering as a test-resolution task, specifically designed to solve human-written tests. Given a set of tests, TDFlow repeatedly proposes, revises, and debugs repository-scale patches using precisely engineered sub-agents and tightly constrained tools. The workflow decomposes software engineering program repair into four components governed by respective sub-agents. This simple, forced decoupling of...

ID: 2510.23761v1 cs.SE, cs.AI, cs.MA

arXiv PDF

📄 Affordance Representation and Recognition for Autonomous Agents

2025-10-30

Авторы:

Habtom Kahsay Gidey, Niklas Huber, Alexander Lenz, Alois Knoll

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The autonomy of software agents is fundamentally dependent on their ability to construct an actionable internal world model from the structured data that defines their digital environment, such as the Document Object Model (DOM) of web pages and the semantic descriptions of web services. However, constructing this world model from raw structured data presents two critical challenges: the verbosity of raw HTML makes it computationally intractable for direct use by foundation models, while the sta...

ID: 2510.24459v1 cs.AI, cs.MA, cs.SE

arXiv PDF

📄 Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics

2025-10-29

Авторы:

Lorenzo Magnino, Kai Shao, Zida Wu, Jiacheng Shen, Mathieu Laurière

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Mean field games (MFGs) have emerged as a powerful framework for modeling interactions in large-scale multi-agent systems. Despite recent advancements in reinforcement learning (RL) for MFGs, existing methods are typically limited to finite spaces or stationary models, hindering their applicability to real-world problems. This paper introduces a novel deep reinforcement learning (DRL) algorithm specifically designed for non-stationary continuous MFGs. The proposed approach builds upon a Fictitio...

ID: 2510.22158v1 cs.LG, cs.AI, cs.MA, math.OC

arXiv PDF

📄 SPIRAL: Self-Play Incremental Racing Algorithm for Learning in Multi-Drone Competitions

2025-10-29

Авторы:

Onur Akgün

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

This paper introduces SPIRAL (Self-Play Incremental Racing Algorithm for Learning), a novel approach for training autonomous drones in multi-agent racing competitions. SPIRAL distinctively employs a self-play mechanism to incrementally cultivate complex racing behaviors within a challenging, dynamic environment. Through this self-play core, drones continuously compete against increasingly proficient versions of themselves, naturally escalating the difficulty of competitive interactions. This pro...

ID: 2510.22568v1 cs.RO, cs.AI, cs.MA, cs.SY, eess.SY, I.2.9; I.2.11; I.2.6

arXiv PDF

📄 Curriculum-Based Iterative Self-Play for Scalable Multi-Drone Racing

2025-10-29

Авторы:

Onur Akgün

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The coordination of multiple autonomous agents in high-speed, competitive environments represents a significant engineering challenge. This paper presents CRUISE (Curriculum-Based Iterative Self-Play for Scalable Multi-Drone Racing), a reinforcement learning framework designed to solve this challenge in the demanding domain of multi-drone racing. CRUISE overcomes key scalability limitations by synergistically combining a progressive difficulty curriculum with an efficient self-play mechanism to ...

ID: 2510.22570v1 cs.RO, cs.AI, cs.MA, cs.SY, eess.SY, I.2.9; I.2.11; I.2.6

arXiv PDF

📄 TABL-ABM: A Hybrid Framework for Synthetic LOB Generation

2025-10-29

Авторы:

Ollie Olby, Rory Baggott, Namid Stillman

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The recent application of deep learning models to financial trading has heightened the need for high fidelity financial time series data. This synthetic data can be used to supplement historical data to train large trading models. The state-of-the-art models for the generative application often rely on huge amounts of historical data and large, complicated models. These models range from autoregressive and diffusion-based models through to architecturally simpler models such as the temporal-atte...

ID: 2510.22685v1 q-fin.CP, cs.AI, cs.MA, q-fin.TR

arXiv PDF

📄 Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimization for Multi-Robot SLAM

2025-10-29

Авторы:

Sai Krishna Ghanta, Ramviyas Parasuraman

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We consider the distributed pose-graph optimization (PGO) problem, which is fundamental in accurate trajectory estimation in multi-robot simultaneous localization and mapping (SLAM). Conventional iterative approaches linearize a highly non-convex optimization objective, requiring repeated solving of normal equations, which often converge to local minima and thus produce suboptimal estimates. We propose a scalable, outlier-robust distributed planar PGO framework using Multi-Agent Reinforcement Le...

ID: 2510.22740v1 cs.RO, cs.AI, cs.MA

arXiv PDF

📄 Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner

2025-10-29

Авторы:

Kechen Meng, Sinuo Zhang, Rongpeng Li, Xiangming Meng, Chan Wang, Ming Lei, Zhifeng Zhao

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

In wireless communication systems, efficient and adaptive resource allocation plays a crucial role in enhancing overall Quality of Service (QoS). While centralized Multi-Agent Reinforcement Learning (MARL) frameworks rely on a central coordinator for policy training and resource scheduling, they suffer from scalability issues and privacy risks. In contrast, the Distributed Training with Decentralized Execution (DTDE) paradigm enables distributed learning and decision-making, but it struggles wit...

ID: 2510.22969v1 cs.AI, cs.MA

arXiv PDF

Показано 51 - 60 из 161 записей