Optimizing LLM Code Suggestions: Feedback-Driven Timing with Lightweight State Bounds

2511.18842v1 cs.SE, cs.AI, cs.HC 2025-11-26

Authors:

Mohammad Nour Al Awad, Sergey Ivanov, Olga Tikhonova

Abstract

Large Language Models (LLMs) have transformed code auto-completion by generating context-aware suggestions. Yet deciding when to present these suggestions remains underexplored, often leading to interruptions or wasted inference calls. We propose an adaptive timing mechanism that dynamically adjusts the delay before offering a suggestion based on real-time developer feedback. Our method combines a logistic transform of recent acceptance rates with a bounded delay range, anchored by a high-level binary prediction of the developer's cognitive state. In a two-month deployment with professional developers, our system improved suggestion acceptance from 4.9% with no delay to 15.4% with static delays and to 18.6% with adaptive timing, while reducing blind rejections (rejections without being read) from 8.3% to 0.36%. Together, these improvements increase acceptance and reduce wasted inference calls by 75%, making LLM-based code assistants more efficient and cost-effective in practice.
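The abstract describes the mechanism only at a high level, so the following is a minimal sketch of how a logistic transform of recent acceptance rates could be mapped into a bounded delay range selected by a binary state prediction. It is not the authors' implementation. The class name `AdaptiveDelay`, the window size, the steepness and midpoint of the logistic, the delay bounds in seconds, and the direction of the mapping (higher acceptance gives a shorter delay) are all assumptions made for illustration.

```python
import math
from collections import deque


def logistic(x: float) -> float:
    """Standard logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


class AdaptiveDelay:
    """Hypothetical feedback-driven delay controller (illustrative only)."""

    def __init__(self, window=20, steepness=8.0, midpoint=0.15, bounds=None):
        # Sliding window of recent outcomes: 1 = accepted, 0 = rejected.
        self.history = deque(maxlen=window)
        self.steepness = steepness  # assumed slope of the logistic
        self.midpoint = midpoint    # assumed "neutral" acceptance rate
        # Assumed (min, max) delay in seconds for each binary state:
        # True = developer predicted to be in deep focus, False = otherwise.
        self.bounds = bounds or {True: (1.0, 3.0), False: (0.3, 1.5)}

    def record(self, accepted: bool) -> None:
        """Store the outcome of the most recent suggestion."""
        self.history.append(1 if accepted else 0)

    def acceptance_rate(self) -> float:
        """Recent acceptance rate; falls back to the midpoint with no data."""
        if not self.history:
            return self.midpoint
        return sum(self.history) / len(self.history)

    def delay(self, in_focus: bool) -> float:
        """Delay before showing a suggestion, kept inside the state's bounds."""
        lo, hi = self.bounds[in_focus]
        s = logistic(self.steepness * (self.acceptance_rate() - self.midpoint))
        # Assumption: higher recent acceptance shortens the delay.
        return lo + (hi - lo) * (1.0 - s)
```

Because the logistic output stays strictly between 0 and 1, the computed delay can never leave the range chosen for the predicted state, which is one way to read the "lightweight state bounds" in the title.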



Related papers

Generative AI for Self-Adaptive Systems: State of the Art and Research Roadmap

Catching UX Flaws in Code: Leveraging LLMs to Identify Usability Flaws at the De...

Pre-Filtering Code Suggestions using Developer Behavioral Telemetry to Optimize ...

AI for Requirements Engineering: Industry adoption and Practitioner perspectives

CodeAlignBench: Assessing Code Generation Models on Developer-Preferred Code Adj...
