The Oracle and The Prism: A Decoupled and Efficient Framework for Generative Recommendation Explanation

2511.16543v1 cs.IR, cs.AI, cs.CL, cs.LG 2025-11-21
Авторы:

Jiaheng Zhang, Daqiang Zhang

Abstract

The integration of Large Language Models (LLMs) into explainable recommendation systems often leads to a performance-efficiency trade-off in end-to-end architectures, where joint optimization of ranking and explanation can result in suboptimal compromises. To resolve this, we propose Prism, a novel decoupled framework that rigorously separates the recommendation process into a dedicated ranking stage and an explanation generation stage. Inspired by knowledge distillation, Prism leverages a powerful teacher LLM (e.g., FLAN-T5-XXL) as an Oracle to produce high-fidelity explanatory knowledge. A compact, fine-tuned student model (e.g., BART-Base), the Prism, then specializes in synthesizing this knowledge into personalized explanations. This decomposition ensures that each component is optimized for its specific objective, eliminating inherent conflicts in coupled models. Extensive experiments on benchmark datasets demonstrate that our 140M-parameter Prism model significantly outperforms its 11B-parameter teacher in human evaluations of faithfulness and personalization, while achieving a 24 times speedup and a 10 times reduction in memory consumption during inference. These results validate that decoupling, coupled with targeted distillation, provides an efficient and effective pathway to high-quality explainable recommendation.

Ссылки и действия

Связанные статьи

LLM-Enhanced Linear Autoencoders for Recommendation

## Контекст Интеллектуальные рекомендательные системы (IRS) широко используются для поиска и предоставления полезной инф...

2025-08-21

Personalized Product Search Ranking: A Multi-Task Learning Approach with Tabular...

## Контекст Поиск продуктов на основе персонализации является ключевым аспектом современных электронных магазинов, позв...

2025-08-15