LLM-as-a-Judge: Toward World Models for Slate Recommendation Systems
2511.04541v1
cs.IR, cs.AI
2025-11-08
Авторы:
Baptiste Bonin, Maxime Heuillet, Audrey Durand
Abstract
Modeling user preferences across domains remains a key challenge in slate
recommendation (i.e. recommending an ordered sequence of items) research. We
investigate how Large Language Models (LLM) can effectively act as world models
of user preferences through pairwise reasoning over slates. We conduct an
empirical study involving several LLMs on three tasks spanning different
datasets. Our results reveal relationships between task performance and
properties of the preference function captured by LLMs, hinting towards areas
for improvement and highlighting the potential of LLMs as world models in
recommender systems.
Ссылки и действия
Дополнительные ресурсы: