Sequences of Logits Reveal the Low Rank Structure of Language Models

arXiv:2510.24966v1 [cs.LG, cs.AI, cs.CL, stat.ML], 2025-10-31

Authors:

Noah Golowich, Allen Liu, Abhishek Shetty

Abstract

A major problem in the study of large language models is to understand their inherent low-dimensional structure. We introduce an approach to study the low-dimensional structure of language models at a model-agnostic level: as sequential probabilistic models. We first empirically demonstrate that a wide range of modern language models exhibit low-rank structure: in particular, matrices built from the model's logits for varying sets of prompts and responses have low approximate rank. We then show that this low-rank structure can be leveraged for generation -- in particular, we can generate a response to a target prompt using a linear combination of the model's outputs on unrelated, or even nonsensical prompts. On the theoretical front, we observe that studying the approximate rank of language models in the sense discussed above yields a simple universal abstraction whose theoretical predictions parallel our experiments. We then analyze the representation power of the abstraction and give provable learning guarantees.
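The two empirical claims in the abstract (low approximate rank of logit matrices, and responses recovered as linear combinations of outputs on other prompts) can be illustrated with a minimal sketch. This is not the authors' code. It assumes a synthetic stand-in for the prompt-by-response logit matrix, and it assumes relative Frobenius error as the notion of "approximate rank"; the paper's exact definition may differ. The names `approximate_rank` and `combine_rows` are hypothetical helpers introduced only for this illustration.

```python
import numpy as np


def approximate_rank(matrix, rel_tol=0.05):
    """Smallest k whose best rank-k approximation has relative
    Frobenius error <= rel_tol (an assumed notion of approximate rank)."""
    s = np.linalg.svd(matrix, compute_uv=False)
    total = np.sum(s ** 2)
    # tail[k] is the squared Frobenius error after keeping the top k singular values.
    tail = total - np.concatenate(([0.0], np.cumsum(s ** 2)))
    rel_err = np.sqrt(np.maximum(tail, 0.0) / total)
    # argmax on a boolean array returns the index of the first True entry.
    return int(np.argmax(rel_err <= rel_tol))


def combine_rows(basis_rows, target_observed, observed_cols):
    """Least-squares coefficients that express a target row as a linear
    combination of basis rows, fitted only on the observed columns."""
    design = basis_rows[:, observed_cols].T
    coeffs, *_ = np.linalg.lstsq(design, target_observed, rcond=None)
    return coeffs


rng = np.random.default_rng(0)
n_prompts, n_responses, true_rank = 60, 80, 5

# Synthetic stand-in for a logit matrix: rows play the role of prompts,
# columns the role of responses. Real logits would come from a model.
M = rng.normal(size=(n_prompts, true_rank)) @ rng.normal(size=(true_rank, n_responses))
M_noisy = M + 0.01 * rng.normal(size=M.shape)

k = approximate_rank(M_noisy, rel_tol=0.05)

# Treat the last row as the "target prompt" and a few other rows as
# "unrelated prompts"; fit on some columns, predict the held-out ones.
observed = np.arange(40)
held_out = np.arange(40, n_responses)
basis = M_noisy[:8]
coeffs = combine_rows(basis, M_noisy[-1, observed], observed)
pred = coeffs @ basis[:, held_out]
rel_error = np.linalg.norm(pred - M_noisy[-1, held_out]) / np.linalg.norm(M_noisy[-1, held_out])

print("approximate rank:", k)
print("relative error on held-out entries:", rel_error)
```

On a matrix that is close to low rank, a handful of rows is enough to predict the unseen entries of another row, which is the linear-algebraic fact behind generating from a combination of outputs on other prompts. Whether real model logits behave this way is the empirical question the paper studies.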
