📊 Digest statistics
Total digests: 34123 Added today: 101
Last updated: today
Authors:
Laurent Bonnasse-Gahot, Jean-Pierre Nadal
Annotation:
In animals, category learning enhances discrimination between stimuli close
to the category boundary. This phenomenon, called categorical perception, was
also empirically observed in artificial neural networks trained on
classification tasks. In previous modeling works based on neuroscience data, we
show that this expansion/compression is a necessary outcome of efficient
learning. Here we extend our theoretical framework to artificial networks. We
show that minimizing the Bayes cost (mean of the...
Authors:
Renzhao Liang, Sizhe Xu, Chenggang Xie, Jingru Chen, Feiyang Ren, Shu Yang, Takahiro Yabe
Annotation:
Time series forecasting plays a pivotal role in critical domains such as
energy management and financial markets. Although deep learning-based
approaches (e.g., MLP, RNN, Transformer) have achieved remarkable progress, the
prevailing "long-sequence information gain hypothesis" exhibits inherent
limitations. Through systematic experimentation, this study reveals a
counterintuitive phenomenon: appropriately truncating historical data can
paradoxically enhance prediction accuracy, indicating that e...
Authors:
Reuben Dorent, Polina Golland, William Wells III
Annotation:
Mutual Information (MI) is a fundamental measure of statistical dependence
widely used in representation learning. While direct optimization of MI via its
definition as a Kullback-Leibler divergence (KLD) is often intractable, many
recent methods have instead maximized alternative dependence measures, most
notably, the Jensen-Shannon divergence (JSD) between joint and product of
marginal distributions via discriminative losses. However, the connection
between these surrogate objectives and MI re...
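For reference, the two quantities this abstract names have standard textbook definitions, written out below. The notation ($p_{XY}$, $p_X$, $p_Y$, $M$) is an assumption made here for illustration and may differ from the paper's; in the setting described, $P = p_{XY}$ and $Q = p_X \otimes p_Y$.

```latex
I(X;Y) = D_{\mathrm{KL}}\left(p_{XY} \,\|\, p_X \otimes p_Y\right),
\qquad
\mathrm{JSD}(P \,\|\, Q) = \tfrac{1}{2} D_{\mathrm{KL}}(P \,\|\, M) + \tfrac{1}{2} D_{\mathrm{KL}}(Q \,\|\, M),
\quad M = \tfrac{1}{2}(P + Q).
```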
Authors:
Ali Hussaini Umar, Franky Kevin Nando Tezoh, Jean Barbier, Santiago Acevedo, Alessandro Laio
Annotation:
In supervised classification tasks, models are trained to predict a label for
each data point. In real-world datasets, these labels are often noisy due to
annotation errors. While the impact of label noise on the performance of deep
learning models has been widely studied, its effects on the networks' hidden
representations remain poorly understood. We address this gap by systematically
comparing hidden representations using the Information Imbalance, a
computationally efficient proxy of conditi...
Authors:
Shuangyi Chen, Ashish Khisti
Annotation:
We study black-box detection of machine-generated text under practical
constraints: the scoring model (proxy LM) may mismatch the unknown source
model, and per-input contrastive generation is costly. We propose SurpMark, a
reference-based detector that summarizes a passage by the dynamics of its token
surprisals. SurpMark quantizes surprisals into interpretable states, estimates
a state-transition matrix for the test text, and scores it via a generalized
Jensen-Shannon (GJS) gap between the test...
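The first two steps described in this abstract (quantizing surprisals into states, then estimating a state-transition matrix) might look roughly like the sketch below. The abstract does not say how the quantization is done, so the fixed bin edges, the sample surprisal values, and the function names `quantize` and `transition_matrix` are all hypothetical. The GJS scoring step is left out because the abstract is truncated before describing it.

```python
from bisect import bisect_right


def quantize(surprisals, edges):
    """Map each surprisal to a state index, given sorted bin edges."""
    return [bisect_right(edges, s) for s in surprisals]


def transition_matrix(states, n_states):
    """Row-normalized counts of state -> next-state transitions."""
    counts = [[0] * n_states for _ in range(n_states)]
    for current, following in zip(states, states[1:]):
        counts[current][following] += 1
    matrix = []
    for row in counts:
        total = sum(row)
        # Rows with no observed transitions are left as all zeros.
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix


# Hypothetical per-token surprisals and hypothetical bin edges
# (three states: low, medium, high). Neither comes from the paper.
surprisals = [0.2, 1.5, 3.1, 0.4, 2.2, 0.1]
edges = [1.0, 2.5]

states = quantize(surprisals, edges)
matrix = transition_matrix(states, n_states=len(edges) + 1)
```

Each row of `matrix` is the empirical distribution over the next state given the current one, which is the kind of object a divergence-based score could then compare against reference statistics.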
📄 Efficient Generalization via Multimodal Co-Training under Data Scarcity and Distribution Shift
2025-10-11
Authors:
Tianyu Bell Pan, Damon L. Woodard
Annotation:
This paper explores a multimodal co-training framework designed to enhance
model generalization in situations where labeled data is limited and
distribution shifts occur. We thoroughly examine the theoretical foundations of
this framework, deriving conditions under which the use of unlabeled data and
the promotion of agreement between classifiers for different modalities lead to
significant improvements in generalization. We also present a convergence
analysis that confirms the effectiveness of ...
📄 Some theoretical improvements on the tightness of PAC-Bayes risk certificates for neural networks
2025-10-11
Authors:
Diego García-Pérez, Emilio Parrado-Hernández, John Shawe-Taylor
Annotation:
This paper presents four theoretical contributions that improve the usability
of risk certificates for neural networks based on PAC-Bayes bounds. First, two
bounds on the KL divergence between Bernoulli distributions enable the
derivation of the tightest explicit bounds on the true risk of classifiers
across different ranges of empirical risk. The paper next focuses on the
formalization of an efficient methodology based on implicit differentiation
that enables the introduction of the optimizatio...
Authors:
Nandan Kumar Jha, Brandon Reagen
Annotation:
As large language models (LLMs) scale, the question is not only how large
they become, but how much of their capacity is effectively utilized. Existing
scaling laws relate model size to loss, yet overlook how components exploit
their latent space. We study feed-forward networks (FFNs) and recast width
selection as a spectral utilization problem. Using a lightweight diagnostic
suite -- Hard Rank (participation ratio), Soft Rank (Shannon rank), Spectral
Concentration, and the composite Spectral Ut...
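As a rough illustration of the two rank diagnostics this abstract names, the sketch below uses the common textbook definitions: the participation ratio of a spectrum, and the exponential of the Shannon entropy of the normalized spectrum. Whether the paper applies them to eigenvalues or singular values, and how it normalizes them, is not stated in the truncated abstract, so the function names and inputs here are assumptions.

```python
import math


def participation_ratio(eigenvalues):
    """(sum of values)^2 / (sum of squared values)."""
    total = sum(eigenvalues)
    return total * total / sum(v * v for v in eigenvalues)


def shannon_rank(eigenvalues):
    """exp of the Shannon entropy of the normalized spectrum."""
    total = sum(eigenvalues)
    # Zero entries contribute nothing to the entropy, so skip them.
    probs = [v / total for v in eigenvalues if v > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return math.exp(entropy)


# Hypothetical spectra: one perfectly flat, one fully concentrated.
flat = [1.0, 1.0, 1.0, 1.0]
peaked = [1.0, 0.0, 0.0, 0.0]

flat_ranks = (participation_ratio(flat), shannon_rank(flat))
peaked_ranks = (participation_ratio(peaked), shannon_rank(peaked))
```

Under these definitions both measures equal the number of dimensions for a flat spectrum and drop to 1 when all the energy sits in a single direction, which is why they can serve as "effective rank" diagnostics.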
Authors:
Zihui Zhao, Yuanbo Tang, Jieyu Ren, Xiaoping Zhang, Yang Li
Annotation:
Dictionary learning is traditionally formulated as an $L_1$-regularized
signal reconstruction problem. While recent developments have incorporated
discriminative, hierarchical, or generative structures, most approaches rely on
encouraging representation sparsity over individual samples that overlook how
atoms are shared across samples, resulting in redundant and sub-optimal
dictionaries. We introduce a parsimony promoting regularizer based on the
row-wise $L_\infty$ norm of the coefficient matri...
Authors:
Haozhe Lei, Hao Guo, Tommy Svensson, Sundeep Rangan
Annotation:
Modern wireless systems require not only position estimates, but also
quantified uncertainty to support planning, control, and radio resource
management. We formulate localization as posterior inference of an unknown
transmitter location from receiver measurements. We propose Monte Carlo
Candidate-Likelihood Estimation (MC-CLE), which trains a neural scoring network
using Monte Carlo sampling to compare true and candidate transmitter locations.
We show that in line-of-sight simulations with a mu...
Showing 21-30 of 58 entries