Unlocking the Power of Boltzmann Machines by Parallelizable Sampler and Efficient Temperature Estimation

2512.02323v1 cs.LG, quant-ph, stat.ML 2025-12-04
Авторы:

Kentaro Kubo, Hayato Goto

Abstract

Boltzmann machines (BMs) are powerful energy-based generative models, but their heavy training cost has largely confined practical use to Restricted BMs (RBMs) trained with an efficient learning method called contrastive divergence. More accurate learning typically requires Markov chain Monte Carlo (MCMC) Boltzmann sampling, but it is time-consuming due to the difficulty of parallelization for more expressive models. To address this limitation, we first propose a new Boltzmann sampler inspired by a quantum-inspired combinatorial optimization called simulated bifurcation (SB). This SB-inspired approach, which we name Langevin SB (LSB), enables parallelized sampling while maintaining accuracy comparable to MCMC. Furthermore, this is applicable not only to RBMs but also to BMs with general couplings. However, LSB cannot control the inverse temperature of the output Boltzmann distribution, which hinders learning and degrades performance. To overcome this limitation, we also developed an efficient method for estimating the inverse temperature during the learning process, which we call conditional expectation matching (CEM). By combining LSB and CEM, we establish an efficient learning framework for BMs with greater expressive power than RBMs. We refer to this framework as sampler-adaptive learning (SAL). SAL opens new avenues for energy-based generative modeling beyond RBMs.

Ссылки и действия

Связанные статьи

Investigation of D-Wave quantum annealing for training Restricted Boltzmann Mach...

## Контекст Область исследования сосредоточена на исследовании возможностей использования квантовых аннелинг-машин D-Wa...

2025-08-23

Comparison of D-Wave Quantum Annealing and Markov Chain Monte Carlo for Sampling...

## Контекст Область исследования связана с применением квантовых вычислений для решения задач семплирования с помощью Re...

2025-08-18

Comparison of D-Wave Quantum Annealing and Markov Chain Monte Carlo for Sampling...

## Контекст Сети Больцмана — дискретные статистические модели, применяемые в машинном обучении, визуальном распознавани...

2025-08-16