Estimating Bidirectional Causal Effects with Large Scale Online Kernel Learning
2511.05050v1
stat.ML, cs.LG, stat.ME
2025-11-11
Авторы:
Masahiro Tanaka
Abstract
In this study, a scalable online kernel learning framework is proposed for
estimating bidirectional causal effects in systems characterized by mutual
dependence and heteroskedasticity. Traditional causal inference often focuses
on unidirectional effects, overlooking the common bidirectional relationships
in real-world phenomena. Building on heteroskedasticity-based identification,
the proposed method integrates a quasi-maximum likelihood estimator for
simultaneous equation models with large scale online kernel learning. It
employs random Fourier feature approximations to flexibly model nonlinear
conditional means and variances, while an adaptive online gradient descent
algorithm ensures computational efficiency for streaming and high-dimensional
data. Results from extensive simulations demonstrate that the proposed method
achieves superior accuracy and stability than single equation and polynomial
approximation baselines, exhibiting lower bias and root mean squared error
across various data-generating processes. These results confirm that the
proposed approach effectively captures complex bidirectional causal effects
with near-linear computational scaling. By combining econometric identification
with modern machine learning techniques, the proposed framework offers a
practical, scalable, and theoretically grounded solution for large scale causal
inference in natural/social science, policy making, business, and industrial
applications.