MARIA: A Framework for Marginal Risk Assessment without Ground Truth in AI Systems
2510.27163v1
cs.SE, cs.AI, cs.HC, D.2.8; D.2.9.m; I.2
2025-11-04
Авторы:
Jieshan Chen, Suyu Ma, Qinghua Lu, Sung Une Lee, Liming Zhu
Abstract
Before deploying an AI system to replace an existing process, it must be
compared with the incumbent to ensure improvement without added risk.
Traditional evaluation relies on ground truth for both systems, but this is
often unavailable due to delayed or unknowable outcomes, high costs, or
incomplete data, especially for long-standing systems deemed safe by
convention. The more practical solution is not to compute absolute risk but the
difference between systems. We therefore propose a marginal risk assessment
framework, that avoids dependence on ground truth or absolute risk. It
emphasizes three kinds of relative evaluation methodology, including
predictability, capability and interaction dominance. By shifting focus from
absolute to relative evaluation, our approach equips software teams with
actionable guidance: identifying where AI enhances outcomes, where it
introduces new risks, and how to adopt such systems responsibly.