BIRD: Bronze Inscription Restoration and Dating
2511.01589v1
cs.CL, I.2.7
2025-11-06
Авторы:
Wenjie Hua, Hoang H. Nguyen, Gangyan Ge
Abstract
Bronze inscriptions from early China are fragmentary and difficult to date.
We introduce BIRD(Bronze Inscription Restoration and Dating), a fully encoded
dataset grounded in standard scholarly transcriptions and chronological labels.
We further propose an allograph-aware masked language modeling framework that
integrates domain- and task-adaptive pretraining with a Glyph Net (GN), which
links graphemes and allographs. Experiments show that GN improves restoration,
while glyph-biased sampling yields gains in dating.
Ссылки и действия
Дополнительные ресурсы: