BIRD: Bronze Inscription Restoration and Dating

2511.01589v1 cs.CL, I.2.7 2025-11-06
Авторы:

Wenjie Hua, Hoang H. Nguyen, Gangyan Ge

Abstract

Bronze inscriptions from early China are fragmentary and difficult to date. We introduce BIRD(Bronze Inscription Restoration and Dating), a fully encoded dataset grounded in standard scholarly transcriptions and chronological labels. We further propose an allograph-aware masked language modeling framework that integrates domain- and task-adaptive pretraining with a Glyph Net (GN), which links graphemes and allographs. Experiments show that GN improves restoration, while glyph-biased sampling yields gains in dating.

Ссылки и действия