A Spelling Correction Method for Traditional Chinese Medicine Texts Based on Attention Mechanism and Multi-Feature Fusion

CLC number: TP391.1  Document code: A  Article ID: 2096-4706(2026)03-0076-06
Abstract: The spelling correction method for Traditional Chinese Medicine (TCM) texts aims to identify and correct spelling errors in unstructured TCM texts. High-quality TCM texts facilitate the application and research of downstream tasks. Traditional spelling correction models lack the ability to deeply fuse and extract text features, failing to fully explore and understand contextual information. To address the above issues, a spelling correction model based on Attention Mechanism and multi-feature fusion, named TCM-CSC, is proposed to enhance the correction capability. TCM-CSC is trained on the ChineseBERT pre-trained model, and a syllable information classifier with a multi-head Attention Mechanism is added on top of its character, pinyin, and glyph embedding layers. This classifier extracts syllable feature information and integrates it into the correction process as an auxiliary signal, enabling it to better capture the association between syllables and characters. Experimental results show that TCM-CSC precision and F1 score reached 77.3% and 81.99%, respectively, in the spelling correction task of TCM texts, significantly improving the model's correction performance.
Keywords: Traditional Chinese Medicine text; spelling correction; Attention Mechanism; multi-feature fusion
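The architecture outlined in the abstract can be sketched roughly as follows. This is only a minimal NumPy illustration under stated assumptions, not the authors' implementation: the fusion step (concatenation followed by a linear projection), all dimensions, and the names `multi_head_self_attention`, `syllable_classifier`, and `init_params` are hypothetical, and random vectors stand in for the pre-trained model's character, pinyin, and glyph embeddings.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_self_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Scaled dot-product self-attention. x: (seq_len, d_model)."""
    seq_len, d_model = x.shape
    d_head = d_model // num_heads

    def split(t):
        # (seq_len, d_model) -> (num_heads, seq_len, d_head)
        return t.reshape(seq_len, num_heads, d_head).transpose(1, 0, 2)

    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)  # (heads, seq, seq)
    context = softmax(scores) @ v                        # (heads, seq, d_head)
    merged = context.transpose(1, 0, 2).reshape(seq_len, d_model)
    return merged @ w_o

def syllable_classifier(char_emb, pinyin_emb, glyph_emb, params, num_heads=4):
    # Assumption: the three feature views are fused by concatenation
    # plus a linear projection; the paper may fuse them differently.
    fused = np.concatenate([char_emb, pinyin_emb, glyph_emb], axis=-1)
    fused = fused @ params["w_fuse"]
    attended = multi_head_self_attention(
        fused, params["w_q"], params["w_k"], params["w_v"], params["w_o"],
        num_heads,
    )
    # Per-character distribution over syllable classes (auxiliary signal).
    return softmax(attended @ params["w_cls"])

def init_params(d_emb, d_model, num_syllables, rng):
    # Hypothetical random weights; a real model would learn these.
    def w(rows, cols):
        return rng.normal(0.0, 0.02, size=(rows, cols))
    return {
        "w_fuse": w(3 * d_emb, d_model),
        "w_q": w(d_model, d_model),
        "w_k": w(d_model, d_model),
        "w_v": w(d_model, d_model),
        "w_o": w(d_model, d_model),
        "w_cls": w(d_model, num_syllables),
    }

# Usage: 6 characters, toy dimensions, 10 hypothetical syllable classes.
rng = np.random.default_rng(0)
seq_len, d_emb, d_model, num_syllables = 6, 8, 16, 10
params = init_params(d_emb, d_model, num_syllables, rng)
char_emb, pinyin_emb, glyph_emb = (
    rng.normal(size=(seq_len, d_emb)) for _ in range(3)
)
probs = syllable_classifier(char_emb, pinyin_emb, glyph_emb, params)
```

Here `probs` holds one probability distribution over syllable classes per input character. In a full model, this output (or the attended features behind it) would be combined with the character-level correction head, which is the part the abstract describes as an auxiliary signal.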
0 Introduction
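TCM texts are both an important carrier of China's fine traditional culture and a crystallization of medical wisdom. They carry thousands of years of TCM culture and practical experience, and hold irreplaceable value for historical transmission, modern TCM applications, clinical practice guidance, international dissemination, and digital preservation.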