基于DRSN-Conformer的电力调度语音识别

  • 打印
  • 收藏
收藏成功


打开文本图片集

中图分类号:TN912.3-34 文献标识码:A 文章编号:1004-373X(2026)06-0112-08

Power dispatching speech recognition based on DRSN-Conformer

HAN Yaxu1,GAO Lu1, ZHANGFei²,QIN Ling1 ,WANG Yongping1, ZHANG Xiaolin1 o 2.SchloftomatinandEletricalEineeingrogoliaUvesityfieneandholootou4)

Abstract:Inallusion totheproblemsof Transformer network-basedspeech recognitionmodels,such asinsuficientability toextractaudiofeaturesinpowerdispatchingnoisescenarios,lowaccuracyinrecognizingprofesional terms,andpoor robustness in powerdispatch noise scenarios,a powerdispatching scenariospeech recognition methodcombining deepresidual shrinkagenetwork(DRSN)andConformer network isproposed.InDRSN,twotypesofcontractionmodules,namelychanel thresholdsharingmoduleandindependentshrinkagemodule,aredesignedtolearnthenoisethresholdsofaudiosignals,thereby reducinginterferencecausedbynoise.TheaudiosignaloutputbytheDRSNblock isencoded intoaudiofeaturesbymeansof Conformerblock,andthecross-attentionisusedtofusethetextfeaturesencodedbythetextencodertoobtaintherelevance representationofaudioandtext.Themodelistrainedanddecodedbycombining withtheCTClossandAtentionloss.The experimentalresultsonthepublicdatasetsAshell-1,Thchs3O,andaninteralpowerdispatchdatasetofapowergridcompany inInerMongoliademonstratethat,incomparisonwiththeTransformermodel,thereisan8.5%decreaseintheultimate charactererrorrate,and5.2%decreaseinthecharactererorateofspecialtermrecognition,validatingtheefectivenessand advantages of the proposed method in solving the power dispatch speech recognition tasks.

Keywords: speech recognition; DRSN; Conformer; power dispatching; attention mechanism; BiLSTM

0 引言

随着各行各业电力需求的日益增加,电力系统调度工作愈发繁重。(剩余13331字)

monitor
客服机器人