GTransFusion：基于Transformer的多模态表示学习与图结构对齐的融合方法

打印
收藏

收藏成功

微博 QQ空间微信

打开文本图片集

中图分类号：TP242.6 文献标识码：A 文章编号：2096-4706（2026）04-0049-07

GTransFusion： Fusion Method of Multimodal Representation Learning and Graph Structure Alignment Based on Transformer

ZHANG Xian， PANG Hui， LIU Jiajun （SchoolofInformationEngineering，Hebei UniversityofArchitecture，Zhangjiakou O75ooo，China）

Abstract： With the emergence of multi-source medical data such as high-throughput genome sequencing and highresolution digitalpathologicalimages，multimodal biological modeling becomes the keytoartificialintellgence-asssted pathological dagnosis.Thisstudyproposesanewultimodalepresentationleaingmethod，GransFusion，tojointlyalye pathological Whole Slide Imagesandomicsdata，soasto improve thediagnosticaccuracyofvariouscancers.Thismethodmaps diferentmodaldata intoaunifiedsequencerepresentation througha Transformer-based jointrepresentationlearningmodule， explicitlymodels modal typeencoding intheprocess，andrealizesdynamicmodalweighting byvirtueoftheself-attention mechanism.Meanwhile，thismethodconstructsacross-modal featurealignmentgraphstructure，utilizesaGraphNeuralNetwork tocapture inter-modalassociationandcommoninformation，andfeedsbacktotheTransfomerrepresentationlearingtoealize cross-modalfeature alignmentandrelationshipmodeling.Experimentsonmultipletumordatasetsshowthattheproposed method is significantlysuperir tocomparisonmethodsinsurvivalpredictionperformanceindicators，whichverifes theefectivnesof multimodal joint representation and graph structure alignment.

Keywords： multimodal fusion; Transformer; heterogeneous graph; joint representation learning

0 引言

病理学是现代医学的基石，在癌症诊断和治疗规划中发挥着重要作用。（剩余10086字）

试读结束

购买全文6.00元下一篇基于Y0L0v8n的轻量化森林火灾烟雾检测算法研究

现代信息科技

2026年04期

¥18.00/本