档案文献遗产高质量数据集建设路径研究

  • 打印
  • 收藏
收藏成功


打开文本图片集

Abstract: High- quality datasets have becomea critical strategic resource fornational tech nological competition and the construction ofcultural soft power. Archival documentary heritage, asascarceresourceencompassing informational value,identitysignificance,and data asset attributes,holds great importance in enhancingthe performance of large languagemodelsand safeguarding cultural sovereignty.Based on the multifaceted characteristics of archival documentary heritage value and guided by the "three highs" principle-high- value application, high knowledge density,and high technological content— thisstudy constructs a nine- dimensional demand matrix tailored for large language model training and knowledge services. It systematically analyzes current challenges in data resource development,data knowledge transformation,and technologicalempowerment.Furthermore,a "three- step"implementation pathway centered on"system planning-engineering construction— qualityinspection"isproposed.Thisapproach aimsto facilitate the transformation of archival documentary heritage from fragmented resourcesinto high-quality,circulatable,and trustworthy data assets,providing a theoretical framework andpractical reference for constructing highquality datasets that support the national cultural digitalization strategy and the development of artificial intelligence.

Keywords:High-quality dataset;Archival documentary heritage;Dataelement;National cultural digitalization;Al

随着人工智能发展进入“数据驱动"的新阶段,高质量数据集越来越成为提升国家核心竞争力和维护国家安全的关键性生产要素和基础性战略资源。(剩余16201字)

monitor
客服机器人