Special Topic: Knowledge Organization of Ancient Books in the Process of Chinese-Style Modernization

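An Empirical Study on the Knowledge Organization Tendency of Ancient Literature from Static Retrieval to Dynamic Representation*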

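  • 1. Chengdu Aeronautic Polytechnic, Chengdu, Sichuan 610100
    2. School of Business, Northwest Normal University, Lanzhou, Gansu 730070
Yu Beili (于蓓莉), female, librarian at Chengdu Aeronautic Polytechnic; research interests: information literacy and information technology. Liu Lei (刘蕾), female, master's student at the School of Business, Northwest Normal University. Zhou Wenjie (周文杰), male, professor at the School of Business, Northwest Normal University; research interest: information society issues.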

Received date: 2022-10-20

Online published: 2023-01-20

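Funding

*This article is one of the research outcomes of the National Natural Science Foundation of China General Program project "Research on the Micro-Level Mechanisms of Information Poverty and the Macro-Level Institutional Linkages of Information Poverty Reduction" (Project No. 71874141).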

An Empirical Study on the Knowledge Organization Tendency of Ancient Literatures from Static Retrieval to Dynamic Representation


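Abstract

The digitization of China's ancient texts has advanced rapidly, promoting the exploration and dissemination of traditional Chinese culture. In the digital era, the organization of ancient texts should move from static retrieval to dynamic representation; using new technologies to improve the efficiency of services built on ancient texts is of great significance for transforming how such texts are ordered and organized. Taking "World 3" as its theoretical basis, this article uses the jiayan (甲言) library and a hidden Markov model to segment the text of Shi Ji (《史记》) and count word frequencies, then visualizes the distribution of selected high-frequency words across the full text and considers the words that co-occur around them, giving readers fuller contextual information. The article applies natural language processing methods to ancient texts in an exploratory way, in the hope of offering a possible reference for the knowledge organization of ancient texts.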

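Cite this article

Yu Beili, Liu Lei, Zhou Wenjie. An Empirical Study on the Knowledge Organization Tendency of Ancient Literature from Static Retrieval to Dynamic Representation[J]. 图书与情报 (Library & Information), 2022, 42(05): 17-23. DOI: 10.11968/tsyqb.1003-6938.2022065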

Abstract

The rapid digitization of ancient literature in China has promoted the exploration and dissemination of traditional Chinese culture. In the digital era, the collation of ancient literature should be upgraded from static retrieval to dynamic representation, and using new technologies to improve the service efficiency of ancient literature is of great significance to the transformation and development of its classification. Based on the theory of "World 3", this study performs word segmentation and word-frequency counting on Shi Ji (Historical Records) using Jiayan and a Hidden Markov Model, then visualizes the full-text distribution of selected high-frequency words and takes into account the collocation of surrounding related words, so as to provide readers with more comprehensive contextual information. The natural language processing procedures applied to ancient literature in this study are expected to provide a possible reference for the knowledge organization of ancient books.
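As a rough illustration of the steps the abstract names (word-frequency counting, distribution of a word across the text, and surrounding collocates), the following is a minimal Python sketch. It assumes the text has already been segmented into tokens; the function names, parameters, and the tiny token list are hypothetical and not taken from the article, and the article's actual segmentation with Jiayan and a Hidden Markov Model is not reproduced here.

```python
from collections import Counter


def word_frequencies(tokens, min_len=2):
    """Count tokens, ignoring those shorter than min_len characters."""
    return Counter(t for t in tokens if len(t) >= min_len)


def distribution(tokens, target, bins=4):
    """Count occurrences of target in each of `bins` equal slices of the text."""
    counts = [0] * bins
    n = len(tokens)
    for i, tok in enumerate(tokens):
        if tok == target:
            counts[i * bins // n] += 1
    return counts


def collocates(tokens, target, window=2):
    """Count tokens found within `window` positions of each target occurrence."""
    near = Counter()
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        near.update(t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i)
    return near


# Hypothetical, hand-made token list used only for illustration;
# it is NOT a quotation from Shi Ji and NOT output of the Jiayan tokenizer.
tokens = ["高祖", "曰", "项羽", "兵", "强", "高祖", "入",
          "关中", "项羽", "怒", "高祖", "谢", "项羽"]

freq = word_frequencies(tokens)
print(freq.most_common(2))
print(distribution(tokens, "高祖"))
print(collocates(tokens, "项羽").most_common(3))
```

The per-slice counts from `distribution` are the kind of data a distribution plot could be drawn from, and the `window` parameter controls how much surrounding context counts as a collocate; both choices here are illustrative defaults rather than the settings used in the study.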