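Artificial intelligence and the digital economy are defining features of the present era. This article examines the connotation and characteristics, logical composition, and construction path of library data collections in the digital-intelligence era, and argues that, as AI technology develops, libraries need to shift from document management to data services. A data collection, understood as a combination of knowledge datasets intended for intelligent agents, is machine-oriented, interconnected, scenario-driven, and multi-granular. Its logical composition covers library business datasets (such as document corpus datasets, library user datasets, and library business activity datasets) as well as datasets for industry applications. To meet the service requirements of smart libraries, the article proposes methods for data collection planning and dataset construction, emphasizing the use of AI technology for data architecture design and data processing so as to fully release the data value of collection resources.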
Artificial intelligence and the digital economy are core characteristics of our era. This paper explores the connotation and characteristics, logical structure, and construction path of library data collections in the digital and intelligent era, and proposes that libraries need to shift from document management to data services as AI technology develops. A data collection is a combination of knowledge datasets of different granularities, formed by digitizing literature and intended for intelligent agents; it is machine-oriented, interconnected, scenario-driven, and multi-granular. The logical composition of a data collection includes document corpus datasets, library user datasets, library business activity datasets, and industry application datasets. The construction path of the data collection system should include both system architecture and content architecture. Among these, data architecture, data collection evaluation, and data value co-creation are key issues that constrain the development of data collection resources. To satisfy the service requirements of smart libraries, methods for data collection planning and dataset construction are proposed, with an emphasis on leveraging AI technology for data architecture design and data processing to fully unleash the data value of library resources.