专题:人工智能评价体系研究

全球人工智能评价指标体系结构解构与评价范式研究*

展开
  • (1.扬州大学商学院   江苏扬州   225127)
    (2.北京大学信息管理系   北京   100871)
李翔,男,扬州大学商学院硕士研究生;李广建,男,北京大学信息管理系教授,博士生导师;罗立群,男,扬州大学商学院教授。

收稿日期: 2025-08-08

  网络出版日期: 2025-09-08

基金资助

*本文系国家社会科学基金重大项目“数智转型背景下智能情报关键技术应用研究”(项目编号:23ZD228)与江苏省社会科学基金重点项目“江苏省重点产业关键技术‘卡脖子’问题情报跟踪研究”(项目编号:22TQA001)研究成果之一。

Structural Deconstruction and Paradigm Analysis of Global AI Evaluation Indicator Systems

Expand

Received date: 2025-08-08

  Online published: 2025-09-08

摘要

全球人工智能评价体系存在“测量黑箱”与“战略误判”挑战。为系统性地解构其内在结构,文章整合了22个主流人工智能评价指数(637项原始指标),构建了包含307项可比指标的标准化库,并采用双维度分析(注意力分布与复杂网络建模)揭示其内在运行逻辑。研究首次识别并命名了该领域主导的国家能力本位的规模化增长评价范式。该范式以国家能力为评价核心,由“政策-技术”双引擎驱动,在衡量标准上存在显著的规模导向偏好,并因此在创新质量、成果转化等维度存在结构性盲区。文章通过提供这一范式的结构蓝图,为规避战略误判、设计更均衡的下一代评价框架提供了关键的实证路径。

本文引用格式

李 翔 李广建 罗立群 . 全球人工智能评价指标体系结构解构与评价范式研究*[J]. 图书与情报, 2025 , 45(04) : 15 -24 . DOI: 10.11968/tsyqb.1003-6938.2025042

Abstract

The global artificial intelligence (AI) evaluation system faces challenges of "measurement black boxes" and "strategic misjudgments." To systematically deconstruct its internal structure, this study integrates 22 mainstream AI evaluation indexes (637 original indicators), constructs a standardized library of 307 comparable indicators, and uses a dual-dimensional analysis (attention distribution and complex network modeling) to reveal its internal operational logic. The study, for the first time, identifies and names the dominant "national capacity-based, scale-oriented growth" evaluation paradigm in the field. This paradigm takes national capacity as its evaluation core, is driven by a "policy-technology" dual engine, exhibits a significant scale-oriented bias in its measurement standards, and consequently has structural blind spots in dimensions such as innovation quality and conversion efficiency. By providing a structural blueprint of this paradigm, this study offers a critical empirical pathway for avoiding strategic misjudgments and designing more balanced next-generation evaluation frameworks.
文章导航

/