人工智能评价指标体系不仅是全球人工智能创新发展的风向标,更隐含了发起国的人工智能国家意志和国家战略。文章首先对全球范围内22个具有代表性的人工智能评价指数进行梳理,根据评价对象和评价目标构建了一个包含综合实力、发展基础、研发能力、伦理治理和市场应用五个类别的人工智能评价指标体系分类框架;其次剖析了各类评价指标体系在目标设定、指标侧重和方法论上的差异,揭示了其背后隐含的国家战略、政策导向和价值观;最后从评价范式、评价话语权、评价目标、评价时效四个维度系统地分析当前全球人工智能评价指标体系的发展趋势。
Artificial intelligence evaluation index systems not only serve as a weather vane for the innovative development of global AI but also implicitly reflect the national will and strategic interests of their originating countries. This paper begins by reviewing 22 representative AI evaluation indices from around the world. Based on the evaluation subjects and objectives, it constructs a classification framework for AI evaluation index systems, comprising five categories: comprehensive strength, development foundation, R&D capabilities, ethics and governance, and market application. Secondly, the paper provides an analysis of the differences among these various evaluation index systems in terms of their goal-setting, areas of emphasis in their indicators, and methodologies, revealing the underlying national strategies, policy orientations, and values. Finally, it systematically analyzes four major trends in current global AI evaluation index systems from the perspectives of evaluation paradigms, the right to discourse in evaluation, evaluation objectives, and the timeliness of evaluation.