农业图书情报学报

• •    

AI模型元数据规范发展现状与构建研究

姜恩波1,2, 秦瑜1,2   

  1. 1. 中国科学院成都文献情报中心,成都 610041
    2. 中国科学院大学 经济与管理学院,北京 100190
  • 收稿日期:2025-06-22 出版日期:2025-10-29
  • 作者简介:

    姜恩波,正高级工程师,中国科学院成都文献情报中心知识系统部,副主任,研究方向为数字图书馆平台建设

    秦瑜,硕士研究生,中国科学院成都文献情报中心,研究方向为智慧数据与智慧图书馆

  • 基金资助:
    中国科学院文献情报能力建设专项“科技态势感知与分析能力建设”

Development and Construction of Metadata Specifications for AI Models

JIANG Enbo1,2, QIN Yu1,2   

  1. 1. Chengdu Library and Information Center, Chinese Academy of Sciences, Chengdu 610041
    2. School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190
  • Received:2025-06-22 Online:2025-10-29

摘要:

【目的/意义】 本研究旨在回应当前人工智能模型不透明性、难以解释、可追溯性差等问题,提出建立统一的AI模型元数据规范,以提升模型的可发现性、透明度、互操作性和可重用性,进而推动可信任AI的发展。 【方法/过程】 文章以元数据质量评估理论和生命周期理论为基础,采用文献调研法、比较分析、问卷调查等方法,系统梳理和分析国内外已有的AI模型元数据实践,深入调查用户对元数据的认知与需求,并提出面向全生命周期的元数据构建方案。 【结果/结论】 用户认为AI模型元数据规范重要但对现有规范并不了解。现有AI模型元数据规范在元素命名、组织架构、内容细粒度解释等方面存在明显短板,影响模型信息的共享与复用。为此,文章提出了一个元数据框架,涵盖模型、数据、算法、技术特征、性能评估、风险与伦理、法律信息、相关资源等核心实体,并描述其间语义关系。研究认为,建立统一的AI模型元数据框架不仅有助于模型的信息化管理和平台互联互通,也将成为连接技术、伦理与治理的重要基础设施。未来,随着规范体系的不断完善与行业采纳,AI模型将更具可控性与可信赖性,推动技术生态的规范发展与跨界融合。

关键词: AI模型, 模型透明度, 元数据规范, 人工智能

Abstract:

[Purpose/Significance As artificial intelligence (AI) systems are being widely deployed across diverse domains such as education, healthcare, and public governance, the absence of standardized metadata specifications has led to fragmented descriptions, inconsistent documentation, and difficulties in model evaluation and reuse. This study aims to address the pressing issues of opacity, lack of interpretability, and poor traceability in current AI models, which have increasingly become obstacles to the development of transparent and responsible AI. To overcome these challenges, this study proposes the establishment of a unified metadata specification for AI models to enhance their discoverability, transparency, interoperability, and reusability, thereby advancing the development of trustworthy AI and facilitating effective model governance. [Method/Process Grounded in metadata quality assessment theory and lifecycle theory, the study adopted a combination of research methods, including literature review, comparative analysis of existing specifications, and questionnaire surveys.We first conducted a systematic examination of domestic and international practices related to AI model metadata specifications to identify representative standards, frameworks, and implementation approaches. Through comparative analysis, the study investigated the structure, element organization, and semantic relationships of different specifications, highlighting their similarities, differences, and areas for improvement. Meanwhile, a targeted questionnaire survey was administered to researchers, developers, and practitioners to explore user awareness, perceptions, practical experiences, and specific needs regarding metadata specification and interoperability. Based on these findings, the study ultimately proposed a lifecycle-oriented framework for metadata specification construction, ensuring that it aligns with the key stages of AI model development, deployment, evaluation, and governance. [Results/Conclusions The findings reveal that, although users generally recognize the importance of metadata specifications for AI models, they are unaware of of the existing specifications. The current AI model metadata specifications have significant shortcomings in terms of element naming, structural organization, and descriptive granularity. These shortcomings hinder the effective sharing and reuse of model information. In response, the study proposed a comprehensive metadata framework encompassing key entities such as models, datasets, algorithms, technical features, performance evaluations, risks and ethics, legal information, and related resources, as well as the semantic relationships among these entities. The research concluded that establishing a unified metadata specification for AI models not only contributes to effective information management and cross-platform interoperability, but also serves as a critical infrastructure that links technology, ethics, and governance. As the metadata specification system matures and gains wider industry adoption, AI models will become increasingly controllable and trustworthy. This will promote a more regulated, collaborative, sustainable and integrated AI ecosystem.

Key words: AI models, model transparency, metadata specification, AI

中图分类号:  G251

引用本文

姜恩波, 秦瑜. AI模型元数据规范发展现状与构建研究[J/OL]. 农业图书情报学报. https://doi.org/10.13998/j.cnki.issn1002-1248.25-0338.

JIANG Enbo, QIN Yu. Development and Construction of Metadata Specifications for AI Models[J/OL]. Journal of library and information science in agriculture. https://doi.org/10.13998/j.cnki.issn1002-1248.25-0338.