农业图书情报学报 ›› 2023, Vol. 35 ›› Issue (3): 52-70.doi: 10.13998/j.cnki.issn1002-1248.23-0214

• 研究论文 • 上一篇    下一篇

基于自适应特征选择的科研论文跨学科性测度方法研究

王晋飞1, 孙巍1,2,*, 张学福1,2, 杨璐1   

  1. 1.中国农业科学院农业信息研究所,北京 100081;
    2.农业农村部 农业大数据重点实验室,北京 100081
  • 收稿日期:2023-02-04 出版日期:2023-03-05 发布日期:2023-05-31
  • 通讯作者: *孙巍(1973- ),女,博士,研究员,博士生导师,研究方向为信息检索可视化、数据挖掘、知识组织。Email:sunwei@caas.cn
  • 作者简介:王晋飞(1999- ),女,硕士,研究方向为信息管理与信息系统。张学福(1966- ),男,博士,研究员,博士生导师,研究方向为信息资源管理。杨璐(1988- ),男,博士,研究方向为数据分析与模型计算。
  • 基金资助:
    NSTL文献专项任务基金项目“下一代开放知识服务平台关键技术优化集成与系统研发”(2022XM28)子课题“基于专利的领域技术分析与预测系统集成”; 中国农业科学院基本科研业务费专项课题“农业研究热点前沿探测及领域技术布局分析研究”(Y2022ZK07)

Interdisciplinarity Measurement Method of Scientific Research Papers based on Adaptive Feature Selection

WANG Jinfei1, SUN Wei1,2,*, ZHANG Xuefu1,2, YANG Lu1   

  1. 1. Institute of Agricultural Information, Chinese Academy of Agricultural Sciences, Beijing 100081;
    2. Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081
  • Received:2023-02-04 Online:2023-03-05 Published:2023-05-31

摘要: [目的/意义]跨学科研究能够通过知识整合和渗透,创造性地解决自然环境和人类社会中的复杂问题。随着跨学科研究成果的大量增长,跨学科性测度评估变得越来越有必要,如何构建有效的跨学科性测度方法,实现对论文跨学科性综合全面的测度是亟待解决的问题。[方法/过程]本研究首先基于跨学科研究的内涵和特点,从学科属性、知识网络拓扑结构和知识整合文本内容3个维度提取科研论文跨学科性特征指标,并给出特征指标的计算方法;其次,对跨学科性特征指标进行自适应计算,构建一种基于机器学习的跨学科性测度方法;最后,以植物纳米生物技术领域为例进行实证研究,对领域中高跨学科性的论文进行识别和筛选。[结果/结论]本文提出的自适应特征选择能够对跨学科性相关特征指标进行有效筛选,提升结果的可靠性,实现全面、深入的科研论文跨学科性测度。该测度方法避免了定性评估可能会出现的主观性缺陷以及不同测度指标可能出现相互矛盾结果的问题,为跨学科性测度提供新的思路与方向。

关键词: 跨学科性, 自适应特征选择, 论文测度

Abstract: [Purpose/Significance] Interdisciplinary research can creatively solve complex problems in natural environment and human society through knowledge integration and penetration. With the increase of interdisciplinary research results, the evaluation of interdisciplinarity becomes increasingly necessary. How to establish an effective method for interdisciplinarity measurement and achieve a comprehensive measurement of scientific research papers is an urgent problem to be solved. [Method/Process] Based on the above background, this study takes the data of scientific research papers as the analysis source, deconstructs the interdisciplinarity of scientific research papers from multiple dimensions, constructs the feature set of interdisciplinarity of scientific research papers, and on this basis proposes the method for measuring interdisciplinarity based on the adaptive method of machine learning, and conducts a comprehensive measurement of interdisciplinarity. This study has certain positive significance for researchers to understand the interdisciplinary papers in the field. The work process is as follows: First of all, the basic concepts of interdisciplinarity are sorted out and related concepts are discriminated, and the index of interdisciplinarity of different dimensions is analyzed. Based on the connotation and characteristics of interdisciplinary research, the characteristic index of interdisciplinarity of scientific research papers is extracted from three dimensions: subject attribute, knowledge network topology and knowledge integration text content. Secondly, an interdisciplinarity measurement method based on machine learning is constructed. By analyzing information gain and feature similarity of input indexes and data in feature sets, a feature selection calculation method based on adaptive feature selection is proposed, and the accuracy of feature classification is maximized by machine learning classifier. At the same time, the feature subset that can best express the interdisciplinary is selected based on the adaptive selection of the minimum number of features, and the selected adaptive feature set is used in the calculation of the interdisciplinary of the paper, and the results of the calculation of the original feature set are analyzed comprehensively. Finally, an empirical study was carried out in the field of plant nanobiotechnology to verify the effectiveness of the index system and adaptive feature selection listed above, identify and screen papers with high interdisciplinarity in the field, measure the interdisciplinarity of papers and identify key influencing factors based on the calculation of subject attributes, knowledge network topology and knowledge integration text content features. [Results/Conclusions] The main empirical results show that, among the subject attributes, the balance degree and the difference degree have a greater effect on the interdisciplinary evaluation. The overall effect of knowledge network topology structure features is satisfactory, the distribution breadth of knowledge integration text content features has a greater effect on interdisciplinary evaluation, and the calculation effect is further improved by fitness weighted summation of each feature. The results demonstrate that the adaptive feature selection proposed in this paper can effectively screen the interdisciplinary related feature indexes, improve the reliability of the results, and achieve a comprehensive and in-depth measurement of the interdisciplinary of scientific research papers. This measure method avoids the subjective defects that may occur in qualitative evaluation and the problems that different measure indicators may produce contradictory results. It provides a new idea and direction for interdisciplinary measurement.

Key words: interdisciplinarity, adaptive feature selection, paper measurement

中图分类号: 

  • TP391.1

引用本文

王晋飞, 孙巍, 张学福, 杨璐. 基于自适应特征选择的科研论文跨学科性测度方法研究[J]. 农业图书情报学报, 2023, 35(3): 52-70.

WANG Jinfei, SUN Wei, ZHANG Xuefu, YANG Lu. Interdisciplinarity Measurement Method of Scientific Research Papers based on Adaptive Feature Selection[J]. Journal of Library and Information Science in Agriculture, 2023, 35(3): 52-70.