中文    English

Journal of Library and Information Science in Agriculture ›› 2023, Vol. 35 ›› Issue (4): 19-31.doi: 10.13998/j.cnki.issn1002-1248.23-0293

Previous Articles     Next Articles

Comparative Study and Optimization Strategies of Knowledge Graph Construction Management Systems

MA Weilu1, XIAN Guojian1,2, ZHAO Ruixue1,3, LI Jiao1,3, HUANG Yongwen1,3, SUN Tan2,4, *   

  1. 1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081;
    2. Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081;
    3. Key Laboratory of Knowledge Mining and Knowledge Services in Agricultural Converging Publishing, National Press and Publication Administration, Beijing 100081;
    4. Chinese Academy of Agricultural Sciences, Beijing 100081
  • Received:2023-03-11 Online:2023-04-05 Published:2023-07-12

Abstract: [Purpose/Significance] Knowledge Graph has become a major research hotspot in the era of artificial intelligence due to its ability to provide a new means of organization and representation of knowledge. As the field continues to evolve, numerous scholars have proposed advanced algorithms and technologies for each core stage of constructing a knowledge graph, and many large domestic and foreign enterprises have also developed their independent knowledge graph management systems. However, the majority of these graph tools developed are designed for commercial use and are often too expensive and difficult to deploy locally for small and medium-sized research teams. This presents a challenge for information organizations such as research libraries with massive resources, which require a more adaptable, universal, and efficient tool to build and manage knowledge graphs. To meet this need, it is important to develop an open-source, user-friendly, and customizable knowledge graph management system that can be easily deployed by small and medium-sized research teams. [Method/Process] In summary, this article offers a thorough and informative analysis of six mainstream knowledge graph management systems, both domestically and internationally. It delves into the unique characteristics of each system within the business process and provides an in-depth comparative analysis based on several important factors, including system functionality, technology selection, open-source availability, and application domains. The article refers to the standard construction process of knowledge graphs and highlights the platform characteristics of each system during the construction process while also examining their limitations based on current data characteristics. In response to practical needs, the article focuses on multi-path, multi-engine, distributed, and collaborative construction, integrating advanced graph algorithms and considering a well-developed underlying graph storage strategy. [Results/Conclusions] As a result,the article presents an in-depth analysis of the construction model for a collaborative development and management system of an integrated knowledge graph. It not only investigates the current state of knowledge graph management systems but also proposes novel optimization ideas. These ideas include distributed collaborative construction, which allows for simultaneous contributions from multiple sources, and parallel management of multiple graphs, enabling efficient organization and retrieval. Additionally, some suggestions are put forward: developing multi-path knowledge extraction techniques to enhance the knowledge acquisition process, and using specialized multi-graph storage engines for optimized storage and retrieval. Last, the article emphasizes the importance of incorporating cross-media and multimodal knowledge into the graph for a comprehensive representation of information.

Key words: knowledge graph, collaborative parallelism, multi-path extraction, multi-graph engine, management system

CLC Number: 

  • G251
[1] 王萌, 王昊奋, 李博涵, 等. 新一代知识图谱关键技术综述[J]. 计算机研究与发展, 2022, 59(9): 1947-1965.
WANG M, WANG H F, LI B H, et al.Survey on key technologies of new generation knowledge graph[J]. Journal of computer research and development, 2022, 59(9): 1947-1965.
[2] BIZER C, HEATH T, BERNERS-LEE T, et al.Linked data: The story so far[M]. USA: IGI global, 2011.
[3] 王颖. 科技文献内容语义描述模型研究[J]. 农业图书情报学报, 2020, 32(8): 12-24.
WANG Y.Semantic Models for the Content of Scientific Literature[J]. Journal of library and information science in agriculture, 2020, 32(8): 12-24.
[4] Official google blog: Introducing the knowledge graph: things, not strings[EB/OL]. [2023-06-08].https://blog.google/products/search/introducing-knowledge-graph-things-not/.
[5] CiteSpace[EB/OL].[2023-05-30].https://sourceforge.net/projects/citespace/.
[6] 王东浩. 基于科学知识图谱的图书情报学科可视化比较研究——评《中外图书情报学科知识图谱比较研究》[J]. 中国科技论文, 2022, 17(10): 1180.
WANG D H.Comparative study on visualization of library and information science based on scientific knowledge map - Comment on comparative study of knowledge map of library and information science between China and foreign countries[J]. China sciencepaper, 2022, 17(10): 1180.
[7] 唐亮, 罗轩, 王颖. 两类知识图谱差异辨析及其在科技出版中的应用[J]. 出版参考, 2019(1): 66-68.
TANG L, LUO X, WANG Y.Differentiation and analysis of the differences between two types of knowledge maps and their application in sci-tech publishing[J]. Publishing reference, 2019(1): 66-68.
[8] 杨云飞, 穗志方. 面向医学知识图谱的可视化方法设计与实现[J]. 中文信息学报, 2022, 36(2): 40-48.
YANG Y F, SUI Z F.Design and implementation of visualization for medical knowledge graph[J]. Journal of Chinese information processing, 2022, 36(2): 40-48.
[9] 张海涛, 栾宇, 周红磊, 等. 总体国家安全观下重大突发事件的智能决策情报体系研究[J]. 情报学报, 2022, 41(11): 1174-1187.
ZHANG H T, LUAN Y, ZHOU H L, et al.Intelligent decision-making information system for major emergencies: A holistic approach to national security[J]. Journal of the China society for scientific and technical information, 2022, 41(11): 1174-1187.
[10] 马玉凤, 向南, 豆亚杰, 等. 军事系统工程中的知识图谱应用及研究[J]. 系统工程与电子技术, 2022, 44(1): 146-153.
MA Y F, XIANG N, DOU Y J, et al.Application and research of knowledge graph in military system engineering[J]. Systems engineering and electronics, 2022, 44(1): 146-153.
[11] 于升峰. 面向科技智库的知识图谱系统构建[J]. 智库理论与实践, 2021, 6(1): 56-64.
YU S F.Construction of mapping knowledge system for science and technology think tanks[J]. Think tank: Theory & practice, 2021, 6
12 (1): 56-64.
[12] 唐玫, 赵婉忻, 李晶, 等. 面向科学数据管理的机构知识库建设与启示[J]. 数字图书馆论坛, 2022(10): 65-72.
TANG M, ZHAO W X, LI J, et al.Development situation and enlightenment of the institutional repositories for research data management[J]. Digital library forum, 2022(10): 65-72.
[13] 021-2020 C T. 知识图谱构建平台认证技术规范[S]. 北京: 中国电子技术标准化研究院赛西实验室, 2020.
021-2020 C T. Technical specification for certification of knowledge graph construction platforms[S]. Beijing: Chinese Electronic Standardization Institute, CESI Laboratory, 2020.
[14] BALLARDINI R M, HE K, ROOS T.AI-generated content: Authorship and inventorship in the age of artificial intelligence[M]// Online distribution of content in the EU. England: Edward elgar publishing LTD, 2019.
[15] 2022人工智能分类排行[EB/OL]. [2023-03-14].https://baijiahao.baidu.com/s?id=1739952164626665131&wfr=spider&for=pc.
2022 AI classification ranking[EB/OL]. [2023-03-14].https://baijiahao.baidu.com/s?id=1739952164626665131&wfr=spider&for=pc.
[16] PoolParty[EB/OL]. [2023-03-14].https://www.poolparty.biz/.
[17] Hume - mission-critical graph analytics[EB/OL].[2023-03-14].https://graphaware.com/products/hume/.
[18] Neo4j[EB/OL]. [2023-05-30].https://neo4j.com/.
[19] RelationalAI: Build data apps with intelligence[EB/OL]. [2023-03-14].https://relational.ai/.
[20] 华为云知识图谱服务[EB/OL].[2023-03-14].https://support.huaweicloud.com/kg/index.html.
Huawei cloud knowledge graph service[EB/OL].[2023-03-14].https://support.huaweicloud.com/kg/index.html.
[21] 对象存储服务OBS[EB/OL].[2023-05-30].https://www.huaweicloud.com/product/obs.html.
Object storage service(OBS)[EB/OL].[2023-05-30].https://www.huaweicloud.com/product/obs.html.
[22] 百度AI开放平台[EB/OL]. [2023-03-14].https://ai.baidu.com/solution/kgaas.
Baidu AI open platform[EB/OL].[2023-03-14].https://ai.baidu.com/solution/kgaas.
[23] 百度知识图谱开放平台[EB/OL].[2023-05-30].https://kgopen.baidu.com/index.
Baidu knowledge graph open platform[EB/OL].[2023-05-30].https://kgopen.baidu.com/index.
[24] EasyData智能数据服务平台[EB/OL].[2023-05-30].https://ai.baidu.com/easydata/.
EasyData intelligent data service platform[EB/OL]. [2023-05-30].https://ai.baidu.com/easydata/.
[25] 北京大学知识图谱自动化构建平台gBuilder[EB/OL]. [2023-03-14]. http://openkg.cn/tool/gbuilder.
gBuilder: Beijing university knowledge graph automation construction platform[EB/OL]. [2023-03-14]. http://openkg.cn/tool/gbuilder.
[26] 北京大学知识库问答系统gAnswer[EB/OL]. [2023-05-30].https://answer.gstore.cn/pc/index.html.
gAnswer: Peking university knowledge base question-answering system[EB/OL]. [2023-05-30].https://answer.gstore.cn/pc/index.html.
[27] 北京大学原生图数据库系统gStore[EB/OL]. [2023-05-30].https://www.gstore.cn/pcsite/index.html#/.
gStore: Peking university native graph database system[EB/OL]. [2023-05-30].https://www.gstore.cn/pcsite/index.html#/.
[28] KHADIR A C, ALIANE H, GUESSOUM A.Ontology learning: Grand tour and challenges[J]. Computer science review, 2021, 39: 100339.
[29] 邓依依, 邬昌兴, 魏永丰, 等. 基于深度学习的命名实体识别综述[J]. 中文信息学报, 2021, 35(9): 30-45.
DENG Y Y, WU C X, WEI Y F, et al.A survey on named entity recognition based on deep learning[J]. Journal of Chinese information processing, 2021, 35(9): 30-45.
[30] SMIRNOV A, LEVASHOVA T.Knowledge fusion patterns: A survey[J]. Information fusion, 2019, 52: 31-40.
[31] CHEN X J, JIA S B, XIANG Y.A review: Knowledge reasoning over knowledge graph[J]. Expert systems with applications, 2020, 141: 112948.
[32] HOGAN A, BLOMQVIST E, COCHEZ M, et al.Knowledge graphs[J]. ACM computing surveys, 2021, 54(4): 71.
[33] MySQL[EB/OL]. [2023-05-30].https://www.mysql.com/.
[34] Oracle[EB/OL]. [2023-05-30].https://www.oracle.com/.
[35] Microsoft SQL server[EB/OL]. [2023-05-30].https://www.microsoft.com/zh-cn/sql-server/sql-server-downloads.
[36] PostgreSQL[EB/OL]. [2023-05-30].https://www.postgresql.org/.
[37] GraphDB[EB/OL]. [2023-05-30]. http://www.graphdb.net/.
[38] RDF4J[EB/OL]. [2023-05-30].https://rdf4j.org/.
[39] BGraph[EB/OL]. [2023-05-30].https://ai.baidu.com/tech/kg/bgraph.
[40] 王传庆, 李阳阳, 费超群, 等. 知识图谱平台综述[J]. 计算机应用研究, 2022, 39(11): 3201-3210.
WANG C Q, LI Y Y, FEI C Q, et al.Survey of knowledge graph platform[J]. Application research of computers, 2022, 39(11): 3201-3210.
[41] LI M L, NI Z H, TIAN L, et al.Research on hierarchical knowledge graphs of data, information, and knowledge based on multiple data sources[J]. Applied sciences, 2023, 13(8): 4783.
[42] 王鑫, 邹磊, 王朝坤, 等. 知识图谱数据管理研究综述[J]. 软件学报, 2019, 30(7): 2139-2174.
WANG X, ZOU L, WANG C K, et al.Research on knowledge graph data management: A survey[J]. Journal of software, 2019, 30(7): 2139-2174.
[43] CHEN Z K, ZHAO Y.The technology of military knowledge graph construction based on multiple open data sources[C]// 2020 5th international conference on mechanical, control and computer engineering(ICMCCE). Piscataway, New Jersey: IEEE, 2021: 1993-1997.
[44] GAO J, LI P, CHEN Z K, et al.A survey on deep learning for multimodal data fusion[J]. Neural computation, 2020, 32(5): 829-864.
[45] 刘宝珠, 王鑫, 柳鹏凯, 等. KGDB: 统一模型和语言的知识图谱数据库管理系统[J]. 软件学报, 2021, 32(3): 781-804.
LIU B Z, WANG X, LIU P K, et al.KGDB: Knowledge graph database system with unified model and query language[J]. Journal of software, 2021, 32(3): 781-804.
[46] Virtuoso[EB/OL]. [2023-05-30].https://sourceforge.net/projects/virtuoso/.
[47] NebulaGraph[EB/OL]. [2023-05-30].https://www.nebula-graph.io/.
[1] CHANG ZhiJun, XU LiYuan, YU QianQian, ZHANG JianYong, WANG YongJi. Scientific and Technical Literature Data Management System Based on Life Cycle Model [J]. Journal of Library and Information Science in Agriculture, 2022, 34(6): 36-49.
[2] YANG Xuehong, HE Bin, WANG Luyan. Circulation Services Based on an Intelligent Library Management System: A Case Study in the CAU Library [J]. Journal of Library and Information Science in Agriculture, 2021, 33(7): 92-99.
[3] YANG Siluo, TIAN Peilin, ZHU Chuanyu, QIU Junping. Characteristics of UNESCO's Humanities and Social Sciences Research: Topic, Evolution and Cooperation [J]. Journal of Library and Information Science in Agriculture, 2021, 33(6): 6-17.
[4] XU Yongle, CHEN Yuanyuan, YANG Tingting, WAN Xiangli. Comparative Analysis of the Research on the Influence of Chinese and International Think Tanks [J]. Journal of Library and Information Science in Agriculture, 2021, 33(11): 50-62.
[5] LYU Lucheng, HAN Tao. Artificial Intelligence Empowers Library and Information Service ——Review of Forums about Information Technology for Library 2019 [J]. Journal of Library and Information Science in Agriculture, 2020, 32(5): 13-18.
[6] LI Zhongjun, SUN Ruiying, ZHANG Tao. Analysis of the Research Status of Public Opinion Ecology in China Based on Bibliometrics (2004-2019) [J]. Journal of Library and Information Science in Agriculture, 2020, 32(2): 5-13.
[7] ZHANG Tao, SUN Ruiying, LI Zhongjun. Subject Clustering and Evolutionary Trend of Public Opinion Documents in China [J]. Journal of Library and Information Science in Agriculture, 2020, 32(2): 14-21.
[8] LIU Xuhui. Model Construction and Function Realization of Intelligence Management System of University Library in Cloud Environment [J]. Agricultural Library and Information, 2019, 31(10): 46-53.
[9] CHEN Qingyun, CAO Jianfei, CHEN Rongzhen. Research and Practices From the Thesaurus to Knowledge Graph [J]. Agricultural Library and Information, 2019, 31(1): 44-53.
[10] ZHI Yingying. Exploration on the Application of Machine Learning in Library Discover System —Taking the Discover Tool Yewno Based on Knowledge Graph as Example [J]. , 2018, 30(7): 47-50.
[11] CHEN Fen, ZHU Tianxiu. Research on University Library's Subject Service Based on Bibliometrics and Knowledge Graph Analysis [J]. , 2018, 30(1): 99-103.
[12] FENG Jiao. Application and Research of Fuzzy Comprehensive Evaluation Method in Library Management System for Liaoning University of Technology [J]. , 2017, 29(6): 44-46.
[13] ZHOU Jian, ZHAO Jianguo. The Overall Conception and Project Design of Military Information Retrieval Language Management and Service System [J]. , 2017, 29(5): 75-79.
[14] WEI Wei. Construction of Library Management System Based on Service Platform Design [J]. , 2017, 29(11): 57-60.
[15] HUANG Chun-xiao. The Realization of Student’s Papers Management System Based on NoteExpress [J]. , 2015, 27(9): 39-41.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!