
Journal of Library and Information Science in Agriculture, 2014, Vol. 26, Issue 4: 36-40. doi: 10.13998/j.cnki.issn1002-1248.2014.4.009

• Resources construction

On Data Acquisition and Cleaning in Preparation for Bibliometric Analysis: A Case Study of the Chinese Journal Papers of CAFS

YAN Xue, OUYANG Haiying, ZENG Shou-ying, GE Chang-shui, TANG Lin, SHAO Ping, CHEN Bai-song

  1. Chinese Academy of Fishery Sciences, Beijing 100141, China
  • Received: 2013-10-30 Online: 2014-04-05 Published: 2014-06-06

Abstract: Data preparation is the foundation of bibliometric analysis and determines the objectivity and accuracy of the final results. Drawing on experience analyzing the scientific literature output of the Chinese Academy of Fishery Sciences (CAFS), this paper discusses methods for the two stages of data preparation: data acquisition and data cleaning. Using data from both the literature database and the scientific research management department, it summarizes the main forms of dirty data and describes methods for cleaning them with VBA programming. Practical application has shown that this approach significantly improves the efficiency and accuracy of literature analysis.

Key words: Bibliometric Analysis, Data Acquisition, Data Cleaning

CLC Number: G350
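
The abstract describes combining records from a literature database with records from the research management department and then cleaning the result. The paper's own implementation uses VBA and is not reproduced on this page; the following is only a minimal Python sketch of one plausible cleaning step, namely removing duplicate records after merging the two sources. The field names `title` and `year`, the function names, and the choice of duplicate key are all assumptions made for illustration, not details taken from the paper.

```python
import unicodedata


def normalize_title(title: str) -> str:
    """Fold full-width characters, letter case, and whitespace runs.

    Hypothetical helper: the paper does not specify its normalization rules.
    """
    folded = unicodedata.normalize("NFKC", title)
    return " ".join(folded.casefold().split())


def merge_records(database_records, department_records):
    """Merge two record lists, keeping the first record seen per (title, year).

    Assumes each record is a dict with 'title' and 'year' keys (illustrative).
    """
    merged = {}
    for record in list(database_records) + list(department_records):
        key = (normalize_title(record["title"]), str(record["year"]).strip())
        # setdefault keeps the earlier record, so database records win ties.
        merged.setdefault(key, record)
    return list(merged.values())
```

Listing the database records first means that, where both sources hold the same paper, the database version is the one retained. A real workflow would likely need further rules, for example for author and institution name variants, which this sketch does not attempt.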