
2014, Vol. 26, Issue (4): 36-40. doi: 10.13998/j.cnki.issn1002-1248.2014.4.009

• Resources construction •

On the Data Acquisition and Cleaning Preparation for Bibliometric Analysis: A Case Study of the Chinese Journal Papers of CAFS

YAN Xue, OUYANG Haiying, ZENG Shou-ying, GE Chang-shui, TANG Lin, SHAO Ping, CHEN Bai-song

  1. Chinese Academy of Fishery Sciences; Beijing 100141, China
  • Received: 2013-10-30   Online: 2014-04-05   Published: 2014-06-06

Abstract: Data preparation is the foundation of bibliometric analysis and determines the objectivity and accuracy of the final results. Drawing on experience analyzing the scientific literature output of CAFS, this paper discusses methods for data preparation in two stages: data acquisition and data cleaning. Using data from both literature databases and the scientific research management department, it summarizes the main forms of dirty data and the methods for cleaning them with VBA programming. Practical application has shown that this approach can significantly improve the efficiency and accuracy of literature analysis.

Key words: Bibliometric Analysis, Data Acquisition, Data Cleaning

CLC Number: 

  • G350