中文    English

Journal of Library and Information Science in Agriculture ›› 2022, Vol. 34 ›› Issue (10): 57-69.doi: 10.13998/j.cnki.issn1002-1248.22-0330

;

Previous Articles     Next Articles

Structure-Utility of Descriptive Information of Agricultural Scientific Data from the Perspective of Users

FAN Zhixuan1, WANG Jian1, SA Xu1, ZHANG Guilan2   

  1. 1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 10008;
    2. Institute of Scientific and Technical Information of China, Beijing 100038
  • Received:2022-05-09 Online:2022-10-05 Published:2022-11-28

Abstract: [Purpose/Significance] This paper aims to study the structure-utility relationship of descriptive information of scientific data to provide a new perspective for the theoretical study of scientific data description and a reference for the best description of agricultural scientific data in the digital environment. [Method/Process] Based on information processing theory, the lens model, the probabilistic mental model theory and the adaptive decision-making behavior framework, the relationship model between descriptive information structure and informing utility was constructed. A situational experiment was designed according to the model. In this study, 47 postgraduates from 14 institutes were invited for quasi-experimental observation by using qualitative and quantitative methods such as eye-tracking, semi-structured interview and questionnaire. First, this study used a semi-structured interview to obtain a user's cognitive interpretation of fixation points and collected the descriptive items of agricultural scientific data and their use frequency by encoding the interview text. Second, this study combined descriptive item usage path coding and user judgment confidence to obtain the combination of descriptive items with high utility. Finally, the study used multiple regression analysis to identify the descriptive items with high utility and their predictive ability, and analyzed the impact of data literacy and data utilization type on the utility of descriptive items. [Results/Conclusions] The study identified 42 descriptive items of 11 categories of agricultural scientific data and their usage characteristics. Among them, the top 5 frequently used descriptive items were subject, data, overall description, source and data production information, which played an important role in user relevance judgment. Then this study identified the combination of descriptive items with high utility and found that users' use patterns of descriptive items were diverse. Compared with making a judgment with "relevant" result, users often needed less information to achieve a high level of confidence when making an "irrelevant" judgment. This study also found that the descriptive items with high utility include source, data, use and evaluation, and data production information. It is determined that user data literacy and data utilization purpose were the influencing factors of descriptive information utility, and the effects of the two factors were preliminarily analyzed. Based on this research, the paper put forward some suggestions for improving agricultural scientific data metadata and scientific data sharing. In the future, this study will be repeated in groups with different academic backgrounds and data literacy levels, so as to enhance the generalization ability of research conclusions and construct a more effective structure of scientific data descriptive information.

Key words: scientific data, data description, metadata, information utility, eye-tracking

CLC Number: 

  • G250
[1] YOON A.Data reusers' trust development[J]. Journal of the association for information science and technology, 2017, 68(4): 946-956.
[2] 孙玉伟, 成颖, 谢娟. 科研人员数据复用行为研究: 系统综述与元综合[J]. 中国图书馆学报, 2019, 45(3): 110-130.
SUN Y W, CHENG Y, XIE J.A review on the data reuse behavior of scholars: System review and meta synthesis[J]. Journal of library science in China, 2019, 45(3): 110-130.
[3] TENOPIR C, ALLARD S, DOUGLASS K, et al.Data sharing by scientists: Practices and perceptions[J]. Plos one, 2011, 6(6): 1-21.
[4] FEAR K.User understanding of metadata in digital image collections: Or, what exactly do you mean by" coverage"?[J]. The American archivist, 2010, 73(1): 26-60.
[5] NIU J.Perceived documentation quality of social science data[M]. University of Michigan, 2009.
[6] TENOPIR C, ALLARD S, DOUGLASS K, et al.Data sharing by scientists: Practices and perceptions[J]. PloS one, 2011, 6(6): E21101.
[7] BIRNHOLTZ J P, BIETZ M J.Data at work: Supporting sharing in science and engineering[C]//SCHMIDT K, PENDERGAS M, TREMAINE M, SIMONE C(eds). Proceedings of the 2003 international ACM SIGGROUP conference on supporting group work, Sanibel Island, Florida, USA: ACM, 2003: 339-348.
[8] BORGMAN C L.Big data, little data, no data: Scholarship in the networked world[M]. MIT press, 2017.
[9] GREGORY K, GROTH P, SCHARNHORST A, et al.Lost or found? Discovering data needed for research[J]. Harvard data science review, 2020, 2(2): 1-63.
[10] 邱春艳. 国内外科学数据相关实践中的元数据研究进展[J]. 情报资料工作, 2021, 42(5): 104-112.
QIU C Y.Research progress of metadata in scientific data-related practices at home and abroad[J]. Information and documentation services, 2021, 42(5): 104-112.
[11] DREWRY M, CONOVER H, MCCOY S, et al.Metadata: Quality vs. quantity[C]. IEEE computer society metadata conference, 1997: 1-11.
[12] 苏广利, 姜翠景. 机读目录格式与元数据格式用于网络资源组织的比较研究[J]. 图书馆杂志, 2002(1): 19-22.
SU G L, JIANG C J.Research on application of metadata and marc in organizing web resources[J]. Library journal, 2002(1): 19-22.
[13] 花文博. 浅论基础地理信息数据档案的管理[J]. 兰台世界, 2010(s2): 128-129.
HUA W B.Talking about the management of basic geographic information data archives[J]. Lantai world, 2010(s2): 128-129.
[14] CHAO T C.Enhancing metadata for research methods in data curation[J]. Proceedings of the American society for information science and technology, 2014, 51(1): 1-4.
[15] 刘凤红, 崔金钟, 韩芳桥, 等. 数据论文:大数据时代新兴学术论文出版类型探讨[J]. 中国科技期刊研究, 2014, 25(12): 1451-1456.
LIU F H, CUI J Z, HAN F Q, et al.Data papers: Discussion on the emerging academic paper publishing types in the era of big data[J]. Chinese journal of scientific and technical periodicals, 2014, 25(12): 1451-1456.
[16] RIALL R L, MARINCIONI F, LIGHTSOM F L.Content metadata standards for marine science: A case study[M]. US department of the interior, US geological survey, 2004.
[17] FANIEL I M, FRANK R D, YAKEL E.Context from the data reuser's point of view[J]. Journal of documentation, 2019, 75(6):1274-1297.
[18] KOESTEN L, SIMPERL E, EMILIA M K, et al.Everything you always wanted to know about a dataset: Studies in data summarisation[J]. International journal of human-computer studies, 2020, 135: 1-21.
[19] BEHNERT C.Investigating the effects of popularity data on predictive relevance judgments in academic search systems[C]. Proceedings of the 2019 conference on human information interaction and retrieval, 2019: 437-440.
[20] 满芮, 王健. 我国农业元数据标准初探[J]. 中国科技资源导刊,2016, 48(2): 7-12.
MAN R, WANG J.Analysis of status quo about metadata standard in the agricultural field[J]. China science & technology resources review, 2016, 48(2): 7-12.
[21] KIM J.An analysis of data paper templates and guidelines: Types of contextual information described by data journals[J]. Science editing, 2020, 7(1): 16-23.
[22] CHIN JR G, LANSING C S.Capturing and supporting contexts for scientific data sharing via the biological sciences collaboratory[C]. Proceedings of the 2004 ACM conference on computer supported cooperative work, 2004: 409-418.
[23] KOESTEN L, GREGORY K, GROTH P, et al.Talking datasets-understanding data sensemaking behaviours[J]. International journal of human-computer studies, 2021, 146: 102562.
[24] 常颖聪, 何琳. 科学实验数据元数据模型构建研究——以植物学基因表达实验为例[J]. 图书情报工作, 2015, 59(13): 117-125.
CHANG Y C, HE L.Research on construction of metadata model for scientific experimental data: An example as gene expression experiment of botany[J]. Library and information service, 2015, 59(13): 117-125.
[25] 赵华, 周国民, 王健, 等. 科学数据元数据认知价值评价研究[J]. 情报科学, 2016, 34(7): 81-85.
ZHAO H, ZHOU G M, WANG J, et al.Study of evaluation of cognitive value of scientific metadata[J]. Information science, 2016, 34(7): 81-85.
[26] 海斯蒂, 道斯. 不确定世界的理性选择[M]. 北京: 人民邮电出版社, 2013.
HASTIE R, DAWES R M.Rational choice in an uncertain world[M]. Beijing: Posts & telecom press, 2013.
[27] GIGERENZER G, HOFFRAGE U, KLEINB?LTING H. Probabilistic mental models: A brunswikian theory of confidence[J]. Psychological review, 1991, 98(4): 506.
[28] PAYNE J W, PAYNE J W, BETTMAN J R, et al.The adaptive decision maker[M]. Cambridge university press, 1993.
[29] 刘建平. 科学数据用户相关性判断模型研究[D]. 北京: 中国农业科学院, 2020.
LIU J P.Research on relevance judgement model of scientific data users[D]. Beijing: Chinese academy of agricultural sciences, 2020.
[30] KOESTEN L.A user centered perspective on structured data discovery[C]. Companion proceedings of the web conference 2018, 2018: 849-853.
[31] BRICKLEY D, BURGESS M, NOY N.Google dataset search: Building a search engine for datasets in an open web ecosystem[C]. The world wide web conference, 2019: 1365-1375.
[32] AU V, THOMAS P, JAYASINGHE G K.Query-biased summaries for tabular data[C]. Proceedings of the 21st Australasian document computing symposium, 2016: 69-72.
[33] YOU S.Evaluative metadata in educational digital libraries: How users use evaluative metadata in the process of document selection[J]. TCDL bulletin, 2010, 4(2): 1-11.
[34] TANG M C.A study of academic library users' decision-making process: A lens model approach[J]. Journal of documentation, 2009, 65(6): 938-957.
[35] 于春, 彭爱东, 王波, 等. 信息用户对信息检索相关性判断的因素分析[J]. 图书情报工作, 2009, 53(3): 103-107.
YU C, PENG A D, WANG B, et al.The relevance factors of judgment in the information retrieval of the information users[J]. Library and information service, 2009, 53(3): 103-107.
[36] GREGORY K, GROTH P, COUSIJN H, et al.Searching data: A review of observational data retrieval practices in selected disciplines[J]. Journal of the association for information science and technology, 2019, 70(5): 419-432.
[37] ROUET J F, ROS C, GOUMI A, et al.The influence of surface and deep cues on primary and secondary school students' assessment of relevance in web menus[J]. Learning and instruction, 2011, 21(2): 205-219.
[38] 孟祥保, 李爱国. 国外高校图书馆科学数据素养教育研究[J]. 大学图书馆学报, 2014, 32(3): 11-16.
MENG X B, LI A G.Scientific data literacy education in overseas academic libraries[J]. Journal of academic libraries, 2014, 32(3): 11-16.
[39] 孟宪学. 农业科学数据核心元数据I[M]. 北京: 中国农业科学技术出版社, 2008.
MENG X X.Agricultural scientific data core metadata I[M]. Bei-jing: China agriculture science and technology press, 2008.
[1] CHEN Shuxian, LIU Guifeng, LIU Qiong. Research Progress and Implementation of FAIR Principles for Scientific Data Management [J]. Journal of Library and Information Science in Agriculture, 2022, 34(8): 30-41.
[2] CHAI Miaoling, ZOU Yixing, TAN Rongzhi, ZENG Yi, REN Yunyue. Research and Practice on Association of Scientific Data and Scientific Literature Oriented to Knowledge Service of Agricultural Industry [J]. Journal of Library and Information Science in Agriculture, 2022, 34(3): 37-50.
[3] DING Qiujing. The Construction of the Union Catalog System for Resource Discovery [J]. Journal of Library and Information Science in Agriculture, 2021, 33(8): 71-78.
[4] SUN Tan, HUANG Yongwen, XIAN Guojian, CUI Yunpeng, LIU Juan. Considerations for the Development of Agricultural Informatization Driven by a New Generation of Information Technologies [J]. Journal of Library and Information Science in Agriculture, 2021, 33(3): 4-15.
[5] MENG Yintao, ZHAO Leixia, YU Qianqian. Scientific Data Evaluation Index System for Scientific Data Preservation [J]. Journal of Library and Information Science in Agriculture, 2021, 33(12): 48-59.
[6] YE Xinyou, ZHANG Lulu, KONG Chengguo, ZHANG Qun. Evaluation of Scientific Data Literacy Competency for Postgraduates in China and Construction of Data Literacy Education System [J]. Journal of Library and Information Science in Agriculture, 2021, 33(11): 63-73.
[7] JIANG Enbo, LI Na. Analysis and Evaluation of Chinese Open Government Agricultural Data [J]. Journal of Library and Information Science in Agriculture, 2020, 32(10): 4-15.
[8] WU Dingfeng, LIU Tingting, WANG Jian, HU Lin. Applications of Dynamic Trust Management Model in Agricultural Scientific Data Security [J]. Journal of Library and Information Science in Agriculture, 2020, 32(10): 16-24.
[9] CHAI Miaoling, HUANG Lin, REN Yunyue. A Review of Construction of Major Agricultural Open Scientific Data Resources [J]. Journal of Library and Information Science in Agriculture, 2020, 32(10): 25-34.
[10] CHI Yuzhuo. Literature Review of Scientific Data Competence: the Connotation, Framework and Influencing Factors [J]. Journal of Library and Information Science in Agriculture, 2020, 32(1): 23-29.
[11] WANG Qiao. Study on Self-organizing Model of Scientific Data Set in University Library in the Era of Big Data [J]. , 2018, 30(9): 42-45.
[12] PAN Hong. The Selection and Design of the Metadata of Ancient Agricultural Books under the Background of Mobile Reading [J]. , 2018, 30(8): 27-30.
[13] HAN Hongjun. Development of Scientific Data Sharing in Europe and Its Enlightenment [J]. , 2018, 30(4): 118-121.
[14] WANG Wei. Research on the Construction of Embedded Subject Service System in University Library Based on Scientific data Management [J]. , 2017, 29(3): 148-151.
[15] LV Xin. Study on the Initial Scientific Data Management in Chinese Medicine Universities in the Environment of E-science [J]. , 2017, 29(12): 193-197.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!