中文    English

Journal of library and information science in agriculture ›› 2023, Vol. 35 ›› Issue (6): 51-59.doi: 10.13998/j.cnki.issn1002-1248.23-0288

Previous Articles     Next Articles

Application of Large-scale Pre-Training Language Model in Network Health Information Identification

WANG Chao1, KONG Xianghui2,*   

  1. 1. Library of Liaoning University of Technology, Jinzhou 121000;
    2. Library of Jinzhou Medical University, Jinzhou 121000
  • Received:2023-05-08 Online:2023-06-05 Published:2023-08-02

Abstract: [Purpose/Significance] Taking the popular "chat robot" ChatGPT and the recently launched similar product "iFLYTEK Spark" as the research object, this paper explores their applications in the identification of online health information, and discusses their advantages and disadvantages, in order to provide reference for the large-scale pre-training language model in the field of health information identification. Based on the review of relevant literature on online health information authentication, deep learning models have been widely applied in the task of online health information authentication in recent years. With the rapid development of large pre-training language models such as ChatGPT, it is a novel idea to explore their discriminating ability in online health information. [Method/Process] Researchers selected health-related information from the most authoritative rumor-refuting websites in China, used "ChatGPT" and "iFLYTEK Spark" to verify the authenticity of the online health information, evaluated their performance, and compared their identification results with the expert identification results. The identification accuracy of ChatGPT and iFLYTEK Spark language model was 93.9% and 92.9%, respectively, and the F1 value was 0.951 and 0.946, respectively, which had a good application effect. The generated explanatory texts were more detailed and the language was relatively smooth. In terms of the length and dispersion of the explanatory text, ChatGPT is closer to that of medical experts, while iFLYTEK Spark's explanatory text is relatively long and less discrete. In terms of semantic similarity, ChatGPT and iFLYTEK Spark were almost equal in performance, and their understanding of health information was close to that of human experts to some extent. Through the analysis of typical samples, it can be seen that an AI large model cannot accurately identify news or emergency information for the time being, and the understanding of individual health propositions with complex semantics will occasionally be biased. [Results/Conclusions] The experimental results show that ChatGPT and iFLYTEK Spark have good discriminative effect in the field of online health information identification, but there are shortcomings, and manual intervention is needed to ensure the accuracy and reliability of the results. In the future, in the field of AI large model research, researchers are suggested to attach importance to the construction and application of high-quality corpora in vertical fields. In the field of online health information identification, practitioners can use models such as ChatGPT as tools to help identify and refine health information. There are also limitations in this article. For example, the amount of data involved in the test is not large enough, ChatGPT uses GPT3.5 model, and the online application time of iFLYTEK Spark language model is relatively short. In future studies, the amount of online health information can be further increased, and the updated version of an AI large model can be tested and evaluated.

Key words: artificial intelligence, health information, identification, ChatGPT

CLC Number: 

  • G252
[1] 秦盼盼, 谢莉琴, 雷行云, 等. 基于联邦学习的区域健康医疗大数据中心建设探析[J]. 卫生软科学, 2023, 37(5): 73-78.
QIN P P, XIE L Q, LEI X Y, et al.Research on the construction of regional big data center in healthcare based on federated learning[J]. Soft science of health, 2023, 37(5): 73-78.
[2] 王超, 卢智增. 农村大学生健康信息替代搜寻意向影响因素探析[J]. 情报探索, 2023(4): 24-29.
WANG C, LU Z Z.Analysis of influencing factors of intention to surrogate health information seeking of rural college students[J]. Information research, 2023(4): 24-29.
[3] 曹雅宁, 柯青. 为什么人们对虚假健康信息的易感性不同? ——基于信息加工过程组态的分析[J]. 现代情报, 2023, 43(1): 40-54.
CAO Y N, KE Q.Why people show differences in their susceptibility to health misinformation? - Research based on information processing configurations[J]. Journal of modern information, 2023, 43(1): 40-54.
[4] 王超, 卢智增. 农村老年人健康信息替代搜寻者特征及其行为影响因素探析[J]. 图书情报导刊, 2022, 7(10): 37-44.
WANG C, LU Z Z.Analysis on the characteristics and behavioral influencing factors of rural elderly health information surrogate seeker[J]. Journal of library and information science, 2022, 7(10): 37-44.
[5] 张智雄, 曾建勋, 夏翠娟, 等. 回应AIGC的信息资源管理学人思考[J]. 农业图书情报学报, 2023, 35(1): 4-28.
ZHANG Z X, ZENG J X, XIA C J, et al.Information resource management researchers' thinking about the opportunities and challenges of AIGC[J]. Journal of library and information science in agriculture, 2023, 35(1): 4-28.
[6] CHOU W Y S, OH A, KLEIN W M P. Addressing health-related misinformation on social media[J]. JAMA, 2018, 320(23): 2417-2418.
[7] WASZAK P M, KASPRZYCKA-WASZAK W, KUBANEK A.The spread of medical fake news in social media - The pilot quantitative study[J]. Health policy and technology, 2018, 7(2): 115-118.
[8] PURNOMO M H, SUMPENO S, SETIAWAN E I, et al.Keynote speaker II: Biomedical engineering research in the social network analysis era: Stance classification for analysis of hoax medical news in social media[J]. Procedia computer science, 2017, 116: 3-9.
[9] KINSORA A, BARRON K, MEI Q Z, et al.Creating a labeled dataset for medical misinformation in health forums[C]// 2017 IEEE International Conference on Healthcare Informatics (ICHI). Piscataway, New Jersey: IEEE, 2017: 456-461.
[10] GHENAI A, MEJOVA Y. Catching zika fever: Application of crowdsourcing and machine learning for tracking health misinformation on twitter[EB/OL].2017: arXiv: 1707.03778. https://arxiv.org/abs/1707.03778
[11] MRIDHA M F, KEYA A J, HAMID M A, et al.A comprehensive review on fake news detection with deep learning[J]. IEEE access, 2021, 9: 156151-156170.
[12] WANG Y T, NAZIR S, SHAFIQ M.An overview on analyzing deep learning and transfer learning approaches for health monitoring[J]. Computational and mathematical methods in medicine, 2021, 2021: 1-10.
[13] ABDULAZIZ A A, AHMED H M H. Deep learning prediction model for heart disease for elderly patients[J]. Intelligent automation & soft computing, 2023, 35(2): 2527-2540.
[14] 徐璐璐, 洪闅, 叶鹰. ChatGPT及GPT类技术的医学信息学应用前景探讨[J]. 情报理论与实践, 2023, 46(6): 38-42.
XU L L, HONG B, YE Y.Exploring the application prospect of ChatGPT and GPT-type technology in medical informatics[J]. Information studies: Theory & application, 2023, 46(6): 38-42.
[15] 李雅静, 孙佳佳. ChatGPT在网络健康信息鉴别中的应用探讨[J]. 图书馆论坛, 2023, 43(6): 85-92.
LI Y J, SUN J J.Discussion on the application of ChatGPT in online health information identification[J]. Library tribune, 2023, 43(6): 85-92.
[1] GUO Yanli, GAO Rui, ZOU Meifeng, LIU Zidan. Identification of Product Innovation Opportunity Based on Problem and Suggestions Using Dual-Source Data [J]. Journal of library and information science in agriculture, 2026, 38(4): 47-60.
[2] AN Lin. Governance of Personal Information Security in the Iteration of Generative AI: From the Perspective of the Technological Evolution of Large Models [J]. Journal of library and information science in agriculture, 2026, 38(4): 61-70.
[3] LI Baiyang, REN Shangsheng. Technical Evolution and Application Scenarios of Open-Source Agents:A Case Study of "OpenClaw" [J]. Journal of library and information science in agriculture, 2026, 38(4): 23-35.
[4] HU Anqi. Construction of an Artificial Intelligence Literacy Ability Framework and Training System for College Students [J]. Journal of library and information science in agriculture, 2026, 38(2): 42-55.
[5] HUANG Xiaotang, YAO Qibin. Collaborative Development Path of GLAM Institutions Based on AIGC Technology Application [J]. Journal of library and information science in agriculture, 2026, 38(2): 66-78.
[6] YI Chenhe, ZHANG Yuting. Risk Assessment and Early Warning of Generative Artificial Intelligence Impact on Network Public Opinion Based on Optimized BP Neural Network [J]. Journal of library and information science in agriculture, 2026, 38(2): 30-41.
[7] GUO Hailing, ZENG Meiyun, FENG Yuxi. Model Construction and Strategies for AI-enabled University Library Services to Facilitate Scientific and Technological Achievement Transformation [J]. Journal of library and information science in agriculture, 2026, 38(2): 56-65.
[8] ZHANG Ling. Integrating Digital Humanities and Agricultural Knowledge Services A Simulation Modeling Perspectives [J]. Journal of library and information science in agriculture, 2026, 38(2): 79-89.
[9] JIANG Jingze, ZHOU Tianmin, LI Mei, CHENG Cheng, CHEN Haiyan. A study of the Core Competence Model of Compound AI Librarians in the Intelligent Transformation of University Libraries [J]. Journal of library and information science in agriculture, 2025, 37(9): 97-109.
[10] SHEN Hongjie, SHEN Hongwei, WANG Junli. Generative AI Empowering Information Literacy Education in Digital Libraries: Path Exploration, Challenge Analysis, and Response Strategies [J]. Journal of library and information science in agriculture, 2025, 37(7): 50-60.
[11] DONG Ke, SONG Yuchen, WU Jiachun. Layout and Characteristics of European AI Data Governance Policy [J]. Journal of library and information science in agriculture, 2025, 37(7): 4-18.
[12] ZHAI Jun, MENG Zihan, LI Fangsu, SHEN Lixin. AI Guides in Research Libraries of North America under the AI4S Context: Based on the Survey of 125 ARL Libraries [J]. Journal of library and information science in agriculture, 2025, 37(7): 35-49.
[13] CHEN Yuanyuan, FU Bin, GAO Yuan, QIAO Junwei. Identification of Emerging Technology Topics and Prediction of Trends Using a Method Integrating BERTopic and IWOA-BiLSTM Models [J]. Journal of library and information science in agriculture, 2025, 37(6): 55-69.
[14] SHI Xujie, YUAN Fan, LI Jia. Searching as Learning in the Context of Generative Artificial Intelligence: Technological Pathways, Behavioral Evolution, and Ethical Challenges [J]. Journal of library and information science in agriculture, 2025, 37(5): 40-57.
[15] CHEN Jiayong, GONG Jiaoteng, WANG Yuyi. Research of Interdisciplinary Comparison and Collaborative Paradigm on the Concept of Agent in Library Science [J]. Journal of library and information science in agriculture, 2025, 37(5): 27-39.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!