农业图书情报学报

• •    

基于大语言模型的科技政策评论方面级情感分析研究——以新能源汽车产业为例

李鑫鑫1, 马雨萌2,3(), 鞠孜涵3, 王敬4   

  1. 1. 西安交通大学 图书馆,西安 710061
    2. 中国科学院文献情报中心,北京 100190
    3. 北京大学 信息管理系,北京 100871
    4. 中共安徽省委党校图书和文化馆,合肥 230022
  • 收稿日期:2025-07-21 出版日期:2025-10-21
  • 通讯作者: 马雨萌
  • 作者简介:

    李鑫鑫(1996- ),女,硕士,馆员,研究方向为学科竞争情报分析

    鞠孜涵(1999- ),女,博士研究生,研究方向为科技情报分析

    王敬(1988- ),男,硕士,馆员,研究方向为开放科学

  • 基金资助:
    国家社会科学基金青年项目“科技政策大数据语义分析方法与决策支持研究”(20CTQ030)

Aspect-Level Sentiment Analysis of Science and Technology Policy Reviews Based on Large Language Models: A Case Study of the New Energy Vehicle Industry

LI Xinxin1, MA Yumeng2,3(), JU Zihan3, WANG Jing4   

  1. 1. Xi'an Jiaotong University Library, Xi'an 710061
    2. National Science Library, Chinese Academy of Sciences, Beijing 100190
    3. Department of Information Management, Peking University, Beijing 100871
    4. Party School of Anhui Provincial Committee of C. P. C/Anhui Academy of Governance Library, Hefei 230022
  • Received:2025-07-21 Online:2025-10-21
  • Contact: MA Yumeng

摘要:

【目的/意义】 探索基于大语言模型的科技政策评论方面级情感分析方法,挖掘科技政策评论文本中的潜在知识信息,为政策的实施效果及后续优化提供数据支持。 【方法/过程】 以新能源汽车产业为例,利用Python爬虫技术采集科技政策评论数据,提取评论属性词,利用聚类算法生成方面类簇,最后采用微调后的大语言模型进行情感倾向识别。 【结果/结论】 研究结果表明,与BERT模型相比,本方法在准确率、召回率和F1值上分别提升11.49%、12.43%、11.43%,公众对汽车性能配置、汽车销售市场的关注度较高,而在汽车安全、基础设施建设方面的满意度较低,提出优化新能源汽车补贴政策、强化政策宣传、完善基础设施建设、健全售后服务保障体系等建议。

关键词: 科技政策评论, 方面级情感分析, 政策关注度, 政策满意度, 大语言模型

Abstract:

[Purpose/Significance] In recent years, the rapid rise of large language model technology has shown significant advantages in understanding semantic context and capturing multidimensional sentiment tendencies. This study explores an aspect-level sentiment analysis method for science and technology policy comments based on large language models, aiming to uncover latent knowledge within these texts and provide data support for evaluating the effectiveness and subsequent optimization of policies. [Method/Process] Taking the electric vehicle industry as an example, a burgeoning sector vital to achieving the "dual carbon" goals and promoting green low-carbon development, this study proposed a policy satisfaction evaluation model. The model uses large language models for fine-grained aspect-level sentiment analysis of policy comment texts. The process includes the following steps: 1) Data collection and preprocessing: Comments related to electric vehicle policies were collected from the "Interactive Topics" section of the "Autohome" website using Python. Deep learning techniques were applied to set rules for the comment texts and automatically add punctuation marks to Chinese texts for data pre-processing. 2) Aspect word extraction: The steps include text tokenization, determining a candidate aspect word set, expanding the aspect word set, and clustering aspect words. A total of 3 405 aspect words were extracted from 35 000 comments, forming six clusters: infrastructure construction, vehicle performance configuration, national policies, technological development, automotive safety, and automotive sales market. Aspect-level sentences were extracted using aspect words and punctuation information, with a subset of sentences manually labeled to build training and validation corpora, resulting in 14 911 aspect-level sentences. 3) Sentiment tendency recognition model training: A prompt template for aspect-level sentiment classification tasks was designed, and the LoRA method was used to fine-tune the large language model with the manually labeled training set. The model's performance was evaluated using a validation set, resulting in the classification of comments on electric vehicle policies into positive, neutral, and negative sentiments. 4) Comparative experiment: The fine-tuned large model was compared with the mainstream sentiment classification model, BERT, to assess the performance of different models in aspect-level sentiment classification tasks. [Results/Conclusions] The results show that compared to the BERT model, the proposed method outperformed other methods in multiple metrics, including accuracy, recall, and F1 score, with improvements of 11.49%, 12.43% and 11.43%, respectively. Overall, public attention is higher towards vehicle performance configuration and automotive sales market, while infrastructure construction receives the lowest attention. The overall public satisfaction with electric vehicles is relatively low, with negative comments outweighing positive comments across all aspects, consistent with the "negative bias" theory in social psychology. Satisfaction issues are particularly prominent in the areas of automotive safety and infrastructure construction. Finally, policy recommendations have been proposed to optimize electric vehicle subsidy policies, strengthen policy promotion, improve infrastructure construction, and enhance after-sales service support systems.

Key words: science and technology policy review, aspect-level sentiment analysis, policy attention, policy satisfaction, large language model

中图分类号:  G353.1

引用本文

李鑫鑫, 马雨萌, 鞠孜涵, 王敬. 基于大语言模型的科技政策评论方面级情感分析研究——以新能源汽车产业为例[J/OL]. 农业图书情报学报. https://doi.org/10.13998/j.cnki.issn1002-1248.25-0396.

LI Xinxin, MA Yumeng, JU Zihan, WANG Jing. Aspect-Level Sentiment Analysis of Science and Technology Policy Reviews Based on Large Language Models: A Case Study of the New Energy Vehicle Industry[J/OL]. Journal of library and information science in agriculture. https://doi.org/10.13998/j.cnki.issn1002-1248.25-0396.