农业图书情报学报 ›› 2023, Vol. 35 ›› Issue (6): 16-28.doi: 10.13998/j.cnki.issn1002-1248.23-0347

• 特约综述 • 上一篇    下一篇

论证挖掘研究现状与进展

李娇1,2, 赵瑞雪1,2,4,*, 鲜国建1,2,4, 黄永文1,2, 孙坦3,4   

  1. 1.中国农业科学院农业信息研究所,北京 100081;
    2.国家新闻出版署 农业融合出版知识挖掘与知识服务重点实验室,北京 100081;
    3.中国农业科学院,北京 100081;
    4.农业农村部 农业大数据重点实验室,北京 100081
  • 收稿日期:2023-05-05 出版日期:2023-06-05 发布日期:2023-08-02
  • 通讯作者: * 赵瑞雪(1968- ),女,博士,研究员,博士生导师,研究方向为信息与信息系统、知识服务研究。Email:zhaoruixue@caas.cn
  • 作者简介:李娇(1989- ),女,博士,助理研究员,研究方向为文本挖掘、知识图谱与知识服务研究。鲜国建(1982- ),男,博士,研究员,博士生导师,研究方向为关联数据与知识服务。黄永文(1975- ),女,博士,副研究馆员,研究方向为科学数据与知识组织。孙坦(1970- ),男,博士,研究馆员,博士生导师,研究方向为数字信息描述与组织
  • 基金资助:
    中国科协青年人才托举工程项目“面向科研论文的科学论证语义识别与解析研究”(2022QNRC001)

Research Advances in Argument Mining

LI Jiao1,2, ZHAO Ruixue1,2,4,*, XIAN Guojian1,2,4, HUANG Yongwen1,2, SUN Tan3,4   

  1. 1. Agricultural Information Institute of CAAS, Beijing 100081;
    2. Key Laboratory of Knowledge Mining and Knowledge Services in Agricultural Converging Publishing, National Press and Publication Administration, Beijing 100081;
    3. Chinese Academy of Agricultural Sciences, Beijing 100081;
    4. Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081
  • Received:2023-05-05 Online:2023-06-05 Published:2023-08-02

摘要: [目的/意义]论证挖掘是近年来计算语言学领域的热点研究内容,为论证计算模型提供机器可处理的结构化数据,对其研究现状及进展进行总结和分析可为后续的研究及应用深化提供借鉴。[方法/过程]本研究通过对国内外论证挖掘重要文献进行收集、整理和分析,从相关研究基础、技术体系、应用实践等多个维度进行系统性综述,通过梳理总结论证挖掘发展路径展示该研究领域的发展全貌和特征,并重点描述多模态论证挖掘研究的现状。[结果/结论]论证挖掘任务与自然语言处理等人工智能技术息息相关,相关研究经历了“机器学习-深度学习”“文本-多模态”的发展变迁,且领域发展与应用水平不一;如何实现多粒度、多模态内容泛化,以及如何促进其应用落地实践将是下一步研究的热点和重点。

关键词: 论证挖掘, 技术体系, 发展路径, 多模态

Abstract: [Purpose/Significance] Argument mining, a research hotspot in the field of computational linguistics, provides machine processable structured data for computational models of argument. Argument mining tasks are closely related to artificial intelligence (AI) technologies, such as natural language processing and knowledge representation. There are numerous systematic studies in academia and a clear technical realization route has come into being. New research results continue to emerge as a result of rich resources and rapid development and iteration of deep learning, large language models (LLMs), and other technologies. This study, which reviews the research status and progress of argument mining, can serve as a resource for future research and application development. [Method/Process] Through literature review, this paper systematically reviews the relevant research basis (including foundational techniques and semantic representation models), summarizes the related technical system in terms of task framework, influencing factors of technological complexity, and method classification, and then introduces the argument mining practice and application cases for specific fields and research objectives and makes a comparative analysis. Most importantly, the overall development and characteristics of this research field are summarized, with a focus on tracking the progress of multimedia argument mining in the context of the new AI environment. [Results/Conclusions] Relevant research has experienced the development of "machine learning - deep learning" and "text only - multimodal", and the levels of development and application of various fields vary much. Future research may focus on how to achieve multigranularity and multimodal content generalization, as well as how to promote its application and implementation in practice. Possible research directions include: 1) the use of LLMs in argument mining, because they exhibit significant benefits in downstream applications such as natural language processing and multimodal learning, and can also provide certain technical conditions for the generation of argument content; 2) the use of domain knowledge organization systems such as vocabulary, knowledge base and knowledge graph: with these systems, researchers can combine domain-specific argument mining models with rich knowledge structure, to strengthen semantic representation and organization improve the systematization and dig deeper into argument mining model research in the domain; 3) promoting the application research and practice of argument mining in more fields or across disciplines, and improving the retrieval and visualization of argument information, such as combining information retrieval methods with argument mining to build the next generation of argument search engines.

Key words: argument mining, technical system, development path, multimodal

中图分类号: 

  • G254

引用本文

李娇, 赵瑞雪, 鲜国建, 黄永文, 孙坦. 论证挖掘研究现状与进展[J]. 农业图书情报学报, 2023, 35(6): 16-28.

LI Jiao, ZHAO Ruixue, XIAN Guojian, HUANG Yongwen, SUN Tan. Research Advances in Argument Mining[J]. Journal of Library and Information Science in Agriculture, 2023, 35(6): 16-28.