中文    English

Journal of library and information science in agriculture ›› 2021, Vol. 33 ›› Issue (9): 27-36.doi: 10.13998/j.cnki.issn1002-1248.21-0262

;

Previous Articles     Next Articles

The Trigger Verb Classification Method of Event Sentences in Ancient Chinese Classics Based on Bi-LSTM

MA Xiaowen, HE Lin*, LIU Jianbin, LI Zhangchao, GAO Dan   

  1. College of Information Management, Nanjing Agricultural University, Nanjing 210095
  • Received:2021-04-11 Online:2021-09-05 Published:2021-09-28

Abstract: [Purpose/Significance] It is of great significance to carry out research on the recognition and classification of trigger verbs in ancient books oriented to digital humanities for the deep mining and content revealing of ancient texts. This paper uses the deep learning classification algorithm to explore an automated method for multivariate classification of event sentence text based on trigger words in ancient books. [Method/Process] Based on the construction of the classic event trigger word classification system and trigger dictionary, four different types of event sentence texts are selected as experimental data, and the category labels and sentence texts are coded separately using Onehot and Tokenizer, and then the classifier is trained in the Bi-LSTM model, and a comparative experiment is set by adjusting the parameters, and the performance of the classifier is analyzed by using a general evaluation index. [Results/Conclusions] The classifier after many training and adjustments has an accuracy of 0.95 in the evaluation of the test set, which proves that the experimental method based on deep learning and the constructed trigger word data set can effectively help us realize automatic multivariate classification of event sentence text of ancient books.

Key words: trigger word classification, Bi-LSTM model, multivariate classification, Zuo Zhuan

CLC Number: 

  • G350
[1] 魏晓萍. 数字人文背景下数字化古籍的深度开发利用[J]. 农业图书情报学刊, 2018, 30(9): 106-110.
WEI X P.In-depth development and utilization of digital ancient books under the background of digital humanities[J]. Journal of agricultural library and information science, 2018, 30(9): 106-110.
[2] 邓三鸿, 胡昊天, 王昊, 等. 古文自动处理研究现状与新时代发展趋势展望[J]. 科技情报研究, 2021, 3(1): 1-20.
DENG S H, HU H T, WANG H, et al.Research status of automatic processing of ancient texts and prospects for development trends in the new era[J]. Science and technology information research, 2021, 3(1): 1-20.
[3] 苏晓丹. 开放域事件抽取关键技术研究[D]. 郑州: 解放军信息工程大学, 2017.
SU X D.Research on key technologies of open domain event extraction[D]. Zhengzhou: PLA information engineering university, 2017.
[4] 张建海. 基于深度学习的生物医学事件抽取研究[D]. 大连: 大连理工大学, 2016.
ZHANG J H.Research on biomedical event extraction based on deep learning[D]. Dalian: Dalian university of technology, 2016.
[5] 张仲华, 苏方方, 姬东鸿. 生物医学事件触发词识别研究[J]. 计算机应用研究, 2017, 34(3): 661-664, 670.
ZHANG Z H, SU F F, JI D H.Research on trigger word recognition of biomedical events[J]. Application research of computers, 2017, 34(3): 661-664, 670.
[6] 何馨宇. 基于文本挖掘的生物事件抽取关键问题研究[D]. 大连:大连理工大学, 2019.
HE X Y.Research on key issues of biological event extraction based on text mining[D]. Dalian: Dalian university of technology, 2019.
[7] 贾遂民, 张玉, 张腾飞. 一种基于介词用法的灾难事件信息抽取方法[J]. 计算机与现代化, 2015(7): 116-119.
JIA S M, ZHANG Y, ZHANG T F.A disaster event information extraction method based on the usage of prepositions[J]. Computer and modernization, 2015(7): 116-119.
[8] 邱奇志, 周三三, 刘长发, 等. 基于文体和词表的突发事件信息抽取研究[J]. 中文信息学报, 2018, 32(9): 56-65, 74.
QIU Q Z, ZHOU S S, LIU C F, et al.Research on the extraction of emergency information based on style and vocabulary[J]. Chinese journal of information, 2018, 32(9): 56-65, 74.
[9] 王宁, 陈湧, 郭玮, 等. 基于知识元的突发事件案例信息抽取方法[J]. 系统工程, 2014, 32(12): 133-139.
WANG N, CHEN Y, GUO W, et al.Information extraction method of emergencies based on knowledge element[J]. Systems engineering, 2014, 32(12): 133-139.
[10] 蒋德良. 基于规则匹配的突发事件结果信息抽取研究[J]. 计算机工程与设计, 2010, 31(14): 3294-3297.
JIANG D L.Research on information extraction of emergencies based on rule matching[J]. Computer engineering and design, 2010, 31(14): 3294-3297.
[11] 万齐智, 万常选, 胡蓉, 等. 基于句法语义依存分析的中文金融事件抽取[J]. 计算机学报, 2021, 44(3): 508-530.
WAN Q Z, WAN C X, HU R, et al.Chinese financial event extraction based on syntactic and semantic dependency analysis[J]. Chinese journal of computers, 2021, 44(3): 508-530.
[12] 罗明, 黄海量. 基于词汇-语义模式的金融事件信息抽取方法[J]. 计算机应用, 2018, 38(1): 84-90.
LUO M, HUANG H L.Financial event information extraction method based on vocabulary-semantic model[J]. Computer applications, 2018, 38(1): 84-90.
[13] 李江龙, 吕学强, 周建设, 等. 金融领域的事件句抽取[J]. 计算机应用研究, 2017, 34(10): 2915-2918, 2945.
LI J L, LU X Q, ZHOU J S, et al.Event sentence extraction in the financial field[J]. Application research of computers, 2017, 34(10): 2915-2918, 2945.
[14] 赵小明, 朱洪波, 陈黎, 等. 基于多分类器的金融领域多元关系信息抽取算法[J]. 计算机工程与设计, 2011, 32(7): 2348-2351.
ZHAO X M, ZHU H B, CHEN L, et al.Multi-classifier-based multi-relation information extraction algorithm in the financial field[J]. Computer engineering and design, 2011, 32(7): 2348-2351.
[15] 薛聪, 高能, 查达仁, 等. 事件库构建技术综述[J]. 信息安全学报, 2019, 4(2): 83-106.
XUE C, GAO N, CHA D R, et al.Overview of event library construction technology[J]. Journal of information security, 2019, 4(2): 83-106.
[16] 高强, 游宏梁. 事件抽取技术研究综述[J]. 情报理论与实践, 2013, 36(4): 114-117, 128.
GAO Q, YOU H L.A review of event extraction technology re-search[J]. Information theory and practice, 2013, 36(4):114-117, 128.
[17] 轩小星, 廖涛, 高贝贝. 中文事件触发词的自动抽取研究[J]. 计算机与数字工程, 2015, 43(3): 457-461.
XUAN X X, LIAO T, GAO B B.Research on automatic extraction of Chinese event trigger words[J]. Computer and digital engineering, 2015, 43(3): 457-461.
[18] 谭红叶. 中文事件抽取关键技术研究[D]. 哈尔滨: 哈尔滨工业大学, 2008.
TAN H Y.Research on key technologies of Chinese event extraction[D]. Harbin: Harbin institute of technology, 2008.
[19] 李培峰, 周国栋, 朱巧明. 基于语义的中文事件触发词抽取联合模型[J]. 软件学报, 2016, 27(2): 280-294.
LI P F, ZHOU G D, ZHU Q M.A joint model of Chinese event trig-ger word extraction based on semantics[J]. Journal of software, 2016, 27(2): 280-294.
[20] 项威, 王邦. 中文事件抽取研究综述[J]. 计算机技术与发展, 2020, 30(2): 1-6.
XIANG W, WANG B.A review of Chinese event extraction research[J]. Computer technology and development, 2020, 30(2): 1-6.
[21] 吴敏. 网络短文本主题聚类研究[D]. 武汉: 华中科技大学, 2015.
WU M.Research on topic clustering of internet short texts[D].Wuhan: Huazhong university of science and technology, 2015.
[22] 徐鑫鑫, 刘彦隆, 宋明. 利用加权词句向量的文本相似度计算方法[J]. 小型微型计算机系统, 2019, 40(10): 2072-2076.
XU X X, LIU Y L, SONG M.Text similarity calculation method using weighted words and sentence vectors[J]. Small microcomputer system, 2019, 40(10): 2072-2076.
[23] 李岩. 基于深度循环神经网络的关系抽取方法研究[D]. 开封: 河南大学, 2019.
LI Y.Research on relation extraction method based on deep recurrent neural network[D]. Kaifeng: Henan university, 2019.
[24] 梁杰, 陈嘉豪, 张雪芹, 等. 基于独热编码和卷积神经网络的异常检测[J]. 清华大学学报(自然科学版), 2019, 59(7): 523-529.
LIANG J, CHEN J H, ZHANG X Q, et al.Anomaly detection based on one-hot encoding and convolutional neural network[J]. Journal of Tsinghua university (natural science edition), 2019, 59(7): 523-529.
[25] 易士翔, 尹宏鹏, 郑恒毅. 基于BiLSTM的公共安全事件触发词识别[J]. 工程科学学报, 2019, 41(9): 1201-1207.
YI S X, YIN H P, ZHENG H Y.Trigger word recognition of public safety events based on BiLSTM[J]. Journal of engineering science, 2019, 41(9): 1201-1207.
[26] 王立荣. Word2vec-CNN-Bi-LSTM短文本情感分类[J]. 福建电脑, 2020, 36(1): 11-16.
WANG L R.Word2vec-CNN-Bi-LSTM short text sentiment classification[J]. Fujian computer, 2020, 36(1): 11-16.
[27] 和志强, 杨建, 罗长玲. 基于Bi-LSTM神经网络的特征融合短文本分类算法[J]. 智能计算机与应用, 2019, 9(2): 21-27.
HE Z Q, YANG J, LUO C L.Feature fusion short text classification algorithm based on Bi-LSTM neural network[J]. Intelligent computers and applications, 2019, 9(2): 21-27.
[28] 阳萍, 谢志鹏. 基于Bi-LSTM模型的定义抽取方法[J]. 计算机工程, 2020, 46(3): 40-45.
YANG P, XIE Z P.Definition extraction method based on Bi-LSTM model[J]. Computer engineering, 2020, 46(3): 40-45.
[29] CHEN Y, XU L, LIU K.Event extraction via dynamicmulti-pooling convolutional neural networks[C]//Proceedings of the 53rd ACL and the 7th IJCNLP. Beijing, China: ACL, 2015: 167-176.
[30] NGUYEN T H, CHO K, GRISHMAN R.Joint event extraction via recurrent neural networks[C]//Proceedings of the 2016 NAACL: Human language technologies. San Diego, California: ACL, 2016: 300-309.
[1] QIAN Li, YANG Yanxi, ZHANG Yuanzhe, HU Maodi, CHANG Zhijun. The Impacts and Implications of OpenClaw for Scientific and Technical Literature Intelligence Work [J]. Journal of library and information science in agriculture, 2026, 38(4): 4-12.
[2] GAI Yingzhao, HUANG Qimeng, WANG Ning, ZHANG Ying, ZHOU Qun. Construction and Empirical Study of Journal Hotspot Index Based on Highly Cited Papers [J]. Journal of library and information science in agriculture, 2026, (): 1-11.
[3] YANG Guancan, SHI Yingying, ZHANG Zihe. Research on the Construction and Evaluation of a Low-Altitude Economy Urban Development Index [J]. Journal of library and information science in agriculture, 2026, 38(2): 4-15.
[4] HU Anqi. Construction of an Artificial Intelligence Literacy Ability Framework and Training System for College Students [J]. Journal of library and information science in agriculture, 2026, 38(2): 42-55.
[5] WANG Jian. Collaborative Governance, Knowledge Interfaces, and Flow Closed-Loop: A Mechanism Study on Rural Reading Spaces as Agricultural Knowledge Diffusion Nodes [J]. Journal of library and information science in agriculture, 2026, 38(1): 71-78.
[6] YANG Guancan, ZHANG Zihe. Construction of a Dynamic Perception System for Talent Supply-Demand Matching: Theoretical Framework and Implementation Path [J]. Journal of library and information science in agriculture, 2025, 37(9): 4-17.
[7] WANG Xiaoyu, HU Jingyuan, WU Ruoyu, WANG Shu, ZHAI Yujia. An LLM-based Data Augmentation Method for Constructing Science & Technology Topic Linkages: Taking the Energy Conservation Field as an Example [J]. Journal of library and information science in agriculture, 2025, 37(9): 63-81.
[8] ZHANG Tao, WU Sihang. How Achievement Goal Orientation Influences College Students' Usage Behaviors of AI Tutoring Tools: An Empirical Study Based on Dual Mediation [J]. Journal of library and information science in agriculture, 2025, 37(7): 91-105.
[9] DONG Ke, SONG Yuchen, WU Jiachun. Layout and Characteristics of European AI Data Governance Policy [J]. Journal of library and information science in agriculture, 2025, 37(7): 4-18.
[10] QIAN Li, WANG Qianying, LIU Yi, ZHANG Yuanzhe, CHANG Zhijun. Agent Technology and Its Applications in Scientific Research [J]. Journal of library and information science in agriculture, 2025, 37(5): 5-14.
[11] LI Xiao, QU Jiansheng. Influencing Factors of User Participation Intention of Crowdsourcing in Evidence Synthesis [J]. Journal of library and information science in agriculture, 2025, 37(3): 92-105.
[12] CAI Yiran, HU Zhengyin, LIU Chunjiang. Analysis of Progress in Data Mining of Scientific Literature Using Large Language Models [J]. Journal of library and information science in agriculture, 2025, 37(2): 4-22.
[13] LIU Xiwen, FU Yun, WEI Huanan. DIS Agent: New Paradigm of S&T Documentation and Information Service for the Fifteenth Five-Year Plan [J]. Journal of library and information science in agriculture, 2024, 36(12): 20-34.
[14] MAN Zhenliang, WANG Xinwei. Prevention and Control of Information Fog from the Perspective of Overall National Security Concept [J]. Journal of library and information science in agriculture, 2024, 36(3): 83-91.
[15] WANG Shan, TAN Zongying. Identification of Key Core Technologies Enables the Development of New Quality Productive Forces [J]. Journal of library and information science in agriculture, 2024, 36(2): 26-35.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!