Journal of Library and Information Science in Agriculture ›› 2020, Vol. 32 ›› Issue (3): 4-9. doi: 10.13998/j.cnki.issn1002-1248.2020.02.17-0073

• Invited Paper •

Diachronic Analysis of Word Frequency in the First Document of Chinese Party Central Committee Based on People's Daily Corpus

HUANG Shuiqing1,2, WANG Dongbo1,2

  1. College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095;
  2. Research Center for Correlation of Domain Knowledge, Nanjing Agricultural University, Nanjing 210095
  • Received: 2020-02-06 Online: 2020-03-05 Published: 2020-03-23
  • About the authors: HUANG Shuiqing (ORCID: 0000-0002-1646-9300), professor and doctoral supervisor, College of Information Science and Technology, Nanjing Agricultural University. WANG Dongbo (ORCID: 0000-0002-9894-9550), professor and doctoral supervisor, College of Information Science and Technology, Nanjing Agricultural University.

Abstract: [Purpose/Significance] This study aims to reveal the period-specific characteristics of important words in the No. 1 Central Document, the first document issued each year by the Central Committee of the Communist Party of China. [Method/Process] Using the manually word-segmented People's Daily corpus as data, the study counts how often several important words from each part of the 2020 No. 1 Central Document appear in the January issues of People's Daily for each year since 2015. It then compares the changes in frequency across periods from a diachronic perspective and analyzes the reasons. [Results/Conclusions] First, words tied to macro-level, top-level policies and to widespread social phenomena have relatively high frequencies in People's Daily. Second, words related to the "three rural issues" (agriculture, rural areas, and farmers) generally have low frequencies, especially some of those related to specific tasks. Finally, diachronic word-frequency analysis of policy documents based on the People's Daily corpus is feasible and can yield useful conclusions.

Key words: People's Daily corpus, NEPD, First Document of Chinese Party Central Committee, word frequency
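The counting step described in the abstract (tallying selected words in segmented text, year by year) can be illustrated with a minimal Python sketch. This is not the authors' code. It assumes that each line of the corpus is already segmented into whitespace-separated tokens, optionally followed by a `/tag` part-of-speech suffix; the function name `count_terms_by_year`, the toy sentences, and the target words are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def count_terms_by_year(lines_by_year, target_terms):
    """Count target terms per year in pre-segmented text.

    lines_by_year: dict mapping a year to a list of segmented lines
                   (tokens separated by whitespace; assumed format).
    target_terms:  list of words to track.
    Returns {year: {term: count}}, with 0 for terms that never occur.
    """
    targets = set(target_terms)
    result = {}
    for year, lines in lines_by_year.items():
        counts = Counter()
        for line in lines:
            for token in line.split():
                # Drop an optional part-of-speech tag such as "农业/n".
                word = token.split("/", 1)[0]
                if word in targets:
                    counts[word] += 1
        result[year] = {term: counts[term] for term in target_terms}
    return result


# Toy data (invented for illustration, not taken from the corpus).
sample = {
    2015: ["农业 现代化 是 重点", "农村 改革 继续 推进"],
    2016: ["脱贫 攻坚 与 农业 发展", "农业/n 供给 侧 改革"],
}
freq = count_terms_by_year(sample, ["农业", "脱贫"])
print(freq)
```

Because matching is done on whole tokens rather than substrings, the result depends directly on the quality of the word segmentation, which is one reason a manually segmented corpus matters for this kind of count.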

CLC Number:

  • G353

Cite this article

HUANG Shuiqing, WANG Dongbo. Diachronic Analysis of Word Frequency in the First Document of Chinese Party Central Committee Based on People's Daily Corpus[J]. Journal of Library and Information Science in Agriculture, 2020, 32(3): 4-9.