Libraly Journal ›› 2020, Vol. 39 ›› Issue (11): 97-105.
• DIGITAL HUMANITIES • Previous Articles Next Articles
Qin Heran, Wang Dongbo
Online:
Published:
Abstract: As an interdisciplinary subject, Digital Humanities emphasizes the integration and development of computing technology and humanities. Ancient Chinese classics is an important part of the study of humanities. In this context, we use computer technology to extract keywords from the digitized classics of the Spring and Autumn period, so as to analyze the distribution of keywords in the classics of the Spring and Autumn period. In this paper, three keyword extraction algorithms are used, which are based on unsupervised textrank algorithm, traditional TF-IDF algorithm and LDA topic model algorithm. Based on evaluation method of pooling, it is found that textrank algorithm can extract better keywords with an accuracy of 84%. The accuracy of traditional TF-IDF algorithm and LDA topic model algorithm is 62% and 74% respectively. At the same time, according to the keywords drawn out, we can find that the chronicles of the Spring and Autumn period mainly focus on the interrogation, alliance, expedition, marriage and funeral, usurpation and killing among the vassal states. Keywords Digital Humanities, TextRank,
Qin Heran, Wang Dongbo. The Application of Key Words Extraction in Pre-Qin Ancient Chinese: Taking the Spring and Autumn Annals as an Example for Digital Humanities[J]. Libraly Journal, 2020, 39(11): 97-105.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.libraryjournal.com.cn/EN/
https://www.libraryjournal.com.cn/EN/Y2020/V39/I11/97