Library Journal (图书馆杂志)

Library Journal, 2023, Vol. 42, Issue 388: 82-88.

• Digital Humanities •

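Research on Construction and Application of Knowledge Graph of Vocabulary Interpretation in Ancient Classical Dictionaries

Qian Zhiyong1, Chen Tao2, Xu Yi1, Li Qiang1, Zhang Da3
(1 Nantong University Library; 2 School of Information Management, Sun Yat-sen University; 3 School of Liberal Arts, Nantong University)

  • Publication date: 2023-08-15 Release date: 2023-08-23
  • About the authors:
    Qian Zhiyong: Director, Institute of Scientific and Technical Information, Nantong University; Deputy Director of the Information Retrieval Teaching and Research Office, Nantong University Library; Research Librarian. Research interests: knowledge organization of ancient documents, digital humanities. Contribution: designed the overall framework of the paper and wrote the full text. E-mail: qzy@ntu.edu.cn. Nantong, Jiangsu 226007.
    Chen Tao: Associate Professor, School of Information Management, Sun Yat-sen University. Research interests: Semantic Web technology, knowledge organization, digital humanities. Contribution: ontology construction for the dictionary knowledge graph and linked data publishing technology. Guangzhou, Guangdong 510006.
    Xu Yi: Director and Professor, Nantong University Library. Research interests: collection and collation of Chinese-language classics preserved outside China, and digital humanities research. Contribution: analysis of the compilation style and characteristics of ancient dictionaries, and analysis of the role of dictionary knowledge graphs. Nantong, Jiangsu 226019.
    Li Qiang: Deputy Director of the Collection Development Department and Associate Research Librarian, Nantong University Library. Research interests: document digitization, digital humanities. Contribution: digitization and semantic annotation of the dictionary Erya Zhushu (《尔雅注疏》). Nantong, Jiangsu 226019.
    Zhang Da: Student, School of Liberal Arts, Nantong University. Research interests: Chinese linguistics and philology (Classical Chinese), digital humanities. Contribution: text recognition and entity extraction for Erya (《尔雅》). Nantong, Jiangsu 226019.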

Abstract: Ancient classical dictionaries are an important part of the cultural heritage held in library collections, and organizing the knowledge they contain enables semantic association and knowledge discovery across Classical Chinese vocabulary. Knowledge graphs are a significant knowledge organization technique in artificial intelligence. A dictionary knowledge graph is composed of entities, concepts and the relationships among them, and can support intelligent retrieval and knowledge discovery. This article analyzes in depth the compilation style and content structure of ancient dictionaries, designs a dictionary knowledge ontology based on lexical semantics, termontography and the Resource Description Framework (RDF) standard, and proposes a framework for constructing dictionary knowledge graphs on the basis of a structured knowledge representation of word symbol, concept and definition. Following this framework, the article extracts vocabulary concepts, relations, definitions and examples, and completes the conversion, storage and publication of linked data. Finally, taking the classical dictionary Erya as an example, the article describes the construction process of a knowledge graph for ancient dictionaries and its application in scenarios such as semantic search, intelligent learning and digital humanities. Constructing dictionary knowledge graphs can promote digital humanities research on classical dictionaries.
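
To make the word symbol, concept and definition representation concrete, the following is a minimal Python sketch of how one dictionary entry might be turned into RDF-style triples and queried. It is an illustration under stated assumptions and not the authors' implementation: the `http://example.org/erya/` namespace, the property names `denotes`, `definition` and `source`, and all function names are hypothetical, since the abstract does not give the actual ontology vocabulary. The sample entry is an abbreviated form of the opening gloss of the Shigu (释诂) chapter of Erya, in which several headwords are all glossed as 始 ("beginning").

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical namespace and property IRIs, for illustration only;
# the article's real ontology terms are not stated in the abstract.
BASE = "http://example.org/erya/"
DENOTES = f"<{BASE}ont/denotes>"        # word symbol -> concept
DEFINITION = f"<{BASE}ont/definition>"  # concept -> gloss text
SOURCE = f"<{BASE}ont/source>"          # concept -> chapter it comes from

Triple = Tuple[str, str, str]


@dataclass(frozen=True)
class Entry:
    """One dictionary gloss: several headwords sharing one definition."""
    headwords: Tuple[str, ...]
    concept: str
    definition: str
    source: str


def iri(kind: str, label: str) -> str:
    """Build an IRI such as <http://example.org/erya/word/初>."""
    return f"<{BASE}{kind}/{label}>"


def entry_to_triples(entry: Entry) -> List[Triple]:
    """Convert an entry into (subject, predicate, object) triples."""
    concept = iri("concept", entry.concept)
    triples: List[Triple] = [
        (concept, DEFINITION, f'"{entry.definition}"@zh'),
        (concept, SOURCE, f'"{entry.source}"@zh'),
    ]
    # Each headword is a word symbol that denotes the shared concept.
    triples += [(iri("word", w), DENOTES, concept) for w in entry.headwords]
    return triples


def to_ntriples(triples: List[Triple]) -> str:
    """Serialize triples as N-Triples-style lines (one statement per line)."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


def related_words(triples: List[Triple], word: str) -> List[str]:
    """Find other word symbols that denote the same concept as `word`."""
    subject = iri("word", word)
    concepts = {o for s, p, o in triples if s == subject and p == DENOTES}
    return sorted(
        s for s, p, o in triples
        if p == DENOTES and o in concepts and s != subject
    )


# Abbreviated sample entry from Erya, Shigu chapter (only four headwords kept).
ENTRY = Entry(
    headwords=("初", "哉", "首", "基"),
    concept="始",
    definition="始也",
    source="尔雅·释诂",
)
TRIPLES = entry_to_triples(ENTRY)

if __name__ == "__main__":
    print(to_ntriples(TRIPLES))
    print(related_words(TRIPLES, "初"))
```

Linking every headword to a shared concept node, instead of linking headwords to each other directly, is one simple way to support the kind of semantic search the abstract mentions: a lookup for one word reaches its co-glossed words through the concept. A production system would more likely use an RDF library and a triple store than plain tuples.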