Libraly Journal

Libraly Journal ›› 2025, Vol. 44 ›› Issue (415): 75-87.

Previous Articles     Next Articles

Dual-Channel Multi-Granularity Feature Fusion for Named Entity Recognition in Ancient Poetry A Case Study of the Tang and Song Dynasties

Zhao Jingsheng1 2 Yang Xinyi1 Qu Weilong1 Zheng Jiashang1Zhu Qiaoming2(1 School of Information and Control Engineering Qingdao University of Technology 2 School of Computer Science and Technology Soochow University)   

  • Online:2025-11-15 Published:2025-11-26
  • About author:

    Zhao Jingsheng1 2 Yang Xinyi1 Qu Weilong1 Zheng Jiashang1Zhu Qiaoming2(1 School of Information and Control Engineering Qingdao University of Technology 2 School ofComputer Science and Technology Soochow University)

Abstract:

In response to the scarcity of training data in the field of ancient Chinese poetry we constructthe POEM-NER dataset for ancient poetry focusing on collecting descriptive poetry data related to naturalscenery and utilizing the BIOES method for entity annotation. To overcome the limitations of traditionalnamed entity recognition NER methods in handling the complex sentence structures of ancient poetryand the extraction of relatively singular features we propose the DMFF-APNER method based on dualchannelmulti-granularity feature fusion for ancient poetry NER. First we pre-train the SikuBERTmodel on ancient poetry corpora obtain multi-granularity feature vectors using Word2Vec and use FLATfor embedding fusion. Next the information vectors are input into the BiLSTM and IDCNN dualchannelsfor parallel feature extraction. The extracted contextual and local features are then dynamicallyfused through an attention mechanism. Finally predicted sequence labels are obtained through CRFdecoding. Comparative and ablation experiments on the open-source C-CLUE dataset and the POEM-NERdataset demonstrate that DMFF-APNER model can effectively utilize multi-granularity features to enhancethe model’s semantic representation capability. Furthermore the use of dual-channel technology in the feature extraction layer complements the two types of features thereby significantly improving entityextraction performance. The F1 values on the C-CLUE dataset and the POEM-NER ancient poetry datasetreach 82. 66% and 86. 02% respectively.

Key words:

Named entity recognition, Multi-granularity feature fusion, Dual-channel, Ancient Poetry, Digital humanities