Library Journal

Library Journal ›› 2025, Vol. 44 ›› Issue (415): 28-39.


Improving the Accuracy of Scientific Literature Retrieval through Term Weighting Algorithms Based on Semantic Information

Zhang Min1,4, Li Wei2, Fan Qing3 (1 Wuhan Library, Chinese Academy of Sciences; 2 Wuhan Vocational College of Software and Engineering, Wuhan Open University; 3 National Cultural Industry Research Center of Central China Normal University; 4 Hubei Key Laboratory of Big Data in Science and Technology)

  • Online: 2025-11-15  Published: 2025-11-26

Abstract:

Traditional term weighting algorithms often overlook the semantic information of terms in scientific literature, so they fail to assess term importance accurately, which leads to low retrieval accuracy for scientific literature. To make full use of this semantic information, this paper proposes a semantic-based term weighting algorithm aimed at improving retrieval accuracy in scientific literature. The proposed algorithm uses a semantic information weight to measure the importance of a term's semantic information. At the same time, it calculates a keyword weight for each term based on the TF-IDF algorithm. Combining these two weights yields a comprehensive term weight that gauges the importance of the term. In the experiments, the proposed term weighting algorithm outperformed the traditional TF-IDF and BM25 algorithms, effectively improving the accuracy of scientific literature retrieval.
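
The two-weight idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the semantic information weight is computed or how the two weights are combined, so everything beyond plain TF-IDF here is an assumption: the semantic weights are supplied as a hypothetical precomputed dictionary, the combination is a simple linear mix controlled by a hypothetical parameter `alpha`, and the function names `tf_idf` and `combined_weight` are illustrative only. The paper's actual formulas may differ.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Uses relative term frequency and the plain idf = log(N / df) variant
    (an assumption; the abstract does not state which variant is used).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def combined_weight(keyword_w, semantic_w, alpha=0.5):
    """Mix keyword and semantic weights linearly (assumed form).

    alpha is a hypothetical mixing parameter; terms with no semantic
    weight are treated as having a semantic weight of 0.0.
    """
    return {
        term: alpha * w + (1 - alpha) * semantic_w.get(term, 0.0)
        for term, w in keyword_w.items()
    }


if __name__ == "__main__":
    docs = [["term", "weighting", "retrieval"], ["semantic", "retrieval"]]
    # Hypothetical semantic information weights, e.g. from an external model.
    semantic = {"semantic": 0.9, "term": 0.4}
    keyword = tf_idf(docs)
    print(combined_weight(keyword[1], semantic))
```

In this toy corpus, "retrieval" occurs in every document, so its TF-IDF keyword weight is zero, and its combined weight depends entirely on whatever semantic weight it is given.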

Key words:

Term weighting, Semantic information weight, Search accuracy, Scientific literature retrieval, TF-IDF