Libraly Journal

Libraly Journal ›› 2025, Vol. 44 ›› Issue (407): 113-127.

Previous Articles     Next Articles

Empowering Computational Research in the History of Ideas with Generative Artificial Intelligence: Model Construction and Applications  

Liu Jiangfeng, Zhang Ran, Zhang Jundong, Pei Lei (1 Data Intelligence and Cross Inno­vation Laboratory, Nanjing University; 2 School of Information Management, Nanjing University)   

  • Online:2025-03-15 Published:2025-03-17
  • About author:

    Liu Jiangfeng, Zhang Ran, Zhang Jundong, Pei Lei (1 Data Intelligence and Cross Inno­vation Laboratory, Nanjing University; 2 School of Information Management, Nanjing University)

Abstract:

The large language model has changed the natural language processing and is enhancing the computational analysis of historical texts. Taking the Baichuan Large Language Model as the benchmark model and using the text of the book series Biographies of Chinese Thinkers as the data source, the Thinkers Model was constructed by using domain-specific pre-training, supervised fine-tuning, and direct preference optimization, whereas the performance was evaluated. Evaluation results show that the Thinkers Model outperforms general models in this specialized domain, demonstrating its potential in computational humanities research. The Thinker Model reduces the professional barriers to knowledge exchange and can address challenges in natural language interpretation within computational humanities research.

Key words: Computational Historiography, AIGC, Thinkers, Large language model, Biographies of Chinese Thinkers, Computational humanities