Libraly Journal

Libraly Journal ›› 2023, Vol. 42 ›› Issue (392): 22-35.

Previous Articles     Next Articles

Data Governance and Application Development of LargeLanguage Models in Library and Information Services

Liu Qianqian1, Liu Shengying2, Liu Wei1(1 Shanghai Library; 2 East China Normal University Library   

  • Online:2023-12-15 Published:2023-12-29
  • About author:Liu Qianqian1, Liu Shengying2, Liu Wei1(1 Shanghai Library; 2 East China Normal University Library

Abstract:

This article primarily discusses the data governance requirements and development patternsof large language models in the field of library and information science. Large language models rely onmassive amounts of text data for unsupervised pre-training and supervised fine-tuning. Domain-specificlarge models, on the other hand, are models that have been fine-tuned on domain-specific data to possessdomain knowledge and solve domain-specific problems to meet the needs of domain applications.The article aims to explore how to better apply generative artificial intelligence to libraries and relatedindustries to promote the development of smart libraries and provide the driving force for high-qualityservices. The article first reviews the breakthrough progress of generative artificial intelligence, and thenintroduces the basic principles and current applications of large models, as well as analyzes the datafactors and data requirements behind the various task capabilities of large models. Finally, the articlediscusses the application potential and development patterns of domain-specific large models. The maincontribution of this article is to analyze the application patterns and data governance of large models inthe field of library and information services, providing a theoretical basis and practical guidance for theapplication of generative artificial intelligence technology in the library industry. At the same time, it alsodiscusses the issues and limitations that need to be considered when applying and evaluating industryspecificlarge models.