Libraly Journal

Libraly Journal ›› 2022, Vol. 41 ›› Issue (3): 126-134.

Previous Articles     Next Articles

Research of Instrument Named Entity Recognition (NER) in Research Paper Based on the Full Text

Fan Wuyou (Shanghai Jiao Tong University Library)   

  • Online:2022-03-15 Published:2022-03-21
  • About author:Fan Wuyou (Shanghai Jiao Tong University Library)

Abstract: The full-text of research papers contain information of instruments which have not been
recorded. The effective extraction of the instrumental information from the text can be used as the basis
for quantitative research such as instrument performance evaluation. In this paper, chemical papers and
large-scale analytical instruments are taken as the object, and the unknown instrument name is found from
the literature through semantic similarity and word formation rules, the instrument name fuzzy is retrieved
for PDF typesetting, and the distinction between the actual used instruments and the unused instruments
or the entities with the same name is made based on document type, main text end identification, usage
identification words and corresponding relationship of full name abbreviation. The accuracy of such
efforts is verified by comparing with the results of manual annotation.