Library Journal (图书馆杂志)

Library Journal (图书馆杂志), 2017, Vol. 36, Issue 7: 60-65.

• Practice Research •

Research on Crowdsourced Text Building for Image Resources in Digital Libraries

Fu Yuean (付跃安)

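  1. Fu Yuean (付跃安), PhD, research librarian, deputy director of the Network Services Department, Guangzhou Library. Research interests: digital library development and services, digital library user experience, and children's reading promotion. E-mail: 1215022629@qq.com. Guangzhou, Guangdong 510623
  • Online: 2017-07-18 Published: 2017-11-13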

Crowdsourcing Text-building of Picture Resource in Digital Libraries

Fu Yuean   

  • Online: 2017-07-18 Published: 2017-11-13

Abstract (Chinese original, translated):

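As public digital libraries develop, building text for image resources has become an important task. This paper introduces the concept of crowdsourcing and examines the issues involved in applying it to text building. A project must first undergo a professional assessment, on the basis of which an appropriate building strategy and implementation method are chosen; the former includes image transcription and text correction, and the latter includes side-by-side form input, CAPTCHAs, and games. Once the processed data have been obtained, the correctness of the results must also be judged, and the paper proposes evaluation methods including manual review, inference-based evaluation, and consistency checking. Finally, the paper offers suggestions for ensuring the quality of crowdsourced text building.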

Keywords (Chinese original, translated):

Digital libraries, crowdsourcing, image resources, text building

Abstract:

As digital public libraries grow in popularity, building text for picture resources has become an important task. This paper studies how crowdsourcing can be incorporated into text-building. First, a professional evaluation of the project should be carried out; on that basis, building strategies (e.g., transcription and proofreading) and implementation methods (e.g., text input, CAPTCHA, games) are selected. The text contributed by volunteers must then be checked. This paper summarizes three ways of checking: human checking, inference-based checking, and consistency checking. Finally, four measures for ensuring text-building quality are discussed.

Key words:

Digital libraries, Crowdsourcing, Picture resource, Text-building