图书馆杂志

图书馆杂志

• 理论探索 • 上一篇    下一篇

图书馆法与知识产权保护研究 文本和数据挖掘的社会、政治和法律问题

刘晶晶   

  • 出版日期:2017-02-21 发布日期:2017-02-25
  • 作者简介:刘晶晶 女,中国科学院文献情报中心。研究方向:数据期刊、数据出版;图书馆著作权;科技信息编辑与传播。E-mail:liujingjing@mail.las.ac.cn北京 100190

The Social, Political and Legal Aspects of Text and Data Mining (TDM)

Liu Jingjing   

  • Online:2017-02-21 Published:2017-02-25

摘要:

文本和数据挖掘(textual or data mining,简称TDM)的概念及其随后的分析可以追溯到几百甚至上千年前。文本和数据分析最初是手工进行的,目前它已经发展成为一种新的工具,能够帮助科研人员从文本语料库得出新见解。然而,为发掘TDM的潜在益处,我们需要克服一些非技术壁垒。这些壁垒包括复杂的著作权、数据库权利和许可造成的法律不确定性;一些出版商目前并不支持TDM提供给学术界的机会;很多学者缺乏对TDM的认识以及相关工具技能的使 用。

关键词: 文本和数据挖掘, 社会, 政治, 法律

Abstract:

The ideas of textual or data mining (TDM) and subsequent analysis go back hundreds, if not thousands, of years. Originally carried out manually, textual and data analysis has long been a tool enabling new insights drawn from text corpora. However, for the potential benefits of TDM to be unlocked,
a number of non-technological barriers need to be overcome. These include legal uncertainty resulting from complicated copyright, database rights and licensing, the fact that some publishers are not currently embracing the opportunities TDM offers for the academic community, and a lack of awareness of TDM among many academics, alongside a gap of skills.

Key words: Text and data mining, Social, Political, Legal