图书馆杂志

图书馆杂志 ›› 2026, Vol. 45 ›› Issue (2): 69-81.

• 数据科学 • 上一篇    下一篇

公安系统内个人数据的敏感性识别与隐私计量研究

王清飞,臧国全,张凯亮,肖洋,柴文科,李哲,张恒苗   

  • 出版日期:2026-02-15 发布日期:2026-02-27
  • 作者简介:王清飞 郑州大学信息管理学院,博士研究生,副研究馆员。研究方向:数据隐私。作者贡献:论文写作与修改。E-mail: 43951043@qq.com 河南郑州450001
    臧国全 郑州大学信息管理学院,博士,教授,博士生导师。研究方向:数据隐私。作者贡献:确定选题、提出研究思路、终稿修订。 河南郑州450001
    张凯亮 郑州大学信息管理学院,博士,副研究员。研究方向:数据隐私。作者贡献:论文框架设计。河南郑州450001
    肖洋 郑州大学信息管理学院,博士研究生。研究方向:数据隐私。作者贡献:数据收集和处理。河南郑州450001
    柴文科联勤保障部队工程大学,助理馆员。研究方向:数据隐私。作者贡献:图表制作。重庆400030
    李哲 四川警察学院,讲师。研究方向:治安管理。作者贡献:数据搜集。四川泸州646000
    张恒苗 郑州大学信息管理学院,在读本科生。研究方向:数据挖掘。作者贡献:数据处理。河南郑州450001

Research on Sensitivity Identification and Privacy Measurement of Personal Data Within the Public Security System

Wang Qingfei, Zang Guoquan, Zhang Kailiang, Xiao Yang, Chai Wenke, Li Zhe, Zhang Hengmiao   

  • Online:2026-02-15 Published:2026-02-27
  • About author:Wang Qingfei, Zang Guoquan, Zhang Kailiang, Xiao Yang, Chai Wenke, Li Zhe, Zhang Hengmiao

摘要: 我国《数据安全法》将分类分级作为数据保护的基本制度,规定公安机关应对工作中收集和产生的数据安全负责,但目前公安行业标准缺乏公安数据安全分级依据。本研究计量公安个人数据隐私值,为该类数据分级提供定量依据。通过筛选公安领域4类隐私文本,构建隐私文本库;抽取公安数据项、核心动词和程度词等语义元素,建立公安隐私语义词表;依据隐私词汇的敏感程度、语义强度和文本力度等3个指标,构建隐私计量模型,计量公安个人数据项的隐私值。根据计量结果,将公安个人数据分为4个级别:刑侦经侦数据、个人资讯数据、个人基本数据和治安管理数据、交通管理数据。

关键词: 公安个人数据, 公安数据隐私, 公安隐私计量, 公安敏感数据单元

Abstract: Chinas Data Security Law regards classification and grading as the basic system for data protection. It stipulates that public security organs should be responsible for the security of data collected and generated in the course of their work. However, the current public security industry standards lack a basis for the classification of public security data security. This study measures the privacy value of personal data within the public security system and provides a quantitative basis for data classification. A privacy text database is constructed by screening four types of privacy texts in the public security field. Semantic elements such as public security data items, core verbs, and degree modifiers, are extracted to establish a public security privacy semantic lexicon. Based on the sensitivity, semantic strength, and textual intensity of privacy vocabulary, a privacy measurement model is constructed to measure the privacy value of personal data items in public security. According to the measurement results, public security personal data are classified into four levels: (1) criminal and economic investigation data, (2) personal information data, (3)basic personal data and public security management data, and(4)traffic management data.