图书馆杂志

图书馆杂志 ›› 2026, Vol. 45 ›› Issue (4): 108-121.

• 信息管理 • 上一篇    下一篇

多维特征融合视角下虚假健康信息识别研究——以饮食健康信息为例

范静,朱云琴,余意   

  • 出版日期:2026-04-15 发布日期:2026-04-29
  • 作者简介:范静  北京外国语大学国际商学院,教授,博士,博士生导师。研究方向:商务智能、在线健康。作者贡献:论文框架设计及修改。E-mail:fanjing@bfsu.edu.cn  北京 100089
    朱云琴  北京外国语大学国际商学院,博士研究生。研究方向:管理信息系统、在线健康服务。作者贡献:数据处理、论文撰写及修改。北京 100089
    余意  北京外国语大学国际商学院,硕士研究生。研究方向:信息系统、在线健康。作者贡献:数据收集、数据清理。北京 100089

A Study on Health Misinformation Detection from a Multidimensional Feature Fusion Perspective:  A Case Study of Dietary Health Information

Fan Jing, Zhu Yunqin, Yu Yi   

  • Online:2026-04-15 Published:2026-04-29
  • About author:Fan Jing, Zhu Yunqin, Yu Yi

摘要: 文章基于微信公众号健康类文本进行特征挖掘,探究如何有效提高虚假健康信息识别的准确性,提升网络平台的健康信息质量,为用户健康决策提供参考。从内容特征、情感特征、发布者特征和领域特征4个维度出发,提出一种基于多维特征融合和多层感知机(MDFF-MLP)的虚假健康信息识别方法。首先,分析文章内容并提取多维关键特征;其次,使用特征消融实验确定表意丰富、真假区分能力强的特征组合,优化饮食健康领域的情感词典,构建基于“内容情感发布者领域”多维度融合的多层感知机模型;最后,利用深度置信网络学习多维特征并进行虚假饮食健康信息分类。结果显示:在MDFF-MLP模型中,内容特征、发布者特征的各单维特征对文本的真假区分度是0.90、0.84,多维特征融合后准确率达到0.96,F1值达到0.95,相较于逻辑回归(F1=0.82)和LightGBM(F1=0.89)等基线模型取得较大提升,具有较高的实用性。

关键词: 特征融合, 虚假健康信息, 信息治理, 多层感知机, 深度学习

Abstract: Based on the health-related texts from WeChat public accounts, the study conducts feature mining to explore how to effectively improve the accuracy of false health information detection, enhance the quality of healthy information on online platforms, and provide reference for users to make health-related decisions.Starting with four dimensions—content features, emotional features, publisher features and domain features, this paper proposes a health misinformation recognition method based on multidimensional feature fusion and multi-layer perceptron method(MDFF-MLP). Firstly, the textual content is analyzed to extract multidimensional key features. Secondly, feature ablation experiments are conducted to determine feature combinations with rich meaning and strong ability to distinguish true and false information. The sentiment lexicon in the dietary health domain is optimized, and a multilayer perceptron model based on the multi-dimensional fusion of “content-emotion-publisher-domain” features is constructed. Finally, a deep belief network is used to learn multidimensional features and classify dietary health misinformation. The findings indicate that in the MDFF-MLP model, the discriminative power of single-dimensional features reaches 0.90 for content features and 0.84 for publisher features. After multidimensional feature fusion, the model achieves an accuracy of 0.96, with an F1 score of 0.95. Compared with baseline models such as Logistic regression(F1=0.82) and LightGBM(F1=0.89), this model demonstrates significant improvement and strong practical applicability. 

Key words: Feature fusion, Health misinformation, Information governance, Multi-layer , perceptron, Deep learning