图书馆杂志

图书馆杂志 ›› 2026, Vol. 45 ›› Issue (5): 54-62.

• 全民阅读推广学坛 • 上一篇    下一篇

多模态AI赋能沉浸式阅读空间的路径分析、模型构建与优化策略——基于具身认知理论的视角

蔡文杰,王晨敏   

  • 出版日期:2026-05-15 发布日期:2026-05-27
  • 作者简介:蔡文杰  上海图书馆(上海科学技术情报研究所),工程师。研究方向:公共文化服务、人工智能技术研究与应用。作者贡献:论文框架设计、数据收集、撰写及修改。E-mail:wjcai@libnet.sh.cn  上海 200031
    王晨敏  上海图书馆(上海科学技术情报研究所),馆员。研究方向:历史文献红色资源开发利用、中华优秀传统文化阅读推广。作者贡献:论文修改。上海 200031

Path Analysis, Model Construction, and Optimization Strategy for Multimodal AI-Enabled Immersive Reading Spaces: From the Perspective of Embodied Cognition Theory

Cai Wenjie, Wang Chenmin   

  • Online:2026-05-15 Published:2026-05-27
  • About author:Cai Wenjie, Wang Chenmin

摘要: 在人工智能驱动下,公共文化服务的数字化转型正由技术赋能迈向认知驱动。然而,当前沉浸式体验仍面临交互深度不足与认知效果有限等挑战。本文基于具身认知理论,聚焦多模态AI在沉浸式阅读空间中的嵌入机制与认知路径重构,构建“感知输入—身体介入—情境耦合—行为认知—意义生成”五元分析框架,并以上海图书馆东馆“马可·波罗奇迹之旅”“遇见东坡”“未来之未来”3个典型项目为案例,提炼出感官驱动型、行为介入型与语义嵌套型三类具身认知模型。在此基础上,提出构建多模态通感协同机制、深化自然交互与行为驱动叙事、推动文化语义的空间化嵌套、建立知识共创与反馈闭环、联结个体体验与集体文化认同等优化策略,以推动公共文化服务实现从技术展示到认知激活的深度转型,为公共文化空间实践提供理论支撑与方法参考。

关键词: 多模态AI, 沉浸式阅读空间, 具身认知理论, 智慧图书馆, 认知机制

Abstract: AI driven digital transformation of public cultural services is transitioning from technological empowerment to cognition-driven paradigms. However, immersive experiences still face challenges such as insufficient interaction depth and limited cognitive effectiveness. Based on embodied cognition theory, this study examines the embedding mechanisms of multimodal AI in immersive reading spaces and the reconstruction of cognitive pathways, constructing a five-element analytical framework of “perceptual input—bodily engagement—contextual coupling—behavioral cognition—meaning generation”. Through case studies of three representative projects at the Shanghai Library East—Marco Polo‘s Journey of Wonders, Encountering Su Dongpo, and The Future of the Future—it distills three embodied cognition models: sensory-driven, behavior-driven, and semantic-nesting models. On this basis, it proposes optimization strategies including: constructing a multimodal synesthetic synergy mechanism, deepening natural interaction and behavior-driven narratives, advancing the spatial embedding of cultural semantics, establishing a knowledge co-creation and feedback loop, and connecting individual experiences with collective cultural identity. These strategies aim to facilitate a transformation of public cultural services from technology demonstration to deep cognitive activation, providing theoretical support and methodological references for the practice of public cultural spaces.