农业图书情报学报 ›› 2026, Vol. 38 ›› Issue (1): 4-17.doi: 10.13998/j.cnki.issn1002-1248.25-0734

• 具身智能专题 •    下一篇

人智交互视角下社交机器人情感计算研究——文献综述与理论模型构建

吴丹1,3, 徐华卿1,2   

  1. 1. 武汉大学 人机交互与用户行为研究中心,武汉 430072
    2. 武汉大学 信息管理学院,武汉 430072
    3. 华中师范大学 信息管理学院,武汉 430079
  • 收稿日期:2025-11-22 出版日期:2026-01-05 发布日期:2026-03-10
  • 作者简介:

    吴丹(1978- ),女,博士,教授,研究方向为人机交互、智慧图书馆、用户信息行为

    徐华卿(2000- ),男,博士研究生,研究方向为人机交互

  • 基金资助:
    湖北省自然科学基金创新群体项目“以人为本的人工智能创新应用”(2023AFA012)

Affective Computing for Social Robots from the Perspective of Human-AI Interaction: A Literature Review and Theoretical Model Construction

WU Dan1,3, XU Huaqing1,2   

  1. 1. Human-Computer Interaction and User Behavior Research Center, Wuhan University, Wuhan 430072
    2. School of Information Management, Wuhan University, Wuhan 430072
    3. School of Information Management, Central China Normal University, Wuhan 430079);[58] Wang Xinxiang, Li Zihan, Wang Songyang, et al. Enhancing emotional expression in cat-like robots: Strategies for utilizing tail movements with human-like gazes[J]. Frontiers in Robotics and AI, 2024, 11: 1399012. ;[
    59. ] Gasteiger N, Lim J, Hellou M, et al. A scoping review of the literature on prosodic elements related to emotional speech in human-robot interaction[J]. International Journal of Social Robotics, 2024, 16
    4). 659-670.
    60. ] Liu Xiaozhen, Dong Jiayuan, Jeon M. Robots' "woohoo" and "argh" can enhance users' emotional and social perceptions: An exploratory study on non-lexical vocalizations and non-linguistic sounds[J]. ACM Transactions on Human-Robot Interaction, 2023, 12
    4). 1-20.
    61. ] Fiorini L, D’Onofrio G, Sorrentino A, et al. The role of coherent robot behavior and embodiment in emotion perception and recognition during human-robot interaction: Experimental study[J]. JMIR Human Factors, 2024, 11: e45494. ;[
    62. ] Heinisch J S, Kirchhoff J, Busch P, et al. Physiological data for affective computing in HRI with anthropomorphic service robots: The AFFECT-HRI data set[J]. Scientific Data, 2024, 11: 333. ;[
    63. ] Suguitan M, Depalma N, Hoffman G, et al. Face2Gesture: Translating facial expressions into robot movements through shared latent space neural networks[J]. ACM Transactions on Human-Robot Interaction, 2024, 13
    3). 1-18.
  • Received:2025-11-22 Online:2026-01-05 Published:2026-03-10

摘要:

[目的/意义] 在具身智能从工业自动化转向民生服务的战略背景下,社交机器人面临交互粘性不足与情境理解匮乏的现实困境,情感计算作为赋予机器感知、理解与模拟人类情感的核心技术,是支撑具身智能实现社会化的关键。研究旨在解析多模态感知、动态适应策略与伦理边界的技术路径,为构建负责人智交互体系提供理论参考。 [方法/过程] 遵循PRISMA导向,检索Web of Science近10年具身智能与情感计算交叉领域文献。基于具身性、技术完整性及交互实证性标准筛选,因内容完整性剔除无法获取全文条目,最终选取97篇核心文献。从视觉鲁棒感知、副语言解码、生理信号洞察及多源异构数据融合等维度解析感知层级,并探讨大语言模型驱动下的生成式适应策略。 [结果/结论] 社交机器人情感计算正经历从单一信号统计向多模态语义融合、从静态规则映射向生成式动态适应的范式演进。研究证实,多模态感知的实质是对人类意图的深度解构而非简单的数据统计,基于此,本研究构建了以情境理解为起点、适应行动为核心、伦理约束为底线的动态交互框架。该框架强调,情感适应应从机械模仿转向认知共情,通过大语言模型驱动的生成式策略实现交互的个性化与连贯性,同时伦理边界并非外部附加的规制,而应是内生于算法决策的逻辑约束,旨在应对隐私不对称与心理操纵等内生风险。未来的创新范式应立足于真实环境的生态效度,通过融合长期记忆的终身学习机制对抗新奇效应的消退,并建立人在回路的安全熔断机制,从而确保具身智能在介入人类精神世界过程中的主权安全与科技向善。

关键词: 人智交互, 社交机器人, 具身智能, 情感计算, 系统性文献综述

Abstract:

[Purpose/Significance] Against the backdrop of a strategic transition from industrial efficiency to embodied intelligence within the "Silver-haired Economy," social robots are evolving from functional tools into social companions. However, the field faces a critical bottleneck: a lack of interaction stickiness and empathetic resonance, which leads to high abandonment rates. Affective computing (AC) serves as the core technology to bridge this gap by enabling machines to detect, interpret, and simulate human emotions. Unlike previous literature that often treats AC as a standalone algorithmic task, this research reconstructs the value of AC from a Human-AI Interaction (HAI) perspective. This approach responds to the national "15th Five-Year Plan" requirements for secure and controllable AI governance by integrating technical pathways with ethical boundaries. By situating social robots within complex social relationships, this study provides a theoretical roadmap for robots to transition from mechanical entities to responsible social agents, thereby supporting the high-quality development of population-centric services. [Method/Process] This study employs a systematic literature review methodology guided by the PRISMA framework to ensure scientific rigor and comprehensiveness. The Web of Science Core Collection served as the primary data source, with a search timeframe spanning from 2015 to 2025 to capture the paradigm shifts triggered by deep learning and large-scale language models. A tripartite search logic-integrating subject entities (social robots), core technologies (affective computing), and interaction contexts (human-robot interaction)-was implemented to filter relevant literature. After a multi-level screening process based on embodiment, technical integrity, and empirical validity, 97 high-quality articles were selected. The study utilizes CiteSpace for keyword clustering and citation burst analysis, mapping the evolution of the field across three distinct stages: from foundational signal processing (2018-2019) to dynamic adaptation models (2020-2022), and finally to generative-driven intelligence and ethical regulation (2023-2025). This systematic approach allows for a deep synthesis of multimodal perception technologies, including robust vision, paralinguistic decoding, and physiological signal sensing. [Results/Conclusions] The findings reveal a significant paradigm shift in affective computing for social robots, evolving from simple signal statistics to deep situational understanding and from static rule-based responses to generative dynamic adaptation. The research proposes a holistic interaction framework comprising three pillars: situational understanding, adaptive action, and ethical constraints. Situational understanding leverages multimodal semantic fusion to decode human intent beyond surface-level data, while adaptive action ensures cross-modal consistency in physical expression through generative AI and long-term memory architectures. Ethical constraints are identified as an internal safety mechanism rather than external regulations, addressing risks such as privacy asymmetry, cultural bias in datasets, and psychological manipulation stemming from high anthropomorphism. The study concludes that the future of social robotics lies in three innovative paradigms: enhancing ecological validity through real-world deployment, constructing lifelong learning mechanisms to sustain long-term relationships, and embedding "human-in-the-loop" ethical fuses directly into algorithmic architectures. Despite these advancements, the research is currently limited by a lack of diverse cultural data and long-term field studies. Future research should prioritize cross-cultural design and the development of explainable affective decision-making modules to ensure the sustainable and benevolent development of embodied intelligence in complex social environments.

Key words: Human-AI interaction, social robots, embodied intelligence, affective computing, systematic literature review

中图分类号:  G250

引用本文

吴丹, 徐华卿. 人智交互视角下社交机器人情感计算研究——文献综述与理论模型构建[J]. 农业图书情报学报, 2026, 38(1): 4-17.

WU Dan, XU Huaqing. Affective Computing for Social Robots from the Perspective of Human-AI Interaction: A Literature Review and Theoretical Model Construction[J]. Journal of library and information science in agriculture, 2026, 38(1): 4-17.