1

Towards Building Condition-Based Cross-Modality Intention-Aware Human-AI Cooperation under VR Environment

The paper proposes a condition-based multi-modal human-AI cooperation framework for enhancing user intent identification and information presentation in VR. It utilizes intent tuples and a 2-Large-Language-Models (2-LLMs) architecture, improving prompt length and response generation. A VR furniture purchasing system based on this framework outperforms in user study phases, promising personalized VR experiences.

Ziyao He, Shiyuan Li, Yunpeng Song, Zhongmin Cai

Towards Building Condition-Based Cross-Modality Intention-Aware Human-AI Cooperation under VR Environment

Enhancing Augmented Reality Dialogue Systems with Multi-Modal Referential Information

This paper introduces a novel approach to AR dialogue systems, utilizing the SIMMC2-Point dataset to incorporate pointing modality. It employs BART and CLIP models to design multi-modal dialogues capturing spatial and attribute data. Ablation experiments underscore the pointing modality’s importance, advancing AR dialogue systems for immersive interactions.

Ziyao He, Zhongmin Cai

Enhancing Augmented Reality Dialogue Systems with Multi-Modal Referential Information

Interaction of Thoughts: Towards Mediating Task Assignment in Human-AI Cooperation with a Capability-Aware Shared Mental Model

Our paper proposes a novel approach to task assignment in human-AI cooperation, utilizing the capability-aware shared mental model with the unified form of tuples to represent task-specific capabilities of both human and AI. Results from our user study show that this approach improves accuracy and time efficiency while facilitating better understanding of each team member’s capabilities.

Ziyao He, Yunpeng Song, Shurui Zhou, Zhongmin Cai

Interaction of Thoughts: Towards Mediating Task Assignment in Human-AI Cooperation with a Capability-Aware Shared Mental Model