VR

Towards Building Condition-Based Cross-Modality Intention-Aware Human-AI Cooperation under VR Environment

The paper proposes a condition-based, multi-modal human-AI cooperation framework for improving user intent identification and information presentation in VR. It uses intent tuples and a two-LLM (2-LLMs) architecture to improve prompt length and response generation. A VR furniture purchasing system built on this framework performed better across the phases of a user study, pointing toward more personalized VR experiences.

Ziyao He, Shiyuan Li, Yunpeng Song, Zhongmin Cai

Enhancing Augmented Reality Dialogue Systems with Multi-Modal Referential Information

This paper introduces a novel approach to AR dialogue systems that incorporates the pointing modality using the SIMMC2-Point dataset. It uses BART and CLIP models to build multi-modal dialogues that capture spatial and attribute information. Ablation experiments underscore the importance of the pointing modality, advancing AR dialogue systems for immersive interaction.

Ziyao He, Zhongmin Cai
