Academic
Academic
Home
Projects
Publications
Talks
Contact
Light
Dark
Automatic
1
Towards Building Condition-Based Cross-Modality Intention-Aware Human-AI Cooperation under VR Environment
The paper proposes a condition-based multi-modal human-AI cooperation framework for enhancing user intent identification and information presentation in VR. It utilizes intent tuples and a 2-Large-Language-Models (2-LLMs) architecture, improving prompt length and response generation. A VR furniture purchasing system based on this framework outperforms in user study phases, promising personalized VR experiences.
Ziyao He
,
Shiyuan Li
,
Yunpeng Song
,
Zhongmin Cai
PDF
Cite
Source Document
DOI
Enhancing Augmented Reality Dialogue Systems with Multi-Modal Referential Information
This paper introduces a novel approach to AR dialogue systems, utilizing the SIMMC2-Point dataset to incorporate pointing modality. It employs BART and CLIP models to design multi-modal dialogues capturing spatial and attribute data. Ablation experiments underscore the pointing modality’s importance, advancing AR dialogue systems for immersive interactions.
Ziyao He
,
Zhongmin Cai
PDF
Cite
Source Document
DOI
Interaction of Thoughts: Towards Mediating Task Assignment in Human-AI Cooperation with a Capability-Aware Shared Mental Model
Our paper proposes a novel approach to task assignment in human-AI cooperation, utilizing the capability-aware shared mental model with the unified form of tuples to represent task-specific capabilities of both human and AI. Results from our user study show that this approach improves accuracy and time efficiency while facilitating better understanding of each team member’s capabilities.
Ziyao He
,
Yunpeng Song
,
Shurui Zhou
,
Zhongmin Cai
PDF
Cite
Slides
Source Document
DOI
Cite
×