Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds
Published in arXiv preprint, 2023
This paper proposes the first large multi-modal model for open-world agents in Minecraft.
Recommended citation: S Zheng, J Liu, Y Feng, Z Lu. "Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds." 2023 arXiv preprint. arXiv:2310.13255. https://arxiv.org/abs/2310.13255