亚洲国产爱久久全部精品_日韩有码在线播放_国产欧美在线观看_中文字幕不卡在线观看

Voyager: An Open-Ended Embodied Agent with Large Language Models

1NVIDIA, 2Caltech, 3UT Austin, 4Stanford, 5ASU
*Equal contribution Equal advising
Corresponding authors: guanzhi@caltech.edu, dr.jimfan.ai@gmail.com

Abstract

We introduce Voyager, the first LLM-powered embodied lifelong learning agent in Minecraft that continuously explores the world, acquires diverse skills, and makes novel discoveries without human intervention. Voyager consists of three key components: 1) an automatic curriculum that maximizes exploration, 2) an ever-growing skill library of executable code for storing and retrieving complex behaviors, and 3) a new iterative prompting mechanism that incorporates environment feedback, execution errors, and self-verification for program improvement. Voyager interacts with GPT-4 via blackbox queries, which bypasses the need for model parameter fine-tuning. The skills developed by Voyager are temporally extended, interpretable, and compositional, which compounds the agent's abilities rapidly and alleviates catastrophic forgetting. Empirically, Voyager shows strong in-context lifelong learning capability and exhibits exceptional proficiency in playing Minecraft. It obtains 3.3x more unique items, travels 2.3x longer distances, and unlocks key tech tree milestones up to 15.3x faster than prior SOTA. Voyager is able to utilize the learned skill library in a new Minecraft world to solve novel tasks from scratch, while other techniques struggle to generalize.


Voyager discovers new Minecraft items and skills continually by self-driven exploration, significantly outperforming the baselines.

Introduction

Building generally capable embodied agents that continuously explore, plan, and develop new skills in open-ended worlds is a grand challenge for the AI community. Classical approaches employ reinforcement learning (RL) and imitation learning that operate on primitive actions, which could be challenging for systematic exploration, interpretability, and generalization. Recent advances in large language model (LLM) based agents harness the world knowledge encapsulated in pre-trained LLMs to generate consistent action plans or executable policies. They are applied to embodied tasks like games and robotics, as well as NLP tasks without embodiment. However, these agents are not lifelong learners that can progressively acquire, update, accumulate, and transfer knowledge over extended time spans.

Let us consider Minecraft as an example. Unlike most other games studied in AI, Minecraft does not impose a predefined end goal or a fixed storyline but rather provides a unique playground with endless possibilities. An effective lifelong learning agent should have similar capabilities as human players: (1) propose suitable tasks based on its current skill level and world state, e.g., learn to harvest sand and cactus before iron if it finds itself in a desert rather than a forest; (2) refine skills based on environment feedback and commit mastered skills to memory for future reuse in similar situations (e.g. fighting zombies is similar to fighting spiders); (3) continually explore the world and seek out new tasks in a self-driven manner.

Voyager Components

We introduce Voyager, the first LLM-powered embodied lifelong learning agent to drive exploration, master a wide range of skills, and make new discoveries continually without human intervention in Minecraft. Voyager is made possible through three key modules: 1) an automatic curriculum that maximizes exploration; 2) a skill library for storing and retrieving complex behaviors; and 3) a new iterative prompting mechanism that generates executable code for embodied control. We opt to use code as the action space instead of low-level motor commands because programs can naturally represent temporally extended and compositional actions, which are essential for many long-horizon tasks in Minecraft. Voyager interacts with a blackbox LLM (GPT-4) through prompting and in-context learning. Our approach bypasses the need for model parameter access and explicit gradient-based training or finetuning.



Voyager consists of three key components: an automatic curriculum for open-ended exploration, a skill library for increasingly complex behaviors, and an iterative prompting mechanism that uses code as action space.

Automatic Curriculum


Automatic curriculum. The automatic curriculum takes into account the exploration progress and the agent's state to maximize exploration. The curriculum is generated by GPT-4 based on the overarching goal of "discovering as many diverse things as possible". This approach can be perceived as an in-context form of novelty search.


Skill Library


Skill library. Top: Adding a new skill. Each skill is indexed by the embedding of its description, which can be retrieved in similar situations in the future. Bottom: Skill retrieval. When faced with a new task proposed by the automatic curriculum, we perform querying to identify the top-5 relevant skills. Complex skills can be synthesized by composing simpler programs, which compounds Voyager's capabilities rapidly over time and alleviates catastrophic forgetting.


Iterative Prompting Mechanism


Left: Environment feedback. GPT-4 realizes it needs 2 more planks before crafting sticks. Right: Execution error. GPT-4 realizes it should craft a wooden axe instead of an acacia axe since there is no acacia axe in Minecraft.



Self-verification. By providing the agent's current state and the task to GPT-4, we ask it to act as a critic and inform us whether the program achieves the task. In addition, if the task fails, it provides a critique by suggesting how to complete the task.

Experiments

We systematically evaluate Voyager and baselines on their exploration performance, tech tree mastery, map coverage, and zero-shot generalization capability to novel tasks in a new world.



Significantly Better Exploration

As shown in the first figure, Voyager's superiority is evident in its ability to consistently make new strides, discovering 63 unique items within 160 prompting iterations, 3.3x many novel items compared to its counterparts. On the other hand, AutoGPT lags considerably in discovering new items, while ReAct and Reflexion struggle to make significant progress.

Tech Tree Mastery

Tech tree mastery. The Minecraft tech tree tests the agent's ability to craft and use a hierarchy of tools. Progressing through this tree (wooden tool → stone tool → iron tool → diamond tool) requires the agent to master systematic and compositional skills. In this table, fractions indicate the number of successful trials out of three total runs. Numbers are prompting iterations averaged over three trials. The fewer the iterations, the more efficient the method. Compared with baselines, Voyager unlocks the wooden level 15.3x faster (in terms of the prompting iterations), the stone level 8.5x faster, the iron level 6.4x faster, and Voyager is the only one to unlock the diamond level of the tech tree


Extensive Map Traversal


Map coverage: Two bird's eye views of Minecraft maps. Voyager is able to navigate distances 2.3x longer compared to baselines by traversing a variety of terrains, while the baseline agents often find themselves confined to local areas, which significantly hampers their capacity to discover new knowledge.


Efficient Zero-Shot Generalization to Unseen Tasks


Zero-shot generalization to unseen tasks. We clear the agent's inventory, reset it to a newly instantiated world, and test it with unseen tasks. In the table above, fractions indicate the number of successful trials out of three total runs. Numbers are prompting iterations averaged over three trials. The fewer the iterations, the more efficient the method. Voyager can consistently solve all the tasks, while baselines cannot solve any task within 50 prompting iterations. What's interesting to note is that our skill library constructed from lifelong learning not only enhances Voyager's performance but also gives a boost to AutoGPT. This demonstrates that the skill library serves as a versatile tool that can be readily employed by other methods, effectively acting as a plug-and-play asset to enhance performance.


Ablation Studies


Ablation studies. GPT-3.5 means replacing GPT-4 with GPT-3.5 for code generation. Voyager outperforms all the alternatives, demonstrating the critical role of each component. In addition, GPT-4 significantly outperforms GPT-3.5 in code generation.

Conclusion

In this work, we introduce Voyager, the first LLM-powered embodied lifelong learning agent, which leverages GPT-4 to explore the world continuously, develop increasingly sophisticated skills, and make new discoveries consistently without human intervention. Voyager exhibits superior performance in discovering novel items, unlocking the Minecraft tech tree, traversing diverse terrains, and applying its learned skill library to unseen tasks in a newly instantiated world. Voyager serves as a starting point to develop powerful generalist agents without tuning the model parameters.

Media Coverage

"They Plugged GPT-4 Into Minecraft—and Unearthed New Potential for AI. The bot plays the video game by tapping the text generator to pick up new skills, suggesting that the tech behind ChatGPT could automate many workplace tasks." - Will Knight, WIRED

"The Voyager project shows, however, that by pairing GPT-4’s abilities with agent software that stores sequences that work and remembers what does not, developers can achieve stunning results." - John Koetsier, Forbes

"Voyager, the GTP-4 bot that plays Minecraft autonomously and better than anyone else" - Ruetir

"This AI used GPT-4 to become an expert Minecraft player" - Devin Coldewey, TechCrunch

Coverage Index: [Atmarkit] [Career Engine] [Crast.net] [Daily Top Feeds] [Entrepreneur en Espanol] [Finance Jxyuging] [Forbes] [Forbes Argentina] [Gaming Deputy] [Gearrice] [Haberik] [Head Topics] [InfoQ] [ITmedia News] [Mark Tech Post] [Medium] [MSN] [Note] [Noticias de Hoy] [Ruetir] [Stock HK] [Tech Tribune France] [TechCrunch] [TechBeezer] [Toutiao] [US Times Post] [VN Explorer] [WIRED] [Zaker]

Team

Guanzhi Wang
Yuqi Xie
Yunfan Jiang*
Ajay Mandlekar*

Chaowei Xiao
Yuke Zhu
Linxi "Jim" Fan
Anima Anandkumar

* Equal Contribution   † Equal Advising

BibTeX

@article{wang2023voyager,
  title   = {Voyager: An Open-Ended Embodied Agent with Large Language Models},
  author  = {Guanzhi Wang and Yuqi Xie and Yunfan Jiang and Ajay Mandlekar and Chaowei Xiao and Yuke Zhu and Linxi Fan and Anima Anandkumar},
  year    = {2023},
  journal = {arXiv preprint arXiv: Arxiv-2305.16291}
}
亚洲国产爱久久全部精品_日韩有码在线播放_国产欧美在线观看_中文字幕不卡在线观看

    
    

    9000px;">

      
      

      国产精品久久久久久久久免费樱桃| 一区二区三区在线播放| 日本va欧美va精品发布| 国产精品久久三区| 久久这里只有精品视频网| 欧美精品vⅰdeose4hd| 在线观看亚洲精品| 欧美影片第一页| 91成人在线精品| 色先锋资源久久综合| 91麻豆免费看片| 日本道在线观看一区二区| 色婷婷久久99综合精品jk白丝| 成人国产精品免费观看动漫| 国产精品一区二区三区99| 国产精品一级片| 青青草国产精品97视觉盛宴| 日韩黄色小视频| 午夜成人免费视频| 捆绑调教一区二区三区| 黄色成人免费在线| 成人高清视频免费观看| 一本久久a久久免费精品不卡| 色综合一个色综合亚洲| 欧美日韩在线播放三区| 日韩三级精品电影久久久 | 欧美成人女星排名| 久久综合九色综合欧美就去吻| 久久久久久一二三区| 国产精品久久久久久户外露出 | 日韩高清一区二区| 九色porny丨国产精品| 国产成人在线免费| 欧美性色aⅴ视频一区日韩精品| 欧美日韩精品欧美日韩精品一 | 久久网站最新地址| 国产精品三级电影| 午夜精品一区在线观看| 狠狠色综合色综合网络| 色婷婷香蕉在线一区二区| 538在线一区二区精品国产| 国产亚洲一区字幕| 午夜精品久久久久影视| 国产一区久久久| 欧美无砖砖区免费| 国产亚洲人成网站| 亚洲一二三四区| 国产精品一区二区久激情瑜伽 | 国产精品久久久久桃色tv| 亚洲最大色网站| 国产大陆亚洲精品国产| 欧美男人的天堂一二区| 中文子幕无线码一区tr| 免费观看日韩电影| 色94色欧美sute亚洲线路一ni| 久久久精品日韩欧美| 亚洲人成影院在线观看| 免费看日韩a级影片| 波多野结衣的一区二区三区| 欧美一区二区三区免费大片| 最好看的中文字幕久久| 国产一区二三区好的| 欧美日韩国产另类不卡| 国产精品久久久久aaaa樱花| 卡一卡二国产精品| 欧美日韩国产中文| 亚洲天堂a在线| 国产成人午夜99999| 日韩欧美国产综合| 亚洲成人av资源| 色网综合在线观看| 国产精品免费久久久久| 国产成人精品在线看| 欧美精品一区二区三区在线 | 韩国精品在线观看| 欧美美女网站色| 亚洲另类中文字| 99久久综合狠狠综合久久| 欧美精品一区二区三| 日韩一区精品视频| 91麻豆精品久久久久蜜臀| 亚洲一区在线观看网站| 欧美亚洲国产一区二区三区va| 最新不卡av在线| 91小宝寻花一区二区三区| 亚洲桃色在线一区| 日本高清不卡一区| 一区二区免费在线| 欧美群妇大交群中文字幕| 亚洲第一福利一区| 欧美久久久久中文字幕| 免费看欧美女人艹b| 日韩一卡二卡三卡| 国内国产精品久久| 久久久久国产精品麻豆ai换脸| 狠狠网亚洲精品| 国产午夜精品久久久久久免费视| 国产精品香蕉一区二区三区| 国产精品乱码一区二三区小蝌蚪| 成人听书哪个软件好| 最新国产の精品合集bt伙计| 在线看不卡av| 亚洲成人激情社区| 91精品国产福利在线观看| 乱中年女人伦av一区二区| 久久美女高清视频| 99re热这里只有精品视频| 亚洲一区二区影院| 久久亚洲综合av| 91麻豆免费观看| 免费人成网站在线观看欧美高清| 久久午夜羞羞影院免费观看| a美女胸又www黄视频久久| 香蕉av福利精品导航| 欧美精品一区二区三区久久久| av日韩在线网站| 日韩激情视频网站| 欧美韩国日本一区| 欧美二区在线观看| 成人综合婷婷国产精品久久蜜臀| 亚洲精品成人在线| 日韩欧美国产高清| 99免费精品在线| 免费欧美在线视频| 亚洲欧洲av一区二区三区久久| 欧美日韩国产另类一区| 丰满放荡岳乱妇91ww| 性久久久久久久久久久久| 中文字幕电影一区| 欧美日韩久久久| 国产精品自在在线| 午夜精品aaa| 国产精品欧美一级免费| 在线观看91精品国产麻豆| 99精品热视频| 国产乱人伦偷精品视频不卡| 亚洲成人综合在线| 18成人在线视频| 2023国产精品| 5566中文字幕一区二区电影| 色婷婷精品大在线视频| 国产一二精品视频| 青青国产91久久久久久| 一区二区三区欧美日韩| 久久精品欧美一区二区三区不卡 | 欧美r级电影在线观看| 欧美最新大片在线看| 国产不卡视频一区| 久久精品99国产精品| 午夜亚洲国产au精品一区二区| 亚洲丝袜自拍清纯另类| 国产女主播一区| 久久久一区二区三区捆绑**| 日韩精品中文字幕一区| 欧美性生活大片视频| 欧洲中文字幕精品| 日本乱人伦aⅴ精品| 95精品视频在线| av一区二区久久| 成人黄动漫网站免费app| 国产高清久久久| 国模无码大尺度一区二区三区| 午夜精品福利一区二区三区蜜桃| 亚洲精品老司机| 亚洲三级免费观看| 国产精品传媒视频| 最新热久久免费视频| 中文字幕一区二区三中文字幕| 欧美经典一区二区| 日本一区二区三区国色天香 | 亚洲综合偷拍欧美一区色| 亚洲视频每日更新| 国产精品国产三级国产三级人妇| 国产三级久久久| 国产日韩亚洲欧美综合| 国产欧美久久久精品影院| 欧美国产综合一区二区| 欧美国产日本韩| 亚洲欧美一区二区三区孕妇| 亚洲精品日韩专区silk| 一区二区三区四区在线| 亚洲综合激情网| 日韩精品欧美精品| 久久精品久久综合| 成人av网站免费| 在线免费亚洲电影| 91精品国产综合久久久蜜臀图片| 日韩一区二区影院| 久久久九九九九| 亚洲欧美日韩久久| 午夜精品久久久久久久久久 | 欧美日韩你懂得| 国产成人综合自拍| 99国产精品久久久久久久久久久| 色婷婷一区二区| 91精品国产色综合久久久蜜香臀| 精品国偷自产国产一区| 中文一区二区在线观看| 亚洲最大成人网4388xx| 精品在线视频一区|