DynaSaur: Large Language Agents Beyond Predefined Actions

要約

既存のLLMエージェントシステムは、通常、すべてのステップで固定および事前定義されたセットからアクションを選択します。
このアプローチは、閉じた狭いスコープ環境で効果的ですが、実世界の自由なシナリオに2つの主要な課題を提示します。（1）LLMエージェントの計画と演技能力を大幅に制限し、（2）可能なすべてのアクションを列挙し、実装するためにかなりの人間の努力を必要とします。
これらの制限に対処するために、必要に応じてアクションを動的に作成および構成できるLLMエージェントフレームワークを提案します。
このフレームワークでは、エージェントは、汎用プログラミング言語で記述されたプログラムを生成および実行することにより、環境と対話します。
さらに、生成されたアクションは、将来の再利用のために時間とともに蓄積されます。
複数のベンチマークにわたる広範な実験は、このフレームワークが柔軟性を大幅に向上させ、固定アクションセットに依存する以前の方法よりも優れていることを示しています。
特に、LLMエージェントは、予期せぬエッジケースのために事前定義されたアクションが不十分または失敗するシナリオで適応および回復することができます。
私たちのコードは、https：//github.com/adobe-research/dynasaurにあります。

要約(オリジナル)

Existing LLM agent systems typically select actions from a fixed and predefined set at every step. While this approach is effective in closed, narrowly scoped environments, it presents two major challenges for real-world, open-ended scenarios: (1) it significantly restricts the planning and acting capabilities of LLM agents, and (2) it requires substantial human effort to enumerate and implement all possible actions, which is impractical in complex environments with a vast number of potential actions. To address these limitations, we propose an LLM agent framework that can dynamically create and compose actions as needed. In this framework, the agent interacts with its environment by generating and executing programs written in a general-purpose programming language. Moreover, generated actions are accumulated over time for future reuse. Our extensive experiments across multiple benchmarks show that this framework significantly improves flexibility and outperforms prior methods that rely on a fixed action set. Notably, it enables LLM agents to adapt and recover in scenarios where predefined actions are insufficient or fail due to unforeseen edge cases. Our code can be found in https://github.com/adobe-research/dynasaur.

arxiv情報

著者	Dang Nguyen,Viet Dac Lai,Seunghyun Yoon,Ryan A. Rossi,Handong Zhao,Ruiyi Zhang,Puneet Mathur,Nedim Lipka,Yu Wang,Trung Bui,Franck Dernoncourt,Tianyi Zhou
発行日	2025-06-04 16:26:58+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

DynaSaur: Large Language Agents Beyond Predefined Actions

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー