Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

要約

ヒューマンモデルは、ヒューマンロボットインタラクション (HRI) において重要な役割を果たし、ロボットが自分の行動が人に与える影響を考慮し、それに応じて行動を計画できるようにします。
ただし、優れた人体モデルを作成することは困難です。
コンテキスト依存の人間の行動をキャプチャするには、かなりの事前知識および/または大量のインタラクションデータが必要ですが、どちらも入手が困難です。
この作業では、人間が生成した膨大な量のテキストデータを消費してきた大言語モデル (LLM) が、HRI のゼロショット人間モデルとして機能する可能性を探ります。
3 つのソーシャルデータセットに対する実験では、有望な結果が得られました。
LLM は、専用モデルに匹敵するパフォーマンスを達成できます。
とはいえ、プロンプトに対する感度や空間的/数値的推論の事故など、現在の制限についても説明します。
調査結果に基づいて、LLM ベースのヒューマンモデルをソーシャルロボットの計画プロセスに統合し、HRI シナリオに適用する方法を示します。
具体的には、シミュレートされた信頼ベースのテーブルクリアタスクに関する 1 つのケーススタディを提示し、カスタムモデルに依存した過去の結果を再現します。
次に、新しいロボット器具通過実験 (n = 65) を実施します。予備的な結果は、LLM ベースの人間モデルを使用した計画が、基本的な近視眼的計画よりも優れていることを示しています。
要約すると、我々の結果は、LLM が HRI の人間のモデリングに有望な (しかし不完全な) アプローチを提供することを示しています。

要約(オリジナル)

Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large-language models (LLMs) — which have consumed vast amounts of human-generated text data — to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot’s planning process and applied in HRI scenarios. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment (n = 65) where preliminary results show that planning with a LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.

arxiv情報

著者	Bowen Zhang,Harold Soh
発行日	2023-03-06 23:16:24+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー