Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions

要約

人との自然で直観的なインタラクションを実現するために、HRI フレームワークは、人間の知覚、意図の伝達、人間を認識したナビゲーション、および共同アクションのための幅広い方法を組み合わせています。
実際には、人々の予測不可能な行動や環境の予期せぬ状態に遭遇した場合、これらのフレームワークにはそのような状態を動的に認識し、適応して回復して相互作用を再開する能力が欠けている可能性があります。
大規模言語モデル (LLM) は、高度な推論機能とコンテキスト保持機能により、ロボットの適応性を高めるための有望なソリューションを提供します。
ただし、この可能性は直接インタラクション指標の改善につながるわけではありません。
この論文では、(1) 完全にスクリプト化されている、(2) LLM 強化された応答を含むという 2 つの条件で実装された、アプローチ、指示、およびオブジェクト操作を含む産業用ロボットとの代表的な対話について考察します。
視線追跡とアンケートを使用して、参加者のタスク効率、エンゲージメント、ロボットの認識を測定します。
結果は、LLM 条件の主観的評価が高いことを示していますが、客観的な指標では、特に単純なタスク中の効率と集中力において、スクリプト化された条件が同等に機能することが示されています。
また、スクリプト化された条件は、特に些細で反復的なインタラクションの場合、応答待ち時間とエネルギー消費の点で LLM 強化応答よりも優れている可能性があることにも注意します。

要約(オリジナル)

To achieve natural and intuitive interaction with people, HRI frameworks combine a wide array of methods for human perception, intention communication, human-aware navigation and collaborative action. In practice, when encountering unpredictable behavior of people or unexpected states of the environment, these frameworks may lack the ability to dynamically recognize such states, adapt and recover to resume the interaction. Large Language Models (LLMs), owing to their advanced reasoning capabilities and context retention, present a promising solution for enhancing robot adaptability. This potential, however, may not directly translate to improved interaction metrics. This paper considers a representative interaction with an industrial robot involving approach, instruction, and object manipulation, implemented in two conditions: (1) fully scripted and (2) including LLM-enhanced responses. We use gaze tracking and questionnaires to measure the participants’ task efficiency, engagement, and robot perception. The results indicate higher subjective ratings for the LLM condition, but objective metrics show that the scripted condition performs comparably, particularly in efficiency and focus during simple tasks. We also note that the scripted condition may have an edge over LLM-enhanced responses in terms of response latency and energy consumption, especially for trivial and repetitive interactions.

arxiv情報

著者	Tim Schreiter,Jens V. Rüppel,Rishi Hazra,Andrey Rudenko,Martin Magnusson,Achim J. Lilienthal
発行日	2025-01-21 13:42:06+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー