Estimating speaker direction on a humanoid robot with binaural acoustic signals

要約

音声対話中に人間のような動作を実現するには、人型ロボットが人間の話者の位置を推定する必要があります。
ここでは、人間とロボットのインタラクションシナリオのリアルタイムアプリケーションも考慮しながら、到来方向 (DOA) 推定に使用されるパラメーターを最適化する方法を紹介します。
この方法は、人型ロボット頭部上のバイノーラル音源定位フレームワークに適用されます。
この作業のために実際のデータが収集され、注釈が付けられます。
最適化は総当たり法とベイジアンモデルベースの方法によって実行され、結果が検証および議論され、リアルタイム使用のレイテンシへの影響も調査されます。

要約(オリジナル)

To achieve human-like behaviour during speech interactions, it is necessary for a humanoid robot to estimate the location of a human talker. Here, we present a method to optimize the parameters used for the direction of arrival (DOA) estimation, while also considering real-time applications for human-robot interaction scenarios. This method is applied to binaural sound source localization framework on a humanoid robotic head. Real data is collected and annotated for this work. Optimizations are performed via a brute force method and a Bayesian model based method, results are validated and discussed, and effects on latency for real-time use are also explored.

arxiv情報

著者	Pranav Barot,Katja Mombaur,Ewen MacDonald
発行日	2023-07-22 17:11:46+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Estimating speaker direction on a humanoid robot with binaural acoustic signals

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー