Obstacle Avoidance Deep Reinforcement Learning-Based Trajectory Planner with Robust Low-Level Control for Robotic Manipulators

要約

ロボット工学における現代の戦略は学習ベースであり、複雑なブラックボックスの性質と解釈可能性の欠如を特徴とし、安定性と安全性を確保する上で課題を引き起こす可能性があります。
これらの問題に対処するために、障害物のない深層強化学習 (DRL) 軌道プランナーと、新しい自動調整の低レベルおよび関節レベルの制御戦略を統合し、環境との相互作用を通じて学習フェーズに積極的に関与することを提案します。
このアプローチは、計算に関連する複雑さを回避しながら、非反復的でランダムな障害物回避タスクにも対処します。
まず、速度制限付きで障害物のない動きを計画するためのモデルフリー DRL エージェントが、関節レベルの推論を通じてタスク空間内で「n」自由度 (DoF) を持つマニピュレーターに採用されます。
この計画は、堅牢なサブシステムベースの適応コントローラーに入力され、必要なトルクが生成されます。同時に、カッコー探索最適化 (CSO) アルゴリズムが制御ゲインを強化して、目標値からの最大偏差に到達するまでの時間や安定するまでの時間を最小限に抑えます。
所望の値、および定常状態での持続的なトラッキングエラー。
このアプローチでは、未知のロボットマニピュレータモデリングにもかかわらず、不慣れな環境では位置と速度の誤差が指数関数的にゼロに収束することが保証されます。
理論的な主張は、シミュレーション結果の提示を通じて検証されます。

要約(オリジナル)

In robotics, contemporary strategies are learning-based, characterized by a complex black-box nature and a lack of interpretability, which may pose challenges in ensuring stability and safety. To address these issues, we propose integrating an obstacle-free deep reinforcement learning (DRL) trajectory planner with a novel auto-tuning low- and joint-level control strategy, all while actively engaging in the learning phase through interactions with the environment. This approach circumvents the complexities associated with computations while also addressing nonrepetitive and random obstacle avoidance tasks. First, a model-free DRL agent to plan velocity-bounded and obstacle-free motion is employed for a manipulator with ‘n’ degrees of freedom (DoF) in task space through joint-level reasoning. This plan is then input into a robust subsystem-based adaptive controller, which produces the necessary torques, while the Cuckoo Search Optimization (CSO) algorithm enhances control gains to minimize the time required to reach, time taken to stabilize, the maximum deviation from the desired value, and persistent tracking error in the steady state. This approach guarantees that position and velocity errors exponentially converge to zero in an unfamiliar environment, despite unknown robotic manipulator modeling. Theoretical assertions are validated through the presentation of simulation outcomes.

arxiv情報

著者	Mehdi Heydari Shahna,Seyed Adel Alizadeh Kolagar,Jouni Mattila
発行日	2024-02-06 06:56:07+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Obstacle Avoidance Deep Reinforcement Learning-Based Trajectory Planner with Robust Low-Level Control for Robotic Manipulators

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー