「cs.LG」カテゴリーアーカイブ

HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

投稿日: 2025年5月23日作成者: jarxiv

要約大規模な言語モデル（LLMS）と強化学習（RL）を統合すると、複雑なシナリ … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

Multi-modal Integration Analysis of Alzheimer’s Disease Using Large Language Models and Knowledge Graphs

投稿日: 2025年5月23日作成者: jarxiv

要約大規模な言語モデル（LLMS）と知識グラフを使用したアルツハイマー病（AD … 続きを読む →

カテゴリー: cs.AI, cs.LG, I.2.1 | コメントを受け付けていません

AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation

投稿日: 2025年5月22日作成者: jarxiv

要約新しい実施形態に対する制御ポリシーの一般化は、ロボット工学におけるスケーラ … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

Learning-based Airflow Inertial Odometry for MAVs using Thermal Anemometers in a GPS and vision denied environment

投稿日: 2025年5月22日作成者: jarxiv

要約この作業は、熱風速計、IMU、ESC、気圧計を含むマルチセンサーデータ融合 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Learning Novel Skills from Language-Generated Demonstrations

投稿日: 2025年5月22日作成者: jarxiv

要約ロボットは、新しいスキルを必要とするタスクに取り組むために、多様なドメイン … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

投稿日: 2025年5月22日作成者: jarxiv

要約最新のディープポリシーグラディエントメソッドは、シミュレートされたロボット … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Cascaded Diffusion Models for Neural Motion Planning

投稿日: 2025年5月22日作成者: jarxiv

要約現実の世界のロボットは、衝突せずに複雑な環境の目標を認識して移動する必要が … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

Learning-based Autonomous Oversteer Control and Collision Avoidance

投稿日: 2025年5月22日作成者: jarxiv

要約車両の後部タイヤが牽引力を失い、意図しない過度のヨーを誘発するオーバーステ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards

投稿日: 2025年5月22日作成者: jarxiv

要約四肢装置制御ポリシーは、報酬の正確な勾配を使用して高性能でトレーニングする … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Guided Policy Optimization under Partial Observability

投稿日: 2025年5月22日作成者: jarxiv

要約部分的に観察可能な環境での強化学習（RL）は、不確実性の下での学習の複雑さ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

Multi-modal Integration Analysis of Alzheimer’s Disease Using Large Language Models and Knowledge Graphs

AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation

Learning-based Airflow Inertial Odometry for MAVs using Thermal Anemometers in a GPS and vision denied environment

Learning Novel Skills from Language-Generated Demonstrations

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

Cascaded Diffusion Models for Neural Motion Planning

Learning-based Autonomous Oversteer Control and Collision Avoidance

ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards

Guided Policy Optimization under Partial Observability

最近の投稿

最近のコメント

アーカイブ

カテゴリー