「cs.AI」カテゴリーアーカイブ

ProcessBench: Identifying Process Errors in Mathematical Reasoning

投稿日: 2024年12月11日作成者: jarxiv

要約言語モデルは数学の問題を解くときに定期的に間違いを犯すため、推論プロセスに … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

RL Zero: Zero-Shot Language to Behaviors without any Supervision

投稿日: 2024年12月10日作成者: jarxiv

要約人間は与えられた報酬関数の最適な動作を予測できないことが多く、貧弱な報酬設 … 続きを読む →

カテゴリー: cs.AI, cs.GR, cs.LG, cs.RO | コメントを受け付けていません

Constrained Control for Autonomous Spacecraft Rendezvous: Learning-Based Time Shift Governor

投稿日: 2024年12月10日作成者: jarxiv

要約この論文では、二体問題の設定においてランデブーおよびドッキング (RD) … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

TrojanRobot: Backdoor Attacks Against LLM-based Embodied Robots in the Physical World

投稿日: 2024年12月10日作成者: jarxiv

要約ロボット操作とは、ロボット工学と人工知能の高度な技術を使用した、ロボットの … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion

投稿日: 2024年12月10日作成者: jarxiv

要約メトリック単眼深度推定の一般化は、その不適切な姿勢の性質により大きな課題を … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

投稿日: 2024年12月10日作成者: jarxiv

要約大規模言語モデル (LLM) の優れた機能により、LLM はさまざまな自律 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.RO | コメントを受け付けていません

Words2Contact: Identifying Support Contacts from Verbal Instructions Using Foundation Models

投稿日: 2024年12月10日作成者: jarxiv

要約このペーパーでは、大規模言語モデルとビジョン言語モデルを活用した、言語ガイ … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Large Language Model Benchmarks in Medical Tasks

投稿日: 2024年12月10日作成者: jarxiv

要約医療分野で大規模言語モデル (LLM) の適用が増えるにつれ、ベンチマーク … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Improved GUI Grounding via Iterative Narrowing

投稿日: 2024年12月10日作成者: jarxiv

要約グラフィカルユーザーインターフェイス (GUI) の基礎は、視覚言語モ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

GameArena: Evaluating LLM Reasoning through Live Computer Games

投稿日: 2024年12月10日作成者: jarxiv

要約大規模言語モデル (LLM) の推論能力を評価することは困難です。既存の … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

ProcessBench: Identifying Process Errors in Mathematical Reasoning

RL Zero: Zero-Shot Language to Behaviors without any Supervision

Constrained Control for Autonomous Spacecraft Rendezvous: Learning-Based Time Shift Governor

TrojanRobot: Backdoor Attacks Against LLM-based Embodied Robots in the Physical World

GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

Words2Contact: Identifying Support Contacts from Verbal Instructions Using Foundation Models

Large Language Model Benchmarks in Medical Tasks

Improved GUI Grounding via Iterative Narrowing

GameArena: Evaluating LLM Reasoning through Live Computer Games

最近の投稿

最近のコメント

アーカイブ

カテゴリー