"cs.LG" Category Archive

Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge

Posted: August 20, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) have revolutionized the machine learning landscape, but … Continue reading →

Categories: cs.AI, cs.LG | Comments are closed

PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars

Posted: August 20, 2024 | Author: jarxiv

Abstract: Self-ensembling with diverse reasoning paths, such as Self-Consistency, … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed

Deep Reinforcement Learning for Robotics: A Survey of Real-World Successes

Posted: August 19, 2024 | Author: jarxiv

Abstract: Reinforcement learning (RL), particularly the variant known as deep RL (DRL), … Continue reading →

Categories: cs.LG, cs.RO | Comments are closed

FedRobo: Federated Learning Driven Autonomous Inter Robots Communication For Optimal Chemical Sprays

Posted: August 19, 2024 | Author: jarxiv

Abstract: With federated learning, robots' dependence on centralized data collection … Continue reading →

Categories: cs.CV, cs.DC, cs.LG, cs.RO | Comments are closed

D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning

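Posted: August 19, 2024 | Author: jarxiv

Abstract: Offline reinforcement learning algorithms, with respect to costly or dangerous real-world exploration, … Continue reading →

Categories: cs.LG, cs.RO | Comments are closed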

MMP++: Motion Manifold Primitives with Parametric Curve Models

Posted: August 19, 2024 | Author: jarxiv

Abstract: A manifold-based approach for encoding basic motion skills, … Continue reading →

Categories: cs.AI, cs.LG, cs.RO | Comments are closed

AirPilot: A PPO-based DRL Auto-Tuned Nonlinear PID Drone Controller for Robust Autonomous Flights

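Posted: August 19, 2024 | Author: jarxiv

Abstract: Navigation accuracy, speed, and stability, for safe UAV flight maneuvers and dynamic environments, … Continue reading →

Categories: cs.LG, cs.RO, cs.SY, eess.SY | Comments are closed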

S-RAF: A Simulation-Based Robustness Assessment Framework for Responsible Autonomous Driving

Posted: August 19, 2024 | Author: jarxiv

Abstract: As artificial intelligence (AI) technology advances, the robustness of AI-driven systems … Continue reading →

Categories: cs.AI, cs.CV, cs.CY, cs.LG, cs.RO | Comments are closed

RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS

Posted: August 19, 2024 | Author: jarxiv

Abstract: Federated Learning (FL), with mobile phones, desktops, … Continue reading →

Categories: cs.DC, cs.LG | Comments are closed

Efficient Multi-Policy Evaluation for Reinforcement Learning

Posted: August 19, 2024 | Author: jarxiv

Abstract: To evaluate multiple target policies without bias, the dominant … among RL practitioners … Continue reading →

Categories: cs.LG | Comments are closed
