「cs.LG」カテゴリーアーカイブ

Task Vectors are Cross-Modal

投稿日: 2024年10月30日作成者: jarxiv

要約私たちは、視覚と言語モデル (VLM) の内部表現と、VLM がタスク表現 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Local Policies Enable Zero-shot Long-horizon Manipulation

投稿日: 2024年10月30日作成者: jarxiv

要約ロボット操作用の Sim2real は、複雑な接触をシミュレートし、現実的 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

EMOCPD: Efficient Attention-based Models for Computational Protein Design Using Amino Acid Microenvironment

投稿日: 2024年10月30日作成者: jarxiv

要約計算タンパク質設計 (CPD) とは、タンパク質を設計するための計算手法の … 続きを読む →

カテゴリー: cs.AI, cs.LG, q-bio.BM | コメントを受け付けていません

An Effective Theory of Bias Amplification

投稿日: 2024年10月30日作成者: jarxiv

要約機械学習モデルはデータに存在するバイアスを捉えて増幅する可能性があり、その … 続きを読む →

カテゴリー: cs.CY, cs.LG, stat.ML | コメントを受け付けていません

Aligning Text-to-Image Diffusion Models with Reward Backpropagation

投稿日: 2024年10月30日作成者: jarxiv

要約テキストから画像への拡散モデルは、非常に大規模な教師なしまたは弱く教師付き … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar

投稿日: 2024年10月29日作成者: jarxiv

要約 3D 占有ベースの認識パイプラインは、詳細なシーンの説明をキャプチャし、さ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets

投稿日: 2024年10月29日作成者: jarxiv

要約制約付き強化学習は、報酬と制約の両方が考慮される安全性が重要な分野で有望な … 続きを読む →

カテゴリー: 68T01, cs.LG, cs.RO, I.2.6 | コメントを受け付けていません

AI Olympics challenge with Evolutionary Soft Actor Critic

投稿日: 2024年10月29日作成者: jarxiv

要約次のレポートでは、IROS 2024 で開催される AI オリンピック競技 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE, cs.RO | コメントを受け付けていません

RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences

投稿日: 2024年10月29日作成者: jarxiv

要約好みに基づく強化学習 (PbRL) は、人間の好みを報酬シグナルとして利用 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Reference-Free Formula Drift with Reinforcement Learning: From Driving Data to Tire Energy-Inspired, Real-World Policies

投稿日: 2024年10月29日作成者: jarxiv

要約車をドリフトさせるスキル、つまりプロのドライバーのように制御されたオーバー … 続きを読む →

カテゴリー: cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Task Vectors are Cross-Modal

Local Policies Enable Zero-shot Long-horizon Manipulation

EMOCPD: Efficient Attention-based Models for Computational Protein Design Using Amino Acid Microenvironment

An Effective Theory of Bias Amplification

Aligning Text-to-Image Diffusion Models with Reward Backpropagation

RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar

Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets

AI Olympics challenge with Evolutionary Soft Actor Critic

RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences

Reference-Free Formula Drift with Reinforcement Learning: From Driving Data to Tire Energy-Inspired, Real-World Policies

最近の投稿

最近のコメント

アーカイブ

カテゴリー