「cs.AI」カテゴリーアーカイブ

Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs

投稿日: 2024年10月16日作成者: jarxiv

要約欧州連合の 24 の公用語すべてをサポートすることで、ヨーロッパの言語の多 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning

投稿日: 2024年10月16日作成者: jarxiv

要約この論文では、部分的に観察可能なオンライン強化学習用に設計されたトランスフ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Need of AI in Modern Education: in the Eyes of Explainable AI (xAI)

投稿日: 2024年10月16日作成者: jarxiv

要約現代教育は AI なしでは \textit{現代} ではありません。ただ … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Predicting from Strings: Language Model Embeddings for Bayesian Optimization

投稿日: 2024年10月16日作成者: jarxiv

要約ベイジアン最適化は、検索効率を向上させるための実験計画やブラックボックス最 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language Models

投稿日: 2024年10月16日作成者: jarxiv

要約言語モデル (LM) という用語は、対象となるモデルの時間固有のコレクショ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

LoRD: Adapting Differentiable Driving Policies to Distribution Shifts

投稿日: 2024年10月16日作成者: jarxiv

要約運用ドメイン間の分布の変化は、自動運転車 (SDV) の学習モデルのパフォ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies

投稿日: 2024年10月16日作成者: jarxiv

要約強化学習とシミュレーションからリアルへの変換を組み合わせることで、脚式ロボ … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Autonomous Improvement of Instruction Following Skills via Foundation Models

投稿日: 2024年10月16日作成者: jarxiv

要約自律的に収集された経験から改善できるインテリジェントな指示従うロボットには … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions

投稿日: 2024年10月16日作成者: jarxiv

要約強化学習では、DDPG や TD3 などのオフポリシーアクタークリティカ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO, stat.ML | コメントを受け付けていません

LoRA-Pro: Are Low-Rank Adapters Properly Optimized?

投稿日: 2024年10月16日作成者: jarxiv

要約 LoRA としても知られる低ランク適応は、基礎モデルをパラメーター効率よく … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs

AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning

Need of AI in Modern Education: in the Eyes of Explainable AI (xAI)

Predicting from Strings: Language Model Embeddings for Bayesian Optimization

What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language Models

LoRD: Adapting Differentiable Driving Policies to Distribution Shifts

Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies

Autonomous Improvement of Instruction Following Skills via Foundation Models

Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions

LoRA-Pro: Are Low-Rank Adapters Properly Optimized?

最近の投稿

最近のコメント

アーカイブ

カテゴリー