月別アーカイブ: 2024年5月

Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning

投稿日: 2024年5月21日作成者: jarxiv

要約強化学習に基づいてタスク指向の対話エージェントをトレーニングするには時間が … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Learning Force Control for Legged Manipulation

投稿日: 2024年5月21日作成者: jarxiv

要約インタラクション中の接触力を制御することは、移動や操作のタスクにとって重要 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space

投稿日: 2024年5月21日作成者: jarxiv

要約深層強化学習 (DRL) アルゴリズムでは、シミュレーションと現実世界の間 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

A review on the use of large language models as virtual tutors

投稿日: 2024年5月21日作成者: jarxiv

要約 Transformer アーキテクチャは、自然言語処理の長期的な依存関係の … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Non-autoregressive Generative Models for Reranking Recommendation

投稿日: 2024年5月21日作成者: jarxiv

要約現代のレコメンデーションシステムは、ユーザーの特定の要求や興味に合わせた … 続きを読む →

カテゴリー: cs.AI, cs.IR | コメントを受け付けていません

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models

投稿日: 2024年5月21日作成者: jarxiv

要約研究を自律的に推進する技術は計算科学発見で顕著ですが、合成生物学は有用な目 … 続きを読む →

カテゴリー: cs.AI, cs.LG, q-bio.MN | コメントを受け付けていません

Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning

投稿日: 2024年5月21日作成者: jarxiv

要約オフラインメタ強化学習 (OMRL) は、事前に収集されたデータとメタ学 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

A New Baseline Assumption of Integated Gradients Based on Shaply value

投稿日: 2024年5月21日作成者: jarxiv

要約ディープニューラルネットワーク (DNN) をデコードする作業には、多 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

KG-RAG: Bridging the Gap Between Knowledge and Creativity

投稿日: 2024年5月21日作成者: jarxiv

要約大規模言語モデルエージェント (LMA) の創造的な能力を維持しながら事 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

投稿日: 2024年5月21日作成者: jarxiv

要約大規模言語モデル (LLM) は、幅広いタスクで顕著なパフォーマンスを達成 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年5月

Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning

Learning Force Control for Legged Manipulation

Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space

A review on the use of large language models as virtual tutors

Non-autoregressive Generative Models for Reranking Recommendation

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models

Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning

A New Baseline Assumption of Integated Gradients Based on Shaply value

KG-RAG: Bridging the Gap Between Knowledge and Creativity

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー