「cs.AI」カテゴリーアーカイブ

Safeguard is a Double-edged Sword: Denial-of-service Attack on Large Language Models

投稿日: 2024年10月24日作成者: jarxiv

要約安全性は、オープン展開における大規模言語モデル (LLM) の最大の関心事 … 続きを読む →

カテゴリー: cs.AI, cs.CR | コメントを受け付けていません

Explaining Bayesian Networks in Natural Language using Factor Arguments. Evaluation in the medical domain

投稿日: 2024年10月24日作成者: jarxiv

要約この論文では、因子引数の観点からベイジアンネットワーク推論の自然言語説明 … 続きを読む →

カテゴリー: cs.AI, cs.LO | コメントを受け付けていません

Utilitarian Algorithm Configuration for Infinite Parameter Spaces

投稿日: 2024年10月24日作成者: jarxiv

要約功利主義的アルゴリズム構成は、特定のアルゴリズムのパラメーター空間を自動的 … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

投稿日: 2024年10月24日作成者: jarxiv

要約報酬ベースの微調整は、言語ポリシーを意図された行動 (創造性や安全性など) … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation

投稿日: 2024年10月24日作成者: jarxiv

要約長期記憶は、チャットボットや対話システム (DS) が一貫性のある人間のよ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

投稿日: 2024年10月24日作成者: jarxiv

要約 Rotary Positional Embeddings (RoPE) は … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Physical Reasoning and Object Planning for Household Embodied Agents

投稿日: 2024年10月24日作成者: jarxiv

要約この研究では、代替オブジェクトを選択する複雑なタスクに特に重点を置き、堅牢 … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking

投稿日: 2024年10月24日作成者: jarxiv

要約複雑な目標を正確に指定することは難しいため、強化学習ポリシーは多くの場合、 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Training Free Guided Flow Matching with Optimal Control

投稿日: 2024年10月24日作成者: jarxiv

要約事前トレーニングされた拡散モデルとフローマッチングモデルを使用した制御され … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

投稿日: 2024年10月24日作成者: jarxiv

要約教師なし事前トレーニングは、多くの教師ありドメインで変革をもたらしました。 … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Safeguard is a Double-edged Sword: Denial-of-service Attack on Large Language Models

Explaining Bayesian Networks in Natural Language using Factor Arguments. Evaluation in the medical domain

Utilitarian Algorithm Configuration for Infinite Parameter Spaces

Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation

Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

Physical Reasoning and Object Planning for Household Embodied Agents

Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking

Training Free Guided Flow Matching with Optimal Control

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

最近の投稿

最近のコメント

アーカイブ

カテゴリー