Category Archives: cs.AI

Nemotron-4 340B Technical Report

Posted: June 18, 2024 | Author: jarxiv

Abstract: Nemotron-4-340B-Base, Nemotron-4-340B- …

Categories: cs.AI, cs.CL, cs.LG

Refusal in Language Models Is Mediated by a Single Direction

Posted: June 18, 2024 | Author: jarxiv

Abstract: Conversational large language models are fine-tuned for both instruction-following and safety …

Categories: cs.AI, cs.CL, cs.LG

Reward Machines for Deep RL in Noisy and Uncertain Environments

Posted: June 18, 2024 | Author: jarxiv

Abstract: For instructions, safety constraints, and other temporally extended reward-worthy behaviors, reward machines …

Categories: cs.AI, cs.FL, cs.LG, F.4.3

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity

Posted: June 18, 2024 | Author: jarxiv

Abstract: To understand alignment techniques, the instruction-tuning-induced zero-shot …

Categories: cs.AI, cs.CL, cs.LG

Social Environment Design

Posted: June 18, 2024 | Author: jarxiv

Abstract: Artificial intelligence (AI), a technology that can be used to improve government and economic policy-making …

Categories: cs.AI, econ.GN, q-fin.EC, stat.ML

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Posted: June 18, 2024 | Author: jarxiv

Abstract: Of the main driving forces behind the strong performance of large language models (LLMs) …

Categories: cs.AI, cs.CL

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions

Posted: June 18, 2024 | Author: jarxiv

Abstract: With recent advances in general-purpose AI, intended goals, ethical principles, and individuals' and groups' …

Categories: cs.AI, cs.CL, cs.HC

Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

Posted: June 18, 2024 | Author: jarxiv

Abstract: When planning, humans imagine the goal state and carry out actions in line with that goal …

Categories: cs.AI, cs.LG, cs.RO

Transcendence: Generative Models Can Outperform The Experts That Train Them

Posted: June 18, 2024 | Author: jarxiv

Abstract: For generative models, the conditional probability distribution induced by the data they are trained on …

Categories: cs.AI, cs.LG

DustNet: skillful neural network predictions of Saharan dust

Posted: June 18, 2024 | Author: jarxiv

Abstract: Millions of tons of mineral dust are suspended in the atmosphere, interacting with weather and climate …

Categories: 86-06 (Primary), 86A10 (Secondary), cs.AI, I.2.1, physics.ao-ph, physics.data-an, physics.geo-ph
