「cs.AI」カテゴリーアーカイブ

Efficient Dynamic Shielding for Parametric Safety Specifications

投稿日: 2025年5月29日作成者: jarxiv

要約シールドは、AI制御された自律システムの安全性を確保するための有望なアプロ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.LO, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning

投稿日: 2025年5月29日作成者: jarxiv

要約モデルベースの強化学習（MBRL）の最近の進歩は、強力な拡散ワールドモデル … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

A Physics-Informed Machine Learning Framework for Safe and Optimal Control of Autonomous Systems

投稿日: 2025年5月29日作成者: jarxiv

要約自律システムが日常生活でより遍在するようになるにつれて、安全性を保証する高 … 続きを読む →

カテゴリー: cs.AI, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Sample Efficient Robot Learning in Supervised Effect Prediction Tasks

投稿日: 2025年5月29日作成者: jarxiv

要約自己教師のロボット学習では、エージェントは環境との積極的な相互作用、エネル … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts

投稿日: 2025年5月29日作成者: jarxiv

要約このペーパーでは、性別の職業代名詞を中和することにより医学文献で使用される … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

投稿日: 2025年5月29日作成者: jarxiv

要約トレーニング後の段階でのマルチモーダル大手言語モデル（MLLMS）の改善は … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

How Do LLMs Perform Two-Hop Reasoning in Context?

投稿日: 2025年5月29日作成者: jarxiv

要約「ソクラテスは人間です。すべての人間は致命的です。したがって、ソクラテ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Human-Centered Human-AI Collaboration (HCHAC)

投稿日: 2025年5月29日作成者: jarxiv

要約インテリジェントな時代において、人間とインテリジェントシステムとの相互作用 … 続きを読む →

カテゴリー: cs.AI, cs.CY, cs.HC | コメントを受け付けていません

Position: Don’t Use the CLT in LLM Evals With Fewer Than a Few Hundred Datapoints

投稿日: 2025年5月29日作成者: jarxiv

要約有効なエラーバーや有意性テストを含む、大規模な言語モデル（LLM）の厳密な … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

Learned Collusion

投稿日: 2025年5月29日作成者: jarxiv

要約 Qラーニングは、利用可能な各アクションに関連付けられた継続値の推定値（Q値 … 続きを読む →

カテゴリー: cs.AI, cs.GT, econ.TH | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Efficient Dynamic Shielding for Parametric Safety Specifications

JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning

A Physics-Informed Machine Learning Framework for Safe and Optimal Control of Autonomous Systems

Sample Efficient Robot Learning in Supervised Effect Prediction Tasks

Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

How Do LLMs Perform Two-Hop Reasoning in Context?

Human-Centered Human-AI Collaboration (HCHAC)

Position: Don’t Use the CLT in LLM Evals With Fewer Than a Few Hundred Datapoints

Learned Collusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー