「cs.AI」カテゴリーアーカイブ

reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use

投稿日: 2024年2月28日作成者: jarxiv

要約大麻使用とそれに伴う大麻使用障害（CUD）の蔓延は、世界的に公衆衛生上の重 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

When Your AI Deceives You: Challenges with Partial Observability of Human Evaluators in Reward Learning

投稿日: 2024年2月28日作成者: jarxiv

要約人間のフィードバックからの強化学習 (RLHF) の過去の分析は、人間が環 … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

Evaluating Very Long-Term Conversational Memory of LLM Agents

投稿日: 2024年2月28日作成者: jarxiv

要約長期にわたるオープンドメインの対話に関する既存の研究は、5 つ以内のチャッ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Preference Ranking Optimization for Human Alignment

投稿日: 2024年2月28日作成者: jarxiv

要約大規模言語モデル (LLM) には誤解を招くコンテンツが含まれることが多く … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Dynamic fairness-aware recommendation through multi-agent social choice

投稿日: 2024年2月28日作成者: jarxiv

要約パーソナライズされたレコメンデーションのコンテキストにおけるアルゴリズムの … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model

投稿日: 2024年2月28日作成者: jarxiv

要約基礎モデルの最近の進歩により、幅広いタスクにわたって優れたパフォーマンスが … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Learning to Program Variational Quantum Circuits with Fast Weights

投稿日: 2024年2月28日作成者: jarxiv

要約量子機械学習 (QML) は、逐次制御タスクと時系列モデリングに対処する先 … 続きを読む →

カテゴリー: cs.AI, cs.ET, cs.LG, cs.NE, quant-ph | コメントを受け付けていません

Accelerating Cutting-Plane Algorithms via Reinforcement Learning Surrogates

投稿日: 2024年2月28日作成者: jarxiv

要約離散最適化は、混合整数計画法や組み合わせ最適化などの分野にわたる一連の $ … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.OC | コメントを受け付けていません

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

投稿日: 2024年2月28日作成者: jarxiv

要約何十年もの間、人間とコンピューターのやり取りは基本的に手動で行われてきまし … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.HC | コメントを受け付けていません

Implicit Visual Bias Mitigation by Posterior Estimate Sharpening of a Bayesian Neural Network

投稿日: 2024年2月28日作成者: jarxiv

要約ディープニューラルネットワークの公平性は、データセットのバイアスと偽の … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use

When Your AI Deceives You: Challenges with Partial Observability of Human Evaluators in Reward Learning

Evaluating Very Long-Term Conversational Memory of LLM Agents

Preference Ranking Optimization for Human Alignment

Dynamic fairness-aware recommendation through multi-agent social choice

Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model

Learning to Program Variational Quantum Circuits with Fast Weights

Accelerating Cutting-Plane Algorithms via Reinforcement Learning Surrogates

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Implicit Visual Bias Mitigation by Posterior Estimate Sharpening of a Bayesian Neural Network

最近の投稿

最近のコメント

アーカイブ

カテゴリー