「cs.AI」カテゴリーアーカイブ

Pre-training for Recommendation Unlearning

投稿日: 2025年5月29日作成者: jarxiv

要約グラフニューラルネットワーク（GNNS）を搭載した最新の推奨システム（GN … 続きを読む →

カテゴリー: cs.AI, cs.IR, cs.LG | コメントを受け付けていません

Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents

投稿日: 2025年5月29日作成者: jarxiv

要約大規模な言語モデル（LLMS）とチャットボットエージェントは、時々間違った … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Maximizing Confidence Alone Improves Reasoning

投稿日: 2025年5月29日作成者: jarxiv

要約強化学習（RL）により、機械学習モデルが多くの分野で大きな進歩を達成できる … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Fostering Video Reasoning via Next-Event Prediction

投稿日: 2025年5月29日作成者: jarxiv

要約次のトークン予測は、LLMSの推論を可能にする基礎学習タスクとして機能しま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

A Closer Look at Multimodal Representation Collapse

投稿日: 2025年5月29日作成者: jarxiv

要約私たちは、モダリティ崩壊の基本的な理解を開発することを目指しています。これ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models

投稿日: 2025年5月29日作成者: jarxiv

要約環境視覚モデル（VLM）を使用して具体化された視覚追跡（EVT）を強化する … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Thinking with Generated Images

投稿日: 2025年5月29日作成者: jarxiv

要約生成された画像で思考を提示します。これは、中間視覚的思考ステップの自発的な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Scaling-up Perceptual Video Quality Assessment

投稿日: 2025年5月29日作成者: jarxiv

要約データスケーリング法は、さまざまな下流タスクにわたる大規模なマルチモーダル … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Preference Adaptive and Sequential Text-to-Image Generation

投稿日: 2025年5月29日作成者: jarxiv

要約インタラクティブなテキストからイメージ（T2I）生成の問題に対処し、一連の … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.SY, eess.SY | コメントを受け付けていません

PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion

投稿日: 2025年5月29日作成者: jarxiv

要約ビデオデータセットの凝縮は、ディープ学習アプリケーションでの大規模なビデオ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Pre-training for Recommendation Unlearning

Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents

Maximizing Confidence Alone Improves Reasoning

Fostering Video Reasoning via Next-Event Prediction

A Closer Look at Multimodal Representation Collapse

VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models

Thinking with Generated Images

Scaling-up Perceptual Video Quality Assessment

Preference Adaptive and Sequential Text-to-Image Generation

PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion

最近の投稿

最近のコメント

アーカイブ

カテゴリー