「cs.AI」カテゴリーアーカイブ

Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise

投稿日: 2024年10月7日作成者: jarxiv

要約近年、ノイズ除去拡散モデルの研究は、画像復元の分野にも応用を広げている。従 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Variational Bayes Gaussian Splatting

投稿日: 2024年10月7日作成者: jarxiv

要約近年、3Dガウススプラッティングは、ガウスの混合を使用して3Dシーンをモデ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Unraveling the Truth: Do VLMs really Understand Charts? A Deep Dive into Consistency and Robustness

投稿日: 2024年10月7日作成者: jarxiv

要約図表質問応答（CQA）は、視覚言語理解の重要な分野である。しかし、この分野 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.HC, cs.LG | コメントを受け付けていません

AID: Attention Interpolation of Text-to-Image Diffusion

投稿日: 2024年10月7日作成者: jarxiv

要約条件拡散モデルは、様々な環境において未見の画像を作成し、画像補間を支援する … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs

投稿日: 2024年10月7日作成者: jarxiv

要約今日のロボットシミュレーションは、多様なシミュレーションタスクとシーンを作 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Estimating Body and Hand Motion in an Ego-sensed World

投稿日: 2024年10月7日作成者: jarxiv

要約我々は、ヘッドマウントデバイスから人間の動きを推定するシステムEgoAll … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

投稿日: 2024年10月7日作成者: jarxiv

要約 LLM-as-a-Judgeは、様々なベンチマークにおける評価手法として広 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage

投稿日: 2024年10月7日作成者: jarxiv

要約ジェネラリスト型ウェブエージェントは、実際のウェブサイト上で様々なタスクを … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG | コメントを受け付けていません

Scaling Manipulation Learning with Visual Kinematic Chain Prediction

投稿日: 2024年10月4日作成者: jarxiv

要約多様なデータセットから汎用モデルを学習することは、機械学習において大きな成 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own

投稿日: 2024年10月4日作成者: jarxiv

要約強化学習（RL）は、ロボットの操作タスクを解決するための有望なアプローチで … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise

Variational Bayes Gaussian Splatting

Unraveling the Truth: Do VLMs really Understand Charts? A Deep Dive into Consistency and Robustness

AID: Attention Interpolation of Text-to-Image Diffusion

GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs

Estimating Body and Hand Motion in an Ego-sensed World

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage

Scaling Manipulation Learning with Visual Kinematic Chain Prediction

Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own

最近の投稿

最近のコメント

アーカイブ

カテゴリー