「cs.AI」カテゴリーアーカイブ

EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

投稿日: 2024年4月17日作成者: jarxiv

要約対比学習における主な課題は、データのより適切なエンコードを学習するために、 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, math.OC | コメントを受け付けていません

E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data

投稿日: 2024年4月17日作成者: jarxiv

要約生成 AI が急速に進歩するにつれて、新しい合成画像ジェネレーターが急速な … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Private Attribute Inference from Images with Vision-Language Models

投稿日: 2024年4月17日作成者: jarxiv

要約大規模言語モデル (LLM) が日常業務やデジタルインタラクションの至る … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

LoopAnimate: Loopable Salient Object Animation

投稿日: 2024年4月17日作成者: jarxiv

要約拡散モデルに基づくビデオ生成の研究は急速に進んでいます。ただし、オブジェ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM

投稿日: 2024年4月17日作成者: jarxiv

要約高密度同時ローカライゼーションおよびマッピング (SLAM) は、ロボット … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Mixed Prototype Consistency Learning for Semi-supervised Medical Image Segmentation

投稿日: 2024年4月17日作成者: jarxiv

要約最近、半教師あり医療画像セグメンテーションにおいてプロトタイプ学習が登場し … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

投稿日: 2024年4月17日作成者: jarxiv

要約拡散モデルは、テキストから画像への生成において顕著な機能を発揮しました。 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

投稿日: 2024年4月17日作成者: jarxiv

要約気道関連の定量的画像バイオマーカーは、肺疾患の検査、診断、予後にとって重要 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

COMBO: Compositional World Models for Embodied Multi-Agent Cooperation

投稿日: 2024年4月17日作成者: jarxiv

要約この論文では、部分的な自己中心的な世界観しか与えられない場合、分散型エージ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MA | コメントを受け付けていません

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

投稿日: 2024年4月17日作成者: jarxiv

要約ほとんどのマルチモーダル大規模言語モデル (MLLM) は、因果関係のある … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data

Private Attribute Inference from Images with Vision-Language Models

LoopAnimate: Loopable Salient Object Animation

SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM

Mixed Prototype Consistency Learning for Semi-supervised Medical Image Segmentation

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

COMBO: Compositional World Models for Embodied Multi-Agent Cooperation

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

最近の投稿

最近のコメント

アーカイブ

カテゴリー