Category archive: cs.CV

Explaining the Impact of Training on Vision Models via Activation Clustering

Posted on February 20, 2025 by jarxiv

Abstract: Recent developments in the field of explainable artificial intelligence (XAI) for vision models …

Categories: cs.CV, cs.LG

Image compositing is all you need for data augmentation

Posted on February 20, 2025 by jarxiv

Abstract: In this paper, the performance of object detection models is considered with respect to various …

Categories: cs.CV, cs.LG

A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models

Posted on February 20, 2025 by jarxiv

Abstract: For large vision and language models pretrained on large-scale data, visual …

Categories: cs.CV

Carefully Blending Adversarial Training, Purification, and Aggregation Improves Adversarial Robustness

Posted on February 20, 2025 by jarxiv

Abstract: In this work, a novel adversarial defense mechanism for image classification …

Categories: cs.AI, cs.CR, cs.CV, cs.LG

GPU-Friendly Laplacian Texture Blending

Posted on February 20, 2025 by jarxiv

Abstract: Texture and material blending adds diversity to rendered virtual worlds, …

Categories: cs.CV, cs.GR

High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior

Posted on February 20, 2025 by jarxiv

Abstract: In this paper, for generating high-quality 3D models from a single image, a novel two-stage …

Categories: cs.AI, cs.CV

IP-Composer: Semantic Composition of Visual Concepts

Posted on February 20, 2025 by jarxiv

Abstract: Content creators often draw inspiration from multiple visual sources …

Categories: cs.CV, cs.GR

IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras

Posted on February 20, 2025 by jarxiv

Abstract: 360$^\circ$ cameras for 3D mapping and rendering of indoor environments …

Categories: cs.CV

A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects

Posted on February 20, 2025 by jarxiv

Abstract: Many everyday mobile manipulation tasks, such as grasping a knob to open a cabinet, …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Posted on February 20, 2025 by jarxiv

Abstract: Image tokenization yields, in a form more efficient to process than raw pixels, a compressed discrete …

Categories: cs.CV, cs.LG
