「cs.CV」カテゴリーアーカイブ

Computational Trichromacy Reconstruction: Empowering the Color-Vision Deficient to Recognize Colors Using Augmented Reality

投稿日: 2024年9月27日作成者: jarxiv

要約私たちは、色覚異常 (CVD) を持つ人々が色の認識/名前付けを支援する支 … 続きを読む →

カテゴリー: cs.CV, cs.HC | コメントを受け付けていません

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

投稿日: 2024年9月27日作成者: jarxiv

要約スタイル転送技術は 2D 画像の様式化のために十分に開発されていますが、こ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

投稿日: 2024年9月27日作成者: jarxiv

要約パーソナライズされたテキストから画像への生成方法は、参照画像に基づいてカス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Neural Light Spheres for Implicit Image Stitching and View Synthesis

投稿日: 2024年9月27日作成者: jarxiv

要約パノラマは、キャプチャするのが難しく、携帯電話の画面に表示するのが難しいた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model

投稿日: 2024年9月27日作成者: jarxiv

要約相補的な知覚情報を共有することにより、複数のエージェントが協力して知覚する … 続きを読む →

カテゴリー: cs.CV, cs.MA | コメントを受け付けていません

Manydepth2: Motion-Aware Self-Supervised Monocular Depth Estimation in Dynamic Scenes

投稿日: 2024年9月27日作成者: jarxiv

要約自己監視型単眼奥行き推定の進歩にもかかわらず、静的な世界についての仮定に依 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Perturb, Attend, Detect and Localize (PADL): Robust Proactive Image Defense

投稿日: 2024年9月27日作成者: jarxiv

要約画像操作の検出と位置特定は、生成モデル (GM) の普及により、研究コミュ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition

投稿日: 2024年9月27日作成者: jarxiv

要約自己教師ありスケルトンベースのアクション認識では、効果的なマスキングを通じ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The Hard Positive Truth about Vision-Language Compositionality

投稿日: 2024年9月27日作成者: jarxiv

要約いくつかのベンチマークは、私たちの最良のビジョン言語モデル (CLIP な … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Jumping through Local Minima: Quantization in the Loss Landscape of Vision Transformers

投稿日: 2024年9月27日作成者: jarxiv

要約量子化スケールとビット幅は、ニューラルネットワークの量子化方法を検討する … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Computational Trichromacy Reconstruction: Empowering the Color-Vision Deficient to Recognize Colors Using Augmented Reality

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Neural Light Spheres for Implicit Image Stitching and View Synthesis

CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model

Manydepth2: Motion-Aware Self-Supervised Monocular Depth Estimation in Dynamic Scenes

Perturb, Attend, Detect and Localize (PADL): Robust Proactive Image Defense

Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition

The Hard Positive Truth about Vision-Language Compositionality

Jumping through Local Minima: Quantization in the Loss Landscape of Vision Transformers

最近の投稿

最近のコメント

アーカイブ

カテゴリー