「cs.CV」カテゴリーアーカイブ

Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method

投稿日: 2024年9月2日作成者: jarxiv

要約この論文では、光の吸収、散乱、その他の課題に悩まされる水中イメージングを強 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

投稿日: 2024年9月2日作成者: jarxiv

要約布などの変形可能な物体を折りたたんだり、ドレープしたり、位置を変更したりす … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

LSMS: Language-guided Scale-aware MedSegmentor for Medical Image Referring Segmentation

投稿日: 2024年9月2日作成者: jarxiv

要約従来の医用画像セグメンテーション方法は、医師が診断や治療のために特定の病変 … 続きを読む →

カテゴリー: cs.CV, I.4.6 | コメントを受け付けていません

A Permuted Autoregressive Approach to Word-Level Recognition for Urdu Digital Text

投稿日: 2024年9月2日作成者: jarxiv

要約この研究論文では、デジタルウルドゥー語テキスト向けに特別に設計された新し … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Look, Learn and Leverage (L$^3$): Mitigating Visual-Domain Shift and Discovering Intrinsic Relations via Symbolic Alignment

投稿日: 2024年9月2日作成者: jarxiv

要約最新の深層学習モデルは、視覚的な外観と本質的な関係 (因果構造など) デー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields

投稿日: 2024年9月2日作成者: jarxiv

要約ガウススプラッティングは、リアルタイムで高いレンダリングパフォーマンス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition

投稿日: 2024年9月2日作成者: jarxiv

要約知識蒸留 (KD) 戦略の機能を活用して、最近の顔認識データセットの撤回に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes

投稿日: 2024年9月2日作成者: jarxiv

要約医療画像のセマンティックセグメンテーションは、病気の診断や治療計画などの … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Open-vocabulary Temporal Action Localization using VLMs

投稿日: 2024年9月2日作成者: jarxiv

要約ビデオアクションのローカリゼーションは、長いビデオから特定のアクションの … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

CinePreGen: Camera Controllable Video Previsualization via Engine-powered Diffusion

投稿日: 2024年9月2日作成者: jarxiv

要約ビデオ生成 AI モデル (SORA など) の進歩に伴い、クリエイターは … 続きを読む →

カテゴリー: cs.CV, cs.HC | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method

DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

LSMS: Language-guided Scale-aware MedSegmentor for Medical Image Referring Segmentation

A Permuted Autoregressive Approach to Word-Level Recognition for Urdu Digital Text

Look, Learn and Leverage (L$^3$): Mitigating Visual-Domain Shift and Discovering Intrinsic Relations via Symbolic Alignment

RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields

How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition

Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes

Open-vocabulary Temporal Action Localization using VLMs

CinePreGen: Camera Controllable Video Previsualization via Engine-powered Diffusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー