Monthly Archive: August 2024

Rethinking Video with a Universal Event-Based Representation

Posted on August 13, 2024 by jarxiv

Abstract: Traditionally, video has been structured as a sequence of discrete image frames. However … Read more →

Categories: cs.CV, cs.MM

Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning

Posted on August 13, 2024 by jarxiv

Abstract: Visual storytelling systems, from a sequence of images, multi-sentence … Read more →

Categories: cs.CL, cs.CV

CT evaluation of 2D and 3D holistic deep learning methods for the volumetric segmentation of airway lesions

Posted on August 13, 2024 by jarxiv

Abstract: This study focuses on cystic fibrosis (CF) lesions, with 2D-format and 3D … Read more →

Categories: cs.CV, cs.LG, eess.IV

Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering

Posted on August 13, 2024 by jarxiv

Abstract: 3D Gaussian Splatting (3DGS), with its excellent rendering … Read more →

Categories: cs.CV

Toward a Surgeon-in-the-Loop Ophthalmic Robotic Apprentice using Reinforcement and Imitation Learning

Posted on August 13, 2024 by jarxiv

Abstract: Robot-assisted surgical systems, in improving surgical precision and minimizing human error, … Read more →

Categories: cs.CV, cs.HC, cs.LG, cs.RO

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

Posted on August 13, 2024 by jarxiv

Abstract: In this work, for understanding 3D open-vocabulary scenes, a new 3 … Read more →

Categories: cs.CV

Finding Patterns in Ambiguity: Interpretable Stress Testing in the Decision Boundary

Posted on August 13, 2024 by jarxiv

Abstract: Due to the growing use of deep learning across various domains, … Read more →

Categories: cs.CV, cs.LG

Long-Form Answers to Visual Questions from Blind and Low Vision People

Posted on August 13, 2024 by jarxiv

Abstract: Vision language models, long-form answers to questions about images, that is, long-form … Read more →

Categories: cs.CL, cs.CV

From SAM to SAM 2: Exploring Improvements in Meta’s Segment Anything Model

Posted on August 13, 2024 by jarxiv

Abstract: In April 2023, by Meta, to the computer vision … Read more →

Categories: cs.CV

EqNIO: Subequivariant Neural Inertial Odometry

Posted on August 13, 2024 by jarxiv

Abstract: Currently, neural networks, from inertial measurement unit (IMU) data, … Read more →

Categories: cs.CV, cs.RO
