「cs.CV」カテゴリーアーカイブ

Image Matching Filtering and Refinement by Planes and Beyond

投稿日: 2024年11月18日作成者: jarxiv

要約この論文では、画像マッチングにおける疎な対応をフィルタリングおよび洗練する … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos

投稿日: 2024年11月18日作成者: jarxiv

要約 Egocentric Hand Object Interaction (H … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

SINETRA: a Versatile Framework for Evaluating Single Neuron Tracking in Behaving Animals

投稿日: 2024年11月18日作成者: jarxiv

要約行動する動物のニューロン活動を正確に追跡することは、複雑な動きと背景ノイズ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Automated Segmentation of Ischemic Stroke Lesions in Non-Contrast Computed Tomography Images for Enhanced Treatment and Prognosis

投稿日: 2024年11月18日作成者: jarxiv

要約脳卒中は世界で 2 番目に多い死因であり、低・中所得国 (LMIC) でま … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling

投稿日: 2024年11月18日作成者: jarxiv

要約近年、トランスフォーマーコンポーネントで構成される深層学習モデルにより、医 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Multimodal Object Detection using Depth and Image Data for Manufacturing Parts

投稿日: 2024年11月15日作成者: jarxiv

要約製造業では、さまざまな種類の製造部品やコンポーネントを正確にピッキングして … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos

投稿日: 2024年11月15日作成者: jarxiv

要約 Egocentric Hand Object Interaction (H … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation

投稿日: 2024年11月15日作成者: jarxiv

要約ビデオ生成モデルの学習に大規模ビデオデータを利用する最近の進歩は、複雑な … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

投稿日: 2024年11月15日作成者: jarxiv

要約視覚-言語-動作 (VLA) モデルは、エンドツーエンドの学習プロセスを通 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey

投稿日: 2024年11月15日作成者: jarxiv

要約マルチモーダル基礎モデルの急速な進化により、テキスト、画像、オーディオ、ビ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Image Matching Filtering and Refinement by Planes and Beyond

UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos

SINETRA: a Versatile Framework for Evaluating Single Neuron Tracking in Behaving Animals

Automated Segmentation of Ischemic Stroke Lesions in Non-Contrast Computed Tomography Images for Enhanced Treatment and Prognosis

I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling

Multimodal Object Detection using Depth and Image Data for Manufacturing Parts

UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos

VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey

最近の投稿

最近のコメント

アーカイブ

カテゴリー