「cs.CV」カテゴリーアーカイブ

Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration

投稿日: 2024年11月15日作成者: jarxiv

要約近年、アテンションメカニズムにより、主要な特徴情報に焦点を当てることによ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Information-driven design of imaging systems

投稿日: 2024年11月15日作成者: jarxiv

要約最新の画像システムのほとんどは、人間が見る前に、または人間が見る代わりに、 … 続きを読む →

カテゴリー: cs.CV, cs.IT, eess.IV, math.IT, physics.data-an, physics.optics | コメントを受け付けていません

Vision-based Manipulation of Transparent Plastic Bags in Industrial Setups

投稿日: 2024年11月15日作成者: jarxiv

要約この論文では、インダストリー 4.0 パラダイムに沿って、産業環境における … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

One-Shot Manipulation Strategy Learning by Making Contact Analogies

投稿日: 2024年11月15日作成者: jarxiv

要約我々は、新しいオブジェクトへの高速かつ広範な一般化を伴う操作戦略のワンショ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Super-resolution multi-contrast unbiased eye atlases with deep probabilistic refinement

投稿日: 2024年11月15日作成者: jarxiv

要約目的: 目の形態、特に眼窩と視神経は集団によって大きく異なります。これら … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling

投稿日: 2024年11月15日作成者: jarxiv

要約近年、トランスフォーマーコンポーネントで構成される深層学習モデルにより、医 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models

投稿日: 2024年11月15日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、さまざまなタスクにわたる … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants

投稿日: 2024年11月15日作成者: jarxiv

要約画像から植物の 3D デジタルツインを自動的に構築する機能は、農業、環境 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

投稿日: 2024年11月15日作成者: jarxiv

要約従来の通念では、ビジョントランスフォーマー (ViT) を事前トレーニン … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.NE | コメントを受け付けていません

MagicQuill: An Intelligent Interactive Image Editing System

投稿日: 2024年11月15日作成者: jarxiv

要約画像編集にはさまざまな複雑なタスクが含まれており、効率的かつ正確な操作技術 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration

Information-driven design of imaging systems

Vision-based Manipulation of Transparent Plastic Bags in Industrial Setups

One-Shot Manipulation Strategy Learning by Making Contact Analogies

Super-resolution multi-contrast unbiased eye atlases with deep probabilistic refinement

I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models

CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

MagicQuill: An Intelligent Interactive Image Editing System

最近の投稿

最近のコメント

アーカイブ

カテゴリー