Monthly Archive: March 2024

Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Self-supervised learning, for transfer to target tasks with limited annotations …

Category: cs.CV

Medical Image Registration and Its Application in Retinal Images: A Review

Posted on: March 26, 2024 | Author: jarxiv

Abstract: In medical image registration, images captured at different times, angles, or modalities …

Category: cs.CV

Explaining CLIP’s performance disparities on data from blind/low vision users

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Large multimodal models (LMMs), blind or low vision (B …

Category: cs.CV

Geometric Prior Based Deep Human Point Cloud Geometry Compression

Posted on: March 26, 2024 | Author: jarxiv

Abstract: With the emergence of digital avatars, for human point clouds with realistic and intricate details …

Categories: cs.CV, eess.IV

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Even with the notable process of talking-head-based avatar creation solutions …

Category: cs.CV

Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Over the past few years, generative models, images realistic in both quality and diversity (i.e., facial …

Categories: cs.CR, cs.CV

CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Alzheimer's disease (AD), which causes cognitive and functional decline, is an incurable neuro …

Category: cs.CV

ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Deformable image registration plays an important role in medical image processing …

Category: cs.CV

Open-Set Recognition in the Age of Vision-Language Models

Posted on: March 26, 2024 | Author: jarxiv

Abstract: Vision-language models (VLMs) are trained on internet-scale datasets …

Category: cs.CV

VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting

Posted on: March 26, 2024 | Author: jarxiv

Abstract: By combining CNNs or ViTs with RNNs for spatiotemporal forecasting, …

Category: cs.CV
