「cs.CV」カテゴリーアーカイブ

EgoBlind: Towards Egocentric Visual Assistance for the Blind

投稿日: 2025年6月19日作成者: jarxiv

要約視覚障害者から収集された最初のエゴセントリックビデオデータセットであるeg … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

A Comprehensive Survey on Continual Learning in Generative Models

投稿日: 2025年6月19日作成者: jarxiv

要約生成モデルの急速な進歩により、最新のAIシステムは、特定のドメインで人間レ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Exploring Personalized Federated Learning Architectures for Violence Detection in Surveillance Videos

投稿日: 2025年6月19日作成者: jarxiv

要約都市監視システムにおける暴力事件を検出するという課題は、ビデオデータの膨大 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

投稿日: 2025年6月19日作成者: jarxiv

要約最新のビジョン言語モデル（VLM）は、視覚的な推論を必要とする幅広いタスク … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation

投稿日: 2025年6月19日作成者: jarxiv

要約後期ガドリニウム増強（LGE）心臓MRIからの深い学習ベースの心筋瘢痕セグ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation

投稿日: 2025年6月19日作成者: jarxiv

要約ミリ波レーダーを使用した高密度のメートリック深度推定には、通常、マルチフレ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention

投稿日: 2025年6月19日作成者: jarxiv

要約癌は異常な成長であり、局所的に侵入し、遠い臓器に転移する可能性があります。 … 続きを読む →

カテゴリー: cs.CV, eess.IV, I.2.6 | コメントを受け付けていません

Control and Realism: Best of Both Worlds in Layout-to-Image without Training

投稿日: 2025年6月19日作成者: jarxiv

要約レイアウトからイメージの生成は、被験者の配置と配置を正確に制御する複雑なシ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Show-o2: Improved Native Unified Multimodal Models

投稿日: 2025年6月19日作成者: jarxiv

要約このホワイトペーパーでは、自動網性モデリングとフローマッチングを活用する改 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Baltimore Atlas: FreqWeaver Adapter for Semi-supervised Ultra-high Spatial Resolution Land Cover Classification

投稿日: 2025年6月19日作成者: jarxiv

要約超高空間解像度の土地被覆分類は、きめ細かい土地被覆分析には不可欠ですが、ピ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

EgoBlind: Towards Egocentric Visual Assistance for the Blind

A Comprehensive Survey on Continual Learning in Generative Models

Exploring Personalized Federated Learning Architectures for Violence Detection in Surveillance Videos

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

CLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation

RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation

Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention

Control and Realism: Best of Both Worlds in Layout-to-Image without Training

Show-o2: Improved Native Unified Multimodal Models

Baltimore Atlas: FreqWeaver Adapter for Semi-supervised Ultra-high Spatial Resolution Land Cover Classification

最近の投稿

最近のコメント

アーカイブ

カテゴリー