"cs.CV" Category Archive

Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding

Posted: October 21, 2024 | Author: jarxiv

Abstract: In video understanding, current multimodal large language models (MLLMs) …

Categories: cs.CV

Less is More: Selective Reduction of CT Data for Self-Supervised Pre-Training of Deep Learning Models with Contrastive Learning Improves Downstream Classification Performance

Posted: October 21, 2024 | Author: jarxiv

Abstract: Self-supervised pre-training of deep learning models using contrastive learning, image analysis …

Categories: cs.AI, cs.CV, eess.IV

A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification

Posted: October 21, 2024 | Author: jarxiv

Abstract: Acute lymphoblastic leukemia (ALL) is the most malignant form of leukemia, and adult …

Categories: cs.CV, eess.IV

Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior

Posted: October 21, 2024 | Author: jarxiv

Abstract: The Skinned Multi-Person Linear (SMPL) model, 3D human pose estimation …

Categories: cs.CV

Fundus to Fluorescein Angiography Video Generation as a Retinal Generative Foundation Model

Posted: October 21, 2024 | Author: jarxiv

Abstract: Fundus fluorescein angiography (FFA), in diagnosing and monitoring retinal vascular problems, …

Categories: cs.CV

MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts

Posted: October 21, 2024 | Author: jarxiv

Abstract: Sparse Mixture of Experts (SMoE) …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, stat.ML

Scalable Drift Monitoring in Medical Imaging AI

Posted: October 21, 2024 | Author: jarxiv

Abstract: The integration of artificial intelligence (AI) into medical imaging has advanced clinical diagnostics, but …

Categories: cs.CV, cs.LG, eess.IV

IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera

Posted: October 21, 2024 | Author: jarxiv

Abstract: For novel view synthesis, implicit neural representations and explicit 3D Gaussian Splatting …

Categories: cs.CV

Harnessing Shared Relations via Multimodal Mixup Contrastive Learning for Multimodal Classification

Posted: October 21, 2024 | Author: jarxiv

Abstract: Deep multimodal learning leverages contrastive learning for explicit one-to- … between modalities

Categories: cs.AI, cs.CV

Movie101v2: Improved Movie Narration Benchmark

Posted: October 21, 2024 | Author: jarxiv

Abstract: To assist visually impaired viewers, automatic movie narration, aligned with the video, …

Categories: cs.CL, cs.CV, cs.MM
