"cs.CV" Category Archive

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Posted on: April 29, 2025 | Author: jarxiv

Abstract: Existing vision-language-action (VLA) models, in zero-shot scenarios, promising …

Categories: cs.AI, cs.CV, cs.RO

CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback

Posted on: April 29, 2025 | Author: jarxiv

Abstract: Score Distillation Sampling (SDS), in text-to-3D content generation, remarkable …

Categories: cs.CV

Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer

Posted on: April 29, 2025 | Author: jarxiv

Abstract: To analyze a table tennis player's technique, the ball's 3D trajectory and spin …

Categories: cs.AI, cs.CV, cs.LG

Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation

Posted on: April 29, 2025 | Author: jarxiv

Abstract: Prototypical part learning, to make semantic segmentation interpretable …

Categories: cs.AI, cs.CV

DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images

Posted on: April 29, 2025 | Author: jarxiv

Abstract: In this paper, using CLIP-ViT and fusion learning, AI-generated …

Categories: cs.CR, cs.CV

Using Fixed and Mobile Eye Tracking to Understand How Visitors View Art in a Museum: A Study at the Bowes Museum, County Durham, UK

Posted on: April 29, 2025 | Author: jarxiv

Abstract: In the following paper, a collaborative project involving researchers at Durham University and Durham, UK …

Categories: cs.CV

Federated Out-of-Distribution Generalization: A Causal Augmentation View

Posted on: April 29, 2025 | Author: jarxiv

Abstract: Federated learning integrates multi-source information, all …

Categories: cs.CV

Interpretable Dynamic Graph Neural Networks for Small Occluded Object Detection and Tracking

Posted on: April 29, 2025 | Author: jarxiv

Abstract: Detection and tracking of small, occluded objects such as pedestrians, cyclists, and motorbikes …

Categories: cs.CV, cs.LG

Enhancing breast cancer detection on screening mammogram using self-supervised learning and a hybrid deep model of Swin Transformer and Convolutional Neural Network

Posted on: April 29, 2025 | Author: jarxiv

Abstract: Purpose: The scarcity of high-quality, curated, labeled medical training data, for breast cancer diagnosis …

Categories: cs.CV

CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition

Posted on: April 29, 2025 | Author: jarxiv

Abstract: CineVerse, a new framework for the task of cinematic composition …

Categories: cs.CV

