「cs.CV」カテゴリーアーカイブ

StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart

投稿日: 2024年11月22日作成者: jarxiv

要約人間の両眼視を模倣した高品質のステレオビデオを生成するには、フレーム全体 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching

投稿日: 2024年11月22日作成者: jarxiv

要約エクスペリエンス目標の視覚的再配置タスクは、Embedded AI 内の基 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Layer Pruning with Consensus: A Triple-Win Solution

投稿日: 2024年11月22日作成者: jarxiv

要約レイヤープルーニングは、標準的な構造化プルーニングに代わる有望な代替手段 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

投稿日: 2024年11月22日作成者: jarxiv

要約本稿では、IDEA Research が開発したこれまでで最高のオープンワ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models

投稿日: 2024年11月22日作成者: jarxiv

要約医用画像のセグメンテーションは正確な臨床診断に不可欠ですが、病変と正常組織 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas

投稿日: 2024年11月22日作成者: jarxiv

要約衛星画像を使用した機械学習 (SatML) の進歩により、地球規模での環境 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation

投稿日: 2024年11月22日作成者: jarxiv

要約同時位置特定とマッピング (SLAM) 技術を使用して視覚障害者をナビゲー … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Localizing Events in Videos with Multimodal Queries

投稿日: 2024年11月22日作成者: jarxiv

要約ビデオ検索などのユーザー指向アプリケーションの重要性が高まる中、セマンティ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning

投稿日: 2024年11月22日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、幅広い下流タスクにわたって優れたパフォ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Using Formal Models, Safety Shields and Certified Control to Validate AI-Based Train Systems

投稿日: 2024年11月22日作成者: jarxiv

要約自律システムの認証は、科学と産業において重要な関心事です。 KI-LOK … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart

SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching

Layer Pruning with Consensus: A Triple-Win Solution

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models

Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas

InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation

Localizing Events in Videos with Multimodal Queries

Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning

Using Formal Models, Safety Shields and Certified Control to Validate AI-Based Train Systems

最近の投稿

最近のコメント

アーカイブ

カテゴリー