投稿者「jarxiv」のアーカイブ

Stroke-based Cyclic Amplifier: Image Super-Resolution at Arbitrary Ultra-Large Scales

投稿日: 2025年6月13日作成者: jarxiv

要約以前の任意のスケール画像スーパー解像度（ASISR）メソッドは、アップサン … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SlotPi: Physics-informed Object-centric Reasoning Models

投稿日: 2025年6月13日作成者: jarxiv

要約現実世界の人間の能力に似た視覚的観察を通じて、物理的法則によって支配される … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Human-Robot Navigation using Event-based Cameras and Reinforcement Learning

投稿日: 2025年6月13日作成者: jarxiv

要約この作業では、イベントカメラとその他のセンサーを補強学習と組み合わせて、リ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Modality-AGnostic Image Cascade (MAGIC) for Multi-Modality Cardiac Substructure Segmentation

投稿日: 2025年6月13日作成者: jarxiv

要約心臓の下部構造は、放射線誘発性心疾患のリスクを最小限に抑えるために胸部放射 … 続きを読む →

カテゴリー: cs.CV, physics.med-ph | コメントを受け付けていません

Prompts to Summaries: Zero-Shot Language-Guided Video Summarization

投稿日: 2025年6月13日作成者: jarxiv

要約ビデオデータの爆発的な成長により、ドメイン固有のトレーニングデータなしで動 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Unsupervised Deformable Image Registration with Structural Nonparametric Smoothing

投稿日: 2025年6月13日作成者: jarxiv

要約学習ベースの変形可能な画像登録（DIR）は、ニューラルネットワークを介した … 続きを読む →

カテゴリー: cs.CV, eess.IV, eess.SP | コメントを受け付けていません

Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders

投稿日: 2025年6月13日作成者: jarxiv

要約単眼のRGB画像からのハンドオブジェクトのポーズ推定は、主に手観書の相互作 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

投稿日: 2025年6月13日作成者: jarxiv

要約長いビデオ理解（LVU）は、タスクに固有の複雑さとコンテキストウィンドウの … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches

投稿日: 2025年6月13日作成者: jarxiv

要約大規模な言語モデルのパラダイムシフトが成功し、データの大規模なコーパスでの … 続きを読む →

カテゴリー: A.1, cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

投稿日: 2025年6月13日作成者: jarxiv

要約ビデオ分析からインタラクティブなシステムに至るまで、ビデオコンテンツの理解 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Stroke-based Cyclic Amplifier: Image Super-Resolution at Arbitrary Ultra-Large Scales

SlotPi: Physics-informed Object-centric Reasoning Models

Human-Robot Navigation using Event-based Cameras and Reinforcement Learning

Modality-AGnostic Image Cascade (MAGIC) for Multi-Modality Cardiac Substructure Segmentation

Prompts to Summaries: Zero-Shot Language-Guided Video Summarization

Unsupervised Deformable Image Registration with Structural Nonparametric Smoothing

Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

最近の投稿

最近のコメント

アーカイブ

カテゴリー