Monthly Archives: January 2025

4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation

Posted on January 7, 2025 by jarxiv

Abstract: Semantic segmentation of LiDAR points, in autonomous driving systems, …

Categories: cs.CV

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Posted on January 7, 2025 by jarxiv

Abstract: In recent years, vision language models (VLMs) have made significant progress in video understanding …

Categories: cs.CV

Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map

Posted on January 7, 2025 by jarxiv

Abstract: Adhering to traffic sign regulations is, for the navigation of both humans and autonomous vehicles, …

Categories: cs.AI, cs.CV

SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

Posted on January 7, 2025 by jarxiv

Abstract: Generating visual text in natural scene images has many unsolved problems …

Categories: cs.CV

Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild

Posted on January 7, 2025 by jarxiv

Abstract: Complex visual reasoning remains a key challenge today. Typically, for this challenge, chain-of-thought …

Categories: cs.AI, cs.CV

Human Gaze Boosts Object-Centered Representation Learning

Posted on January 7, 2025 by jarxiv

Abstract: Recent self-supervised learning trained on human-like egocentric visual inputs …

Categories: cs.CV, cs.LG

HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos

Posted on January 7, 2025 by jarxiv

Abstract: Despite the advent of 3D hand pose estimation, current methods mainly, within the camera frame, …

Categories: cs.CV

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Posted on January 7, 2025 by jarxiv

Abstract: Image diffusion models, to address the over-smoothing problem in GAN-based methods, …

Categories: cs.CV

A Novel Automatic Real-time Motion Tracking Method for Magnetic Resonance Imaging-guided Radiotherapy: Leveraging the Enhanced Tracking-Learning-Detection Framework with Automatic Segmentation

Posted on January 7, 2025 by jarxiv

Abstract: Background and Purpose: Accurate motion in MRI-guided radiotherapy (MRIgRT) …

Categories: cs.CV, cs.LG, eess.IV, physics.med-ph, q-bio.TO

LEDiff: Latent Exposure Diffusion for HDR Generation

Posted on January 7, 2025 by jarxiv

Abstract: On consumer displays, dynamic range beyond 10 stops …

Categories: cs.CV, eess.IV
