「cs.CV」カテゴリーアーカイブ

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

投稿日: 2025年4月14日作成者: jarxiv

要約一般的な環境を積極的に探索しながら、任意のオブジェクトを説明する際のエージ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review

投稿日: 2025年4月14日作成者: jarxiv

要約自動化された運転には正確な車線検出が不可欠であり、さまざまな道路シナリオで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset

投稿日: 2025年4月14日作成者: jarxiv

要約デジタルツインカタログ（DTC）を紹介します。これは、新しい大規模なフォト … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.RO | コメントを受け付けていません

Discriminator-Free Direct Preference Optimization for Video Diffusion

投稿日: 2025年4月14日作成者: jarxiv

要約直接選好最適化（DPO）は、WIN/LOSITデータペアを通じてモデルを人 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection

投稿日: 2025年4月14日作成者: jarxiv

要約ディープフェイクの顔の急増は、私たちの日常生活に大きな潜在的な悪影響をもた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails

投稿日: 2025年4月14日作成者: jarxiv

要約リモートセンシングでは、同じシーンをキャプチャするさまざまなセンサーのマル … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery

投稿日: 2025年4月14日作成者: jarxiv

要約継続的な一般化されたカテゴリの発見が、以前に学んだカテゴリの壊滅的な忘却を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving Applications

投稿日: 2025年4月14日作成者: jarxiv

要約 RGBカメラからの画像の強化は、医療イメージング、衛星イメージング、自動運 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

F-LMM: Grounding Frozen Large Multimodal Models

投稿日: 2025年4月14日作成者: jarxiv

要約視覚的な接地能力を備えた大規模なマルチモーダルモデル（LMM）を支えると、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets

投稿日: 2025年4月14日作成者: jarxiv

要約熟度のレベルは、バナナの品質を決定するのに不可欠です。バナナの成熟度を正 … 続きを読む →

カテゴリー: 68T05, 68T07, 68T10, cs.CV, I.2.10 | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review

Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset

Discriminator-Free Direct Preference Optimization for Video Diffusion

Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection

COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails

Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery

Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving Applications

F-LMM: Grounding Frozen Large Multimodal Models

Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets

最近の投稿

最近のコメント

アーカイブ

カテゴリー