「cs.CV」カテゴリーアーカイブ

Shape-Based Single Object Classification Using Ensemble Method Classifiers

投稿日: 2025年1月17日作成者: jarxiv

要約最近は画像も増えてきています。画像の注釈付けと取得は分類の問題を引き起こ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Vision-Language Models Do Not Understand Negation

投稿日: 2025年1月17日作成者: jarxiv

要約多くの実用的なビジョン言語アプリケーションでは、自然言語を使用して特定のオ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection

投稿日: 2025年1月17日作成者: jarxiv

要約物体検出は、自動運転やセキュリティからスマートシティに至るまで、幅広い用途 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.DC | コメントを受け付けていません

DriveLM: Driving with Graph Visual Question Answering

投稿日: 2025年1月17日作成者: jarxiv

要約私たちは、Web スケールのデータでトレーニングされたビジョン言語モデル … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DEFOM-Stereo: Depth Foundation Model Based Stereo Matching

投稿日: 2025年1月17日作成者: jarxiv

要約ステレオマッチングは、コンピュータービジョンとロボット工学におけるメト … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Diffusion Models in Vision: A Survey

投稿日: 2025年1月17日作成者: jarxiv

要約ノイズ除去拡散モデルは、コンピュータビジョンにおける最近の新たなトピック … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

MonoSOWA: Scalable monocular 3D Object detector Without human Annotations

投稿日: 2025年1月17日作成者: jarxiv

要約単一の RGB カメラを使用してオブジェクトの 3 次元の位置と方向を検出 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning

投稿日: 2025年1月17日作成者: jarxiv

要約 LiDAR は自動運転において重要なセンサーであり、一般的にカメラと併用さ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment

投稿日: 2025年1月17日作成者: jarxiv

要約この記事では、さまざまな ROS ベースの SLAM システムによって計算 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Instruction-Guided Fusion of Multi-Layer Visual Features in Large Vision-Language Models

投稿日: 2025年1月17日作成者: jarxiv

要約大規模ビジョン言語モデル (LVLM) は、事前トレーニングされたビジョン … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Shape-Based Single Object Classification Using Ensemble Method Classifiers

Vision-Language Models Do Not Understand Negation

RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection

DriveLM: Driving with Graph Visual Question Answering

DEFOM-Stereo: Depth Foundation Model Based Stereo Matching

Diffusion Models in Vision: A Survey

MonoSOWA: Scalable monocular 3D Object detector Without human Annotations

The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning

Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment

Instruction-Guided Fusion of Multi-Layer Visual Features in Large Vision-Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー