「cs.CV」カテゴリーアーカイブ

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs

投稿日: 2024年12月2日作成者: jarxiv

要約画像領域におけるマルチモーダル大規模言語モデル (MLLM) の成功は、研 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates

投稿日: 2024年12月2日作成者: jarxiv

要約画像から物理ベースのマテリアルをその構成プロパティに分解することは、特に計 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing weed detection performance by means of GenAI-based image augmentation

投稿日: 2024年12月2日作成者: jarxiv

要約作物の生産性と生態系のバランスを維持するには、正確な雑草管理が不可欠です。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DECODE: Domain-aware Continual Domain Expansion for Motion Prediction

投稿日: 2024年11月28日作成者: jarxiv

要約自動運転車が複雑な環境を効果的に移動し、他の交通参加者の行動を正確に予測す … 続きを読む →

カテゴリー: cs.CV, cs.RO, I.2.9 | コメントを受け付けていません

HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction

投稿日: 2024年11月28日作成者: jarxiv

要約 HI-SLAM2 は、RGB 入力のみを使用して高速かつ正確な単眼シーンの … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness

投稿日: 2024年11月28日作成者: jarxiv

要約この論文では、人間のダンサー (Cuan) と産業用ロボットアームを組み … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs

投稿日: 2024年11月28日作成者: jarxiv

要約固定翼無人航空機 (UAV) は、その長時間の耐久性と高速機能により、急成 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback

投稿日: 2024年11月28日作成者: jarxiv

要約安全、快適、効率的なナビゲーションを確保することは、自動運転システムにとっ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Image Compression Using Novel View Synthesis Priors

投稿日: 2024年11月28日作成者: jarxiv

要約リアルタイムの視覚フィードバックは、特に検査や操作作業中に、遠隔操作車両の … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV | コメントを受け付けていません

Applications of Spiking Neural Networks in Visual Place Recognition

投稿日: 2024年11月28日作成者: jarxiv

要約ロボット工学では、スパイキングニューラルネットワーク (SNN) は、 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs

SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates

Enhancing weed detection performance by means of GenAI-based image augmentation

DECODE: Domain-aware Continual Domain Expansion for Motion Prediction

HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction

Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness

Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs

FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback

Image Compression Using Novel View Synthesis Priors

Applications of Spiking Neural Networks in Visual Place Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー