「cs.CV」カテゴリーアーカイブ

Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change

投稿日: 2025年1月10日作成者: jarxiv

要約人工空間の 3D 幾何学マップの構築は、コンピュータービジョンとロボット … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data

投稿日: 2025年1月10日作成者: jarxiv

要約人間のドライバーとは対照的に、現在の自動運転システムは依然としてトレーニン … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark

投稿日: 2025年1月10日作成者: jarxiv

要約大規模ビジョン言語モデル (LVLM) によるロボットの一般化の強化がます … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision

投稿日: 2025年1月10日作成者: jarxiv

要約深度推定 (DE) は、シーンに関する空間情報を提供し、3D 再構成、オブ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Exosense: A Vision-Based Scene Understanding System For Exoskeletons

投稿日: 2025年1月10日作成者: jarxiv

要約自己平衡外骨格は、運動障害のある人にとって重要な技術です。現在の課題は人 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Multi-Task Model Merging via Adaptive Weight Disentanglement

投稿日: 2025年1月10日作成者: jarxiv

要約モデルのマージは、さまざまなタスクからのタスク固有の重みを統合されたマルチ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion

投稿日: 2025年1月10日作成者: jarxiv

要約大規模言語モデル (LLM) は、さまざまな推論タスクにわたって強力なパフ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

OneLLM: One Framework to Align All Modalities with Language

投稿日: 2025年1月10日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、その強力なマルチモーダル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model

投稿日: 2025年1月10日作成者: jarxiv

要約これまでの大規模ビジョン言語モデル (LVLM) のほとんどは、主に英語デ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences

投稿日: 2025年1月10日作成者: jarxiv

要約低コストの 3D センサーの普及により、生の点群として表現された非剛体変形 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change

AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data

ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark

A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision

Exosense: A Vision-Based Scene Understanding System For Exoskeletons

Multi-Task Model Merging via Adaptive Weight Disentanglement

InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion

OneLLM: One Framework to Align All Modalities with Language

Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model

CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences

最近の投稿

最近のコメント

アーカイブ

カテゴリー