「cs.AI」カテゴリーアーカイブ

Metamorphic Testing for Pose Estimation Systems

投稿日: 2025年2月14日作成者: jarxiv

要約ポーズ推定システムは、スポーツ分析から家畜ケアまで、さまざまな分野で使用さ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.SE | コメントを受け付けていません

Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

投稿日: 2025年2月14日作成者: jarxiv

要約コンパクトな回転境界ボックス（Rbox）を使用した視覚オブジェクトの方向を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

投稿日: 2025年2月14日作成者: jarxiv

要約具体化されたエージェントを作成するためにマルチモーダルの大手言語モデル（M … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery

投稿日: 2025年2月14日作成者: jarxiv

要約 Vision Transformers（VIT）は最近、コンピュータービジ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Opening Articulated Objects in the Real World

投稿日: 2025年2月14日作成者: jarxiv

要約以前に見えなかった環境で、以前に見えなかったオブジェクトで有能に動作できる … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

投稿日: 2025年2月14日作成者: jarxiv

要約人間の参照からの器用な操作のための一般化可能なニューラル追跡コントローラー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Exploring the Potential of Encoder-free Architectures in 3D LMMs

投稿日: 2025年2月14日作成者: jarxiv

要約エンコーダーフリーのアーキテクチャは、2Dビジュアルドメインで事前に検討さ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

投稿日: 2025年2月14日作成者: jarxiv

要約チェーンオブシュート（COT）で質問に答えることで、大規模な言語モデル（L … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Better Embeddings with Coupled Adam

投稿日: 2025年2月14日作成者: jarxiv

要約それらの驚くべき能力にもかかわらず、LLMSは、異方性の望ましくないが理解 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation

投稿日: 2025年2月14日作成者: jarxiv

要約注意ベースの方法は、従来の幾何学的深部学習（GDL）モデルを上回り、球状の … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Metamorphic Testing for Pose Estimation Systems

Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery

Opening Articulated Objects in the Real World

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Exploring the Potential of Encoder-free Architectures in 3D LMMs

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Better Embeddings with Coupled Adam

Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation

最近の投稿

最近のコメント

アーカイブ

カテゴリー