「cs.CV」カテゴリーアーカイブ

Control-oriented Clustering of Visual Latent Representation

投稿日: 2025年2月7日作成者: jarxiv

要約視覚表現空間のジオメトリ（Visionエンコーダーからアクションデコーダー … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

ImDy: Human Inverse Dynamics from Imitated Observations

投稿日: 2025年2月7日作成者: jarxiv

要約人間の運動観察から駆動されるトルクを再現することを目的とする逆ダイナミクス … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.RO | コメントを受け付けていません

LeAP: Consistent multi-domain 3D labeling using Foundation Models

投稿日: 2025年2月7日作成者: jarxiv

要約データセットの可用性は、3Dセマンティック理解に関する研究の強力なドライバ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks

投稿日: 2025年2月7日作成者: jarxiv

要約実用的なナビゲーションエージェントは、次の指示、オブジェクトの検索、質問へ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

A Self-supervised Multimodal Deep Learning Approach to Differentiate Post-radiotherapy Progression from Pseudoprogression in Glioblastoma

投稿日: 2025年2月7日作成者: jarxiv

要約膠芽腫（GBM）患者の放射線療法（RT）後の真の進行（TP）からの擬似プロ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Enhancing people localisation in drone imagery for better crowd management by utilising every pixel in high-resolution images

投稿日: 2025年2月7日作成者: jarxiv

要約ドローンを使用した正確な人々のローカリゼーションは、大規模なイベントや公開 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery

投稿日: 2025年2月7日作成者: jarxiv

要約一般化されたカテゴリ発見（GCD）問題に取り組みます。これは、既知のクラス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction

投稿日: 2025年2月7日作成者: jarxiv

要約ビジョンベースの3D占有予測のタスクは、3Dジオメトリを再構築し、2Dから … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition

投稿日: 2025年2月7日作成者: jarxiv

要約人間の活動認識（HAR）は、ヘルスケア、スポーツ、フィットネス、セキュリテ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Beyond Random Augmentations: Pretraining with Hard Views

投稿日: 2025年2月7日作成者: jarxiv

要約自己教師の学習（SSL）メソッドは、通常、ランダムな画像の増強またはビュー … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Control-oriented Clustering of Visual Latent Representation

ImDy: Human Inverse Dynamics from Imitated Observations

LeAP: Consistent multi-domain 3D labeling using Foundation Models

Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks

A Self-supervised Multimodal Deep Learning Approach to Differentiate Post-radiotherapy Progression from Pseudoprogression in Glioblastoma

Enhancing people localisation in drone imagery for better crowd management by utilising every pixel in high-resolution images

Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery

Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction

MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition

Beyond Random Augmentations: Pretraining with Hard Views

最近の投稿

最近のコメント

アーカイブ

カテゴリー