「cs.CV」カテゴリーアーカイブ

Skeleton-Based Human Action Recognition with Noisy Labels

投稿日: 2024年8月7日作成者: jarxiv

要約人間と空間を共有する支援ロボットにとって、次のインタラクションについて情報 … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV | コメントを受け付けていません

Towards Activated Muscle Group Estimation in the Wild

投稿日: 2024年8月7日作成者: jarxiv

要約この論文では、野生での身体活動中に活動している筋肉領域を特定することを目的 … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV | コメントを受け付けていません

MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model

投稿日: 2024年8月7日作成者: jarxiv

要約 LiDAR ベースの移動物体セグメンテーション (MOS) は、以前のスキ … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.RO, eess.IV | コメントを受け付けていません

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

投稿日: 2024年8月7日作成者: jarxiv

要約画像キャプション用の検索拡張モデルの最近の進歩により、強力なドメイン転送機 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications

投稿日: 2024年8月7日作成者: jarxiv

要約内視鏡手術は 2 次元のビューに依存しているため、外科医にとっては深さの認 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Prototype Learning for Micro-gesture Classification

投稿日: 2024年8月7日作成者: jarxiv

要約このペーパーでは、IJCAI 2024 の MiGA チャレンジにおけるマ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Multi-Agent 3D Map Reconstruction and Change Detection in Microgravity with Free-Flying Robots

投稿日: 2024年8月7日作成者: jarxiv

要約国際宇宙ステーション (ISS) にある NASA の Astrobee … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Source-Free Domain-Invariant Performance Prediction

投稿日: 2024年8月7日作成者: jarxiv

要約モデルのパフォーマンスを正確に推定することは、特にソースドメインとターゲ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Benchmarking In-the-wild Multimodal Disease Recognition and A Versatile Baseline

投稿日: 2024年8月7日作成者: jarxiv

要約既存の植物病害分類モデルは、研究室内の病害画像の認識において顕著な性能を達 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation

投稿日: 2024年8月7日作成者: jarxiv

要約大規模なテキストから画像への拡散モデルは、生成 AI とマルチモーダルテ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Skeleton-Based Human Action Recognition with Noisy Labels

Towards Activated Muscle Group Estimation in the Wild

MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications

Prototype Learning for Micro-gesture Classification

Multi-Agent 3D Map Reconstruction and Change Detection in Microgravity with Free-Flying Robots

Source-Free Domain-Invariant Performance Prediction

Benchmarking In-the-wild Multimodal Disease Recognition and A Versatile Baseline

FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation

最近の投稿

最近のコメント

アーカイブ

カテゴリー