Category archive: cs.CV

Robotic framework for autonomous manipulation of laboratory equipment with different degrees of transparency via 6D pose estimation

Posted on October 11, 2024 by jarxiv

Summary: Many modern robotic systems operate autonomously, but accurately analyzing the environment and …

Categories: cs.CV, cs.RO, cs.SE, cs.SY, eess.SY

LaB-CL: Localized and Balanced Contrastive Learning for improving parking slot detection

Posted on October 11, 2024 by jarxiv

Summary: Parking slot detection is an essential technology for automated parking systems. In general, parking slot …

Categories: cs.AI, cs.CV, cs.RO

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Posted on October 11, 2024 by jarxiv

Summary: Bimanual manipulation is essential in robotics, but coordinating two robot arms …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Understanding Spatio-Temporal Relations in Human-Object Interaction using Pyramid Graph Convolutional Network

Posted on October 11, 2024 by jarxiv

Summary: Recognizing human activities is an important task for intelligent robots. In particular, human …

Categories: cs.CV, cs.RO

Understanding Human Activity with Uncertainty Measure for Novelty in Graph Convolutional Networks

Posted on October 11, 2024 by jarxiv

Summary: Understanding human activity, especially in the field of human-robot collaboration, …

Categories: cs.CV, cs.RO

Multimodal Perception System for Real Open Environment

Posted on October 11, 2024 by jarxiv

Summary: This paper introduces a new multimodal perception system for real open environments …

Categories: cs.CV, cs.RO

DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving

Posted on October 11, 2024 by jarxiv

Summary: For the evaluation and training of autonomous driving systems, diverse and scalable corner cases …

Categories: cs.CV, cs.RO

TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning

Posted on October 11, 2024 by jarxiv

Summary: Having models understand complex, multimodal content such as TV clips …

Categories: cs.AI, cs.CL, cs.CV, I.2.10

Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Posted on October 11, 2024 by jarxiv

Summary: In this paper, modern convolutional neural networks (ConvNets) …

Categories: cs.AI, cs.CV, cs.LG

A framework for compressing unstructured scientific data via serialization

Posted on October 11, 2024 by jarxiv

Summary: A general framework for compressing unstructured scientific data using known local connectivity …

Categories: cs.CV
