「cs.AI」カテゴリーアーカイブ

Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

投稿日: 2025年2月25日作成者: jarxiv

要約最近のテキスト間拡散モデルは、視覚的な生成タスクの範囲を強化するために効果 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement

投稿日: 2025年2月25日作成者: jarxiv

要約このホワイトペーパーでは、RGB-Dカメラのみを使用してテーブルトップの片 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

A novel approach to navigate the taxonomic hierarchy to address the Open-World Scenarios in Medicinal Plant Classification

投稿日: 2025年2月25日作成者: jarxiv

要約この記事では、問題をオープンクラスの問題として提起することにより、植物の階 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

AnyTop: Character Animation Diffusion with Any Topology

投稿日: 2025年2月25日作成者: jarxiv

要約任意のスケルトンの動きを生成することは、コンピューターグラフィックスの長年 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR | コメントを受け付けていません

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

投稿日: 2025年2月25日作成者: jarxiv

要約トレーニングデータに直接アクセスすることなく、著作権で保護されたコンテンツ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, I.2 | コメントを受け付けていません

RELICT: A Replica Detection Framework for Medical Image Generation

投稿日: 2025年2月25日作成者: jarxiv

要約深い学習モデルの一般化を強化し、改善するための合成医療データの可能性にもか … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Experimental validation of UAV search and detection system in real wilderness environment

投稿日: 2025年2月25日作成者: jarxiv

要約 Search and Rescue（SAR）ミッションには、特に挑戦的また … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

投稿日: 2025年2月25日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）は、近年、視覚認識タスクの急速な進 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

FACTR: Force-Attending Curriculum Training for Contact-Rich Policy Learning

投稿日: 2025年2月25日作成者: jarxiv

要約ボックスピックアップやローリング生地など、人間が実行する多くのコンタクトリ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

V-HOP: Visuo-Haptic 6D Object Pose Tracking

投稿日: 2025年2月25日作成者: jarxiv

要約人間は、操作中に堅牢なオブジェクト知覚のために視覚と触覚を自然に統合します … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement

A novel approach to navigate the taxonomic hierarchy to address the Open-World Scenarios in Medicinal Plant Classification

AnyTop: Character Animation Diffusion with Any Topology

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

RELICT: A Replica Detection Framework for Medical Image Generation

Experimental validation of UAV search and detection system in real wilderness environment

MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

FACTR: Force-Attending Curriculum Training for Contact-Rich Policy Learning

V-HOP: Visuo-Haptic 6D Object Pose Tracking

最近の投稿

最近のコメント

アーカイブ

カテゴリー