月別アーカイブ: 2024年5月

Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing

投稿日: 2024年5月8日作成者: jarxiv

要約シーンテキスト画像には、スタイル情報（フォント、背景）だけでなく、コンテン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

$\textbf{Splat-MOVER}$: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting

投稿日: 2024年5月8日作成者: jarxiv

要約我々は、オープンボキャブラリーロボット操作のためのモジュラーロボットスタッ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

投稿日: 2024年5月8日作成者: jarxiv

要約ビジョン中心の自動運転は、コストが低いため、最近広く注目を集めています。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

BILTS: A novel bi-invariant local trajectory-shape descriptor for rigid-body motion

投稿日: 2024年5月8日作成者: jarxiv

要約動作と確立された動作モデルとの類似性を測定することは、動作の分析、認識、生 … 続きを読む →

カテゴリー: cs.CG, cs.CV, cs.RO | コメントを受け付けていません

Deep Unlearning: Fast and Efficient Training-free Approach to Class Forgetting

投稿日: 2024年5月8日作成者: jarxiv

要約機械のアンラーニングは、ユーザーデータの削除に対する規制上の要求とプライバ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks

投稿日: 2024年5月8日作成者: jarxiv

要約画像理解機能を備えた大規模言語モデル (LLM) の強化により、高性能の視 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Vision Mamba: A Comprehensive Survey and Taxonomy

投稿日: 2024年5月8日作成者: jarxiv

要約状態空間モデル (SSM) は、動的システムの動作を記述および分析するため … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models

投稿日: 2024年5月8日作成者: jarxiv

要約大規模な事前トレーニング済み視覚モデルは、さまざまな認識タスクにわたって顕 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

投稿日: 2024年5月8日作成者: jarxiv

要約ドキュメント画像の品質は全体的なパフォーマンスに大きく影響するため、ドキュ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid

投稿日: 2024年5月8日作成者: jarxiv

要約 Neural Radiance Field~(NeRF) は、オブジェクト … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年5月

Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing

$\textbf{Splat-MOVER}$: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

BILTS: A novel bi-invariant local trajectory-shape descriptor for rigid-body motion

Deep Unlearning: Fast and Efficient Training-free Approach to Class Forgetting

Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks

Vision Mamba: A Comprehensive Survey and Taxonomy

On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid

最近の投稿

最近のコメント

アーカイブ

カテゴリー