月別アーカイブ: 2025年2月

Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation

投稿日: 2025年2月25日作成者: jarxiv

要約オーディオ駆動のトーキングヘッド生成のための新しいフレームワークであるディ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Modeling Multi-modal Cross-interaction for Multi-label Few-shot Image Classification Based on Local Feature Selection

投稿日: 2025年2月25日作成者: jarxiv

要約マルチラベル少数のショット画像分類（ML-FSIC）の目的は、各ラベルに少 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Motion-Robust T2* Quantification from Gradient Echo MRI with Physics-Informed Deep Learning

投稿日: 2025年2月25日作成者: jarxiv

要約目的：勾配エコーからのT2*の定量化磁気共鳴画像法は、運動の影響を受け、信 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV, physics.med-ph | コメントを受け付けていません

A Two-step Linear Mixing Model for Unmixing under Hyperspectral Variability

投稿日: 2025年2月25日作成者: jarxiv

要約スペクトルアンミキシングは、ハイパースペクトル画像処理の研究分野で重要なタ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

ELFS: Label-Free Coreset Selection with Proxy Training Dynamics

投稿日: 2025年2月25日作成者: jarxiv

要約高品質のヒトが注目したデータは、最新の深い学習パイプラインにとって重要です … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

投稿日: 2025年2月25日作成者: jarxiv

要約最近のテキスト間拡散モデルは、視覚的な生成タスクの範囲を強化するために効果 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

X Modality Assisting RGBT Object Tracking

投稿日: 2025年2月25日作成者: jarxiv

要約堅牢なマルチモーダル機能表現の開発は、オブジェクト追跡パフォーマンスを強化 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement

投稿日: 2025年2月25日作成者: jarxiv

要約このホワイトペーパーでは、RGB-Dカメラのみを使用してテーブルトップの片 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

MegaLoc: One Retrieval to Place Them All

投稿日: 2025年2月25日作成者: jarxiv

要約特定のクエリと同じ場所から画像を取得することは、視覚的な場所認識、ランドマ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder

投稿日: 2025年2月25日作成者: jarxiv

要約事前に訓練された画像自動エンコーダーは、コンピュータービジョンでますます利 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年2月

Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation

Modeling Multi-modal Cross-interaction for Multi-label Few-shot Image Classification Based on Local Feature Selection

Motion-Robust T2* Quantification from Gradient Echo MRI with Physics-Informed Deep Learning

A Two-step Linear Mixing Model for Unmixing under Hyperspectral Variability

ELFS: Label-Free Coreset Selection with Proxy Training Dynamics

Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

X Modality Assisting RGBT Object Tracking

Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement

MegaLoc: One Retrieval to Place Them All

Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder

最近の投稿

最近のコメント

アーカイブ

カテゴリー