月別アーカイブ: 2024年5月

Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation

投稿日: 2024年5月8日作成者: jarxiv

要約話す顔の生成タスクの目的は、視覚的な詳細とアイデンティティ情報を維持しなが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing Boundary Segmentation for Topological Accuracy with Skeleton-based Methods

投稿日: 2024年5月8日作成者: jarxiv

要約トポロジーの一貫性は、ニューロン電子顕微鏡画像の細胞膜セグメンテーション、 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

投稿日: 2024年5月8日作成者: jarxiv

要約クラス増分学習 (CIL) の分野では、生成モデルの継続的な改善と並行して … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications

投稿日: 2024年5月8日作成者: jarxiv

要約 Neural Radiance Fields (NeRF) は、3D シー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

A Unified Approach for Text- and Image-guided 4D Scene Generation

投稿日: 2024年5月8日作成者: jarxiv

要約大規模な拡散生成モデルにより、ユーザーが提供したテキストプロンプトや画像 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation

投稿日: 2024年5月8日作成者: jarxiv

要約テキストプロンプトとセマンティックマスクや落書きマップなどの視覚入力を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Solving the bongard-logo problem by modeling a probabilistic model

投稿日: 2024年5月8日作成者: jarxiv

要約抽象的な推論の問題は、AI アルゴリズムの知覚および認知能力に挑戦し、明示 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Zero Grads: Learning Local Surrogate Losses for Non-Differentiable Graphics

投稿日: 2024年5月8日作成者: jarxiv

要約勾配ベースの最適化は現在、グラフィックス全体で広く普及していますが、残念な … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

投稿日: 2024年5月8日作成者: jarxiv

要約ハンドとオブジェクトの相互作用中に人間がどのように動作するかを理解すること … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PoseINN: Realtime Visual-based Pose Regression and Localization with Invertible Neural Networks

投稿日: 2024年5月8日作成者: jarxiv

要約カメラからエゴポーズを推定することは、移動ロボット工学から拡張現実感まで幅 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

月別アーカイブ: 2024年5月

Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation

Enhancing Boundary Segmentation for Topological Accuracy with Skeleton-based Methods

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications

A Unified Approach for Text- and Image-guided 4D Scene Generation

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation

Solving the bongard-logo problem by modeling a probabilistic model

Zero Grads: Learning Local Surrogate Losses for Non-Differentiable Graphics

Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

PoseINN: Realtime Visual-based Pose Regression and Localization with Invertible Neural Networks

最近の投稿

最近のコメント

アーカイブ

カテゴリー