「cs.CV」カテゴリーアーカイブ

Real-Time Video Generation with Pyramid Attention Broadcast

投稿日: 2024年8月23日作成者: jarxiv

要約私たちは、DiT ベースのビデオ生成のためのリアルタイム、高品質、トレーニ … 続きを読む →

カテゴリー: cs.CV, cs.DC | コメントを受け付けていません

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

投稿日: 2024年8月23日作成者: jarxiv

要約テキストの説明からリアルなシーンを生成できるテキストからビデオ (T2V) … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Automating Deformable Gasket Assembly

投稿日: 2024年8月23日作成者: jarxiv

要約ガスケットの組み立てでは、変形可能なガスケットを狭いチャネルに位置合わせし … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

投稿日: 2024年8月23日作成者: jarxiv

要約ボリュームレンダリングによるニューラル暗黙的再構成は、密な 3D サー … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

DreamCinema: Cinematic Transfer with Free Camera and 3D Character

投稿日: 2024年8月23日作成者: jarxiv

要約私たちはデジタルメディアの隆盛の時代に生きており、誰もが個人の映画製作者 … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.MM | コメントを受け付けていません

A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth

投稿日: 2024年8月23日作成者: jarxiv

要約この研究では主に、中国の山水画を生成するための安定拡散モデル (SDM) … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exploring Robustness of Visual State Space model against Backdoor Attacks

投稿日: 2024年8月23日作成者: jarxiv

要約 Visual State Space Model (VSS) は、さまざま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Target-Oriented Object Grasping via Multimodal Human Guidance

投稿日: 2024年8月22日作成者: jarxiv

要約人間とロボットの対話やコラボレーションのシナリオにおいて、ロボットによる把 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

投稿日: 2024年8月22日作成者: jarxiv

要約最近、トランスフォーマベースの方法は、単一の 2D 画像から 3D 属性を … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV | コメントを受け付けていません

Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis

投稿日: 2024年8月22日作成者: jarxiv

要約従来のジオメトリベースの SLAM システムは、通常、データの関連付けが特 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Real-Time Video Generation with Pyramid Attention Broadcast

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Automating Deformable Gasket Assembly

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

DreamCinema: Cinematic Transfer with Free Camera and 3D Character

A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth

Exploring Robustness of Visual State Space model against Backdoor Attacks

Target-Oriented Object Grasping via Multimodal Human Guidance

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis

最近の投稿

最近のコメント

アーカイブ

カテゴリー