Monthly Archives: January 2025

Gaussian Eigen Models for Human Heads

Posted on January 15, 2025 by jarxiv

Abstract: Current personalized neural head avatars face a trade-off … Continue reading →

Categories: cs.CV | Comments closed

LayerAnimate: Layer-specific Control for Animation

Posted on January 15, 2025 by jarxiv

Abstract: In animated videos, foreground and background elements are separated into layers, … Continue reading →

Categories: cs.CV | Comments closed

Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers

Posted on January 15, 2025 by jarxiv

Abstract: Semantic future prediction is important for autonomous systems navigating dynamic environments … Continue reading →

Categories: cs.CV | Comments closed

MiniMax-01: Scaling Foundation Models with Lightning Attention

Posted on January 15, 2025 by jarxiv

Abstract: Including MiniMax-Text-01 and MiniMax-VL-01 … Continue reading →

Categories: cs.CL, cs.CV | Comments closed

Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation

Posted on January 15, 2025 by jarxiv

Abstract: Neural networks in risk-sensitive applications such as medical diagnosis … Continue reading →

Categories: cs.CV, cs.LG, stat.ML | Comments closed

Diffusion Adversarial Post-Training for One-Step Video Generation

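Posted on January 15, 2025 by jarxiv

Abstract: Diffusion models are widely used for image and video generation, but their iterative generation process … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments closed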

GameFactory: Creating New Games with Generative Interactive Videos

Posted on January 15, 2025 by jarxiv

Abstract: Generative game engines autonomously create new content, and the manual workload … Continue reading →

Categories: cs.CV | Comments closed

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Posted on January 15, 2025 by jarxiv

Abstract: We … designed to facilitate region-level understanding of both images and videos … Continue reading →

Categories: cs.CV | Comments closed

Predicting 4D Hand Trajectory from Monocular Videos

Posted on January 15, 2025 by jarxiv

Abstract: An approach for inferring coherent 4D hand trajectories from monocular videos, H … Continue reading →

Categories: cs.CV | Comments closed

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Posted on January 15, 2025 by jarxiv

Abstract: Generative modeling aims to transform random noise into structured outputs … Continue reading →

Categories: cs.CV | Comments closed
