Monthly Archive: May 2025

Position: Interactive Generative Video as Next-Generation Game Engine

Posted: May 30, 2025 | Author: jarxiv

Summary: Modern game development, because of the predetermined content of traditional game engines, creativity and …

Categories: cs.CV

UniViTAR: Unified Vision Transformer with Native Resolution

Posted: May 30, 2025 | Author: jarxiv

Summary: Conventional Vision Transformers, by standardizing the input resolution, visual modeling …

Categories: cs.CV

Comparing the Effects of Persistence Barcodes Aggregation and Feature Concatenation on Medical Imaging

Posted: May 30, 2025 | Author: jarxiv

Summary: In medical image analysis, feature engineering, machine learning model design and performa …

Categories: cs.AI, cs.CV

SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes

Posted: May 30, 2025 | Author: jarxiv

Summary: In this work, NVIDIA's Isaac Sim Replicator Composer …

Categories: cs.CV

PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes

Posted: May 30, 2025 | Author: jarxiv

Summary: Training perception systems for self-driving cars requires, from manual labels, labor-intensive …

Categories: cs.CV

Radiant Triangle Soup with Soft Connectivity Forces for 3D Reconstruction and Novel View Synthesis

Posted: May 30, 2025 | Author: jarxiv

Summary: In this work, using triangles to represent scene geometry and appearance, inference-time …

Categories: cs.CV

Merge-Friendly Post-Training Quantization for Multi-Target Domain Adaptation

Posted: May 30, 2025 | Author: jarxiv

Summary: Model merging combines task-specific weights, multi-target domain …

Categories: cs.CV, cs.LG

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Posted: May 30, 2025 | Author: jarxiv

Summary: Through recent advances in text-to-video (T2V) diffusion models, faithful and realistic …

Categories: cs.CV

D-AR: Diffusion via Autoregressive Models

Posted: May 30, 2025 | Author: jarxiv

Summary: In this paper, in the standard next-token-prediction fashion, vanilla …

Categories: cs.CV

OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation

Posted: May 30, 2025 | Author: jarxiv

Summary: In this report, for unifying multimodal understanding and generation, a simple and lightweight …

Categories: cs.CV
