投稿者「jarxiv」のアーカイブ

SITE: towards Spatial Intelligence Thorough Evaluation

投稿日: 2025年5月9日作成者: jarxiv

要約 Spatial Intelligence（SI）は、神経科学からロボット工 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

投稿日: 2025年5月9日作成者: jarxiv

要約 StreamBridgeを紹介します。これは、オフラインのビデオllmsを … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Generating Physically Stable and Buildable LEGO Designs from Text

投稿日: 2025年5月9日作成者: jarxiv

要約テキストプロンプトから物理的に安定したレゴブリックモデルを生成するための最 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Flow-GRPO: Training Flow Matching Models via Online RL

投稿日: 2025年5月9日作成者: jarxiv

要約 Flow-Grpoを提案します。これは、オンライン強化学習（RL）をフロー … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation

投稿日: 2025年5月9日作成者: jarxiv

要約画像の理解と生成のための統一されたモデルの最近の進歩は印象的ですが、ほとん … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion

投稿日: 2025年5月9日作成者: jarxiv

要約現在の構造からの構造（SFM）メソッドは、通常、2段階のパイプラインに続き … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D Scene Generation: A Survey

投稿日: 2025年5月9日作成者: jarxiv

要約 3Dシーンジェネレーションは、没入型メディア、ロボット工学、自律運転、具体 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation

投稿日: 2025年5月9日作成者: jarxiv

要約単一の画像から高品質のアニメーション可能な3Dヒトアバターを作成すると、単 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid

投稿日: 2025年5月9日作成者: jarxiv

要約保証された安全性分離は、共有空域で空中車両のシームレスな高密度操作を達成す … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation

投稿日: 2025年5月9日作成者: jarxiv

要約ロボット工学の効率的なパス計画、特に大規模で動的な環境内では、依然として重 … 続きを読む →

カテゴリー: cs.CL, cs.RO | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

SITE: towards Spatial Intelligence Thorough Evaluation

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

Generating Physically Stable and Buildable LEGO Designs from Text

Flow-GRPO: Training Flow Matching Models via Online RL

Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation

DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion

3D Scene Generation: A Survey

SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation

Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid

SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation

最近の投稿

最近のコメント

アーカイブ

カテゴリー