Archive of posts by "jarxiv"

Show-o2: Improved Native Unified Multimodal Models

Posted: June 19, 2025 | Author: jarxiv

Summary: In this paper, leveraging autoregressive modeling and flow matching, an improved …

Categories: cs.CV

Baltimore Atlas: FreqWeaver Adapter for Semi-supervised Ultra-high Spatial Resolution Land Cover Classification

Posted: June 19, 2025 | Author: jarxiv

Summary: Ultra-high spatial resolution land cover classification is essential for fine-grained land cover analysis, but …

Categories: cs.CV

A Unified Graph-based Framework for Scalable 3D Tree Reconstruction and Non-Destructive Biomass Estimation from Point Clouds

Posted: June 19, 2025 | Author: jarxiv

Summary: Estimating forest above-ground biomass (AGB), to assess carbon storage and sustainable forest management, …

Categories: cs.CV

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy

Posted: June 19, 2025 | Author: jarxiv

Summary: World models simulate the environment and enable effective agent behavior …

Categories: cs.AI, cs.CV

RDD: Robust Feature Detector and Descriptor using Deformable Transformer

Posted: June 19, 2025 | Author: jarxiv

Summary: As a core step in structure-from-motion and SLAM, … such as significant viewpoint changes …

Categories: cs.CV

One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution

Posted: June 19, 2025 | Author: jarxiv

Summary: In particular, for realistic detail synthesis, pre-trained generative … such as Stable Diffusion (SD) …

Categories: cs.AI, cs.CV

Mono-Modalizing Extremely Heterogeneous Multi-Modal Medical Image Registration

Posted: June 19, 2025 | Author: jarxiv

Summary: In clinical practice, functional … such as positron emission tomography (PET) and fractional anisotropy (FA) …

Categories: cs.CV, I.4.5

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding

Posted: June 19, 2025 | Author: jarxiv

Summary: Synthetic video generation has drawn significant attention for its realism and broad applications …

Categories: cs.CV, cs.LG

A dataset of high-resolution plantar pressures for gait analysis across varying footwear and walking speeds

Posted: June 19, 2025 | Author: jarxiv

Summary: Gait refers to the pattern of limb movements generated during walking. This is a physical …

Categories: cs.CV, cs.LG

I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling

Posted: June 19, 2025 | Author: jarxiv

Summary: In multi-modal medical image synthesis, tissue … between the source and target modalities …

Categories: cs.CV, eess.IV
