Archive for author "jarxiv"

From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos

Posted: June 6, 2025 | Author: jarxiv

Summary: In Composed Video Retrieval (CoVR), a query vide …

Categories: cs.CV

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Posted: June 6, 2025 | Author: jarxiv

Summary: NeRF, Gaussian Splatting (GS), and other neural ren …

Categories: cs.CV

Rectified Point Flow: Generic Point Cloud Pose Estimation

Posted: June 6, 2025 | Author: jarxiv

Summary: Pairwise point cloud registration and multi-part shape assembly as a single conditional …

Categories: cs.AI, cs.CV, cs.RO

Video World Models with Long-term Spatial Memory

Posted: June 6, 2025 | Author: jarxiv

Summary: Emerging world models, in response to actions such as camera movement and text prompts, …

Categories: cs.CV

RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion

Posted: June 6, 2025 | Author: jarxiv

Summary: 3D shape completion, in robotics, digital twin reconstruction, and …

Categories: cs.CV

Stable Vision Concept Transformers for Medical Diagnosis

Posted: June 6, 2025 | Author: jarxiv

Summary: Transparency is a paramount concern in the medical field, which has led researchers to explainable AI (XAI) …

Categories: cs.CV, cs.LG

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Posted: June 6, 2025 | Author: jarxiv

Summary: With the emergence of multimodal large language models (MLLMs), egocentric vi …

Categories: cs.CV

AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

Posted: June 6, 2025 | Author: jarxiv

Summary: In autoregressive image generation, based on previous tokens, the next token is pre …

Categories: cs.CV

DM-SegNet: Dual-Mamba Architecture for 3D Medical Image Segmentation with Global Context Modeling

Posted: June 6, 2025 | Author: jarxiv

Summary: For accurate 3D medical image segmentation, global context modelin …

Categories: cs.CV, eess.IV

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Posted: June 6, 2025 | Author: jarxiv

Summary: Recent advances in diffusion-based video restoration (VR) show significant improvements in visual quality …

Categories: cs.CV
