「cs.CV」カテゴリーアーカイブ

Stable Vision Concept Transformers for Medical Diagnosis

投稿日: 2025年6月6日作成者: jarxiv

要約透明性は医療分野で最も重要な懸念であり、研究者が説明可能なAI（XAI）の … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

投稿日: 2025年6月6日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLMS）の出現により、エゴセントリックビ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

投稿日: 2025年6月6日作成者: jarxiv

要約オートレーリングイメージの生成は、以前のトークンに基づいて次のトークンを予 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DM-SegNet: Dual-Mamba Architecture for 3D Medical Image Segmentation with Global Context Modeling

投稿日: 2025年6月6日作成者: jarxiv

要約正確な3D医療画像セグメンテーションには、グローバルなコンテキストモデリン … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

投稿日: 2025年6月6日作成者: jarxiv

要約拡散ベースのビデオ修復（VR）の最近の進歩は、視覚品質の大幅な改善を示して … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

投稿日: 2025年6月6日作成者: jarxiv

要約画像やビデオの包括的な地域レベルの視覚的理解のための概念的に簡単かつ効率的 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL

投稿日: 2025年6月6日作成者: jarxiv

要約考え方の推論と強化学習（RL）がNLPのブレークスルーを駆動していますが、 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels

投稿日: 2025年6月6日作成者: jarxiv

要約画像とオブジェクトインスタンス間で意味的に類似したポイント間の対応を見つけ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MARBLE: Material Recomposition and Blending in CLIP-Space

投稿日: 2025年6月6日作成者: jarxiv

要約模範的な画像に基づいた画像内のオブジェクトの資料の編集は、コンピュータービ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation

投稿日: 2025年6月6日作成者: jarxiv

要約ニューラルレンダリングは、3D再構成と新規ビューの合成に大きな進歩を遂げま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Stable Vision Concept Transformers for Medical Diagnosis

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

DM-SegNet: Dual-Mamba Architecture for 3D Medical Image Segmentation with Global Context Modeling

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL

Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels

MARBLE: Material Recomposition and Blending in CLIP-Space

ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation

最近の投稿

最近のコメント

アーカイブ

カテゴリー