月別アーカイブ: 2025年3月

Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation

投稿日: 2025年3月21日作成者: jarxiv

要約インタラクティブポイントプロンプトベースの画像セグメンテーションの最近の進 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SceneMI: Motion In-betweening for Modeling Human-Scene Interactions

投稿日: 2025年3月21日作成者: jarxiv

要約人間の相互作用（HSI）のモデリングは、日常の人間の行動を理解してシミュレ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Unleashing Vecset Diffusion Model for Fast Shape Generation

投稿日: 2025年3月21日作成者: jarxiv

要約 3D形状生成は、特にVECSET拡散モデル（VDM）を通じて、いわゆる「ネ … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Unifying 2D and 3D Vision-Language Understanding

投稿日: 2025年3月21日作成者: jarxiv

要約 3Dビジョン言語学習の進歩は、大規模な3Dデータセットの希少性によって妨げ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Rapid patient-specific neural networks for intraoperative X-ray to volume registration

投稿日: 2025年3月21日作成者: jarxiv

要約画像誘導介入における人工知能の統合は、変革の可能性をもたらし、複雑な手順中 … 続きを読む →

カテゴリー: cs.CV, eess.IV, physics.med-ph | コメントを受け付けていません

Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction

投稿日: 2025年3月21日作成者: jarxiv

要約 Dust3Rは最近、カメラの内在性と外的性論の推定、3Dのシーンの再構築、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Multi-Modal Foundation Models for Computational Pathology: A Survey

投稿日: 2025年3月21日作成者: jarxiv

要約基礎モデルは、計算病理学（CPATH）の強力なパラダイムとして浮上し、組織 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Ultra-Resolution Adaptation with Ease

投稿日: 2025年3月21日作成者: jarxiv

要約テキストからイメージへの拡散モデルは、近年顕著な進歩を遂げています。ただ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images

投稿日: 2025年3月21日作成者: jarxiv

要約 3Dガウススプラッティング（3DGS）は、印象的な新規ビューの合成パフォー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners

投稿日: 2025年3月21日作成者: jarxiv

要約知識編集（KE）により、大規模な言語モデル（LLM）で時代遅れまたは誤った … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

月別アーカイブ: 2025年3月

Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation

SceneMI: Motion In-betweening for Modeling Human-Scene Interactions

Unleashing Vecset Diffusion Model for Fast Shape Generation

Unifying 2D and 3D Vision-Language Understanding

Rapid patient-specific neural networks for intraoperative X-ray to volume registration

Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction

Multi-Modal Foundation Models for Computational Pathology: A Survey

Ultra-Resolution Adaptation with Ease

Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images

CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners

最近の投稿

最近のコメント

アーカイブ

カテゴリー