Archive of posts by "jarxiv"

Using Foundation Models as Pseudo-Label Generators for Pre-Clinical 4D Cardiac CT Segmentation

Posted on May 15, 2025 by jarxiv

Abstract: Cardiac image segmentation, for many cardiac image analyses and the motion of cardiac mechanics …

Categories: cs.CV

Meta-learning Slice-to-Volume Reconstruction in Fetal Brain MRI using Implicit Neural Representations

Posted on May 15, 2025 by jarxiv

Abstract: High-resolution slice-to-volume reconstruction from multiple motion-corrupted low-resolution 2D slices …

Categories: cs.AI, cs.CV, eess.IV

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Posted on May 15, 2025 by jarxiv

Abstract: Unified image understanding and generation has attracted attention in recent research on multimodal models …

Categories: cs.AI, cs.CV

Don’t Forget your Inverse DDIM for Image Editing

Posted on May 15, 2025 by jarxiv

Abstract: The field of text-to-image generation has made great progress with the introduction of diffusion models …

Categories: cs.CV, I.2.10

Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from Videos

Posted on May 15, 2025 by jarxiv

Abstract: Human motion capture from monocular videos has made significant progress in recent years …

Categories: cs.CV

Variational Visual Question Answering

Posted on May 15, 2025 by jarxiv

Abstract: Despite remarkable progress in multimodal models for visual question answering (VQA) …

Categories: cs.AI, cs.CV

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Posted on May 15, 2025 by jarxiv

Abstract: Thanks to recent advances in deep-thinking models, mathematical and coding tasks have seen remarkable …

Categories: cs.CL, cs.CV

LightLab: Controlling Light Sources in Images with Diffusion Models

Posted on May 15, 2025 by jarxiv

Abstract: For fine-grained parametric control over light sources in an image, a simple yet …

Categories: cs.CV, cs.GR

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Posted on May 15, 2025 by jarxiv

Abstract: Audio-Visual Video Parsing (AVVP) involves both unimodal events …

Categories: cs.CV, cs.SD, eess.AS

Sensitivity-Constrained Fourier Neural Operators for Forward and Inverse Problems in Parametric Differential Equations

Posted on May 15, 2025 by jarxiv

Abstract: Parametric differential equations of the form du/dt = f(u, x, t, p) …

Categories: cs.CE, cs.LG
