Archive of posts by "jarxiv"

Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Online reconstruction of dynamic scenes, whereas existing offline dynamic reconstruction methods use recorded …

Categories: cs.CV

CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Recent advances in audio-visual learning, for learning representations across modalities, …

Categories: cs.CV, cs.MM, cs.SD, eess.AS

Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Accurate lung tumor segmentation, for improving diagnosis, treatment planning, and patient outcomes in oncology, …

Categories: cs.CV, eess.IV

Fusing Foveal Fixations Using Linear Retinal Transformations and Bayesian Experimental Design

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Humans (and many vertebrates) fuse multiple fixations of a scene to obtain a representation of the whole …

Categories: cs.CV, cs.LG

CAMELTrack: Context-Aware Multi-cue ExpLoitation for Online Multi-Object Tracking

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Online multi-object tracking, tracklet representation, feature fu …

Categories: cs.CV, cs.LG

FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Movie dubbing, while preserving the vocal timbre of a given short reference audio, …

Categories: cs.CV, cs.MM, cs.SD, eess.AS

Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Diffusion-based adversarial purification methods use the forward process to make adversarial perturbations part of an isotropic noise …

Categories: cs.CV

MASH: Masked Anchored SpHerical Distances for 3D Shape Representation and Generation

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Our novel multi-view parametric representation of 3D shapes, Masked …

Categories: cs.CG, cs.CV

A Neural Architecture Search Method using Auxiliary Evaluation Metric based on ResNet Architecture

Posted on: May 5, 2025 | Author: jarxiv

Abstract: In this paper, using ResNet as the framework, a neural architecture …

Categories: cs.CV, cs.NE

FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors

Posted on: May 5, 2025 | Author: jarxiv

Abstract: Text-driven object insertion in 3D scenes, for intuitive natural-language …

Categories: cs.CV
