Monthly Archives: May 2025

FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing

Posted: May 5, 2025 | Author: jarxiv

Summary: Movie dubbing, while preserving the vocal timbre of a given short reference audio, …

Categories: cs.CV, cs.MM, cs.SD, eess.AS

Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain

Posted: May 5, 2025 | Author: jarxiv

Summary: Diffusion-based adversarial purification methods use the forward process to fold adversarial perturbations into isotropic noise …

Categories: cs.CV

MASH: Masked Anchored SpHerical Distances for 3D Shape Representation and Generation

Posted: May 5, 2025 | Author: jarxiv

Summary: A novel multi-view parametric representation of 3D shapes, Masked …

Categories: cs.CG, cs.CV

A Neural Architecture Search Method using Auxiliary Evaluation Metric based on ResNet Architecture

Posted: May 5, 2025 | Author: jarxiv

Summary: In this paper, with ResNet as the framework, a neural architec …

Categories: cs.CV, cs.NE

FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors

Posted: May 5, 2025 | Author: jarxiv

Summary: Text-driven object insertion in 3D scenes is, through natural language, an intuitive …

Categories: cs.CV

Soybean Disease Detection via Interpretable Hybrid CNN-GNN: Integrating MobileNetV2 and GraphSAGE with Cross-Modal Attention

Posted: May 5, 2025 | Author: jarxiv

Summary: Detecting soybean leaf diseases is important for agricultural productivity, but conventional methods visually …

Categories: cs.CV, cs.LG

Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks

Posted: May 5, 2025 | Author: jarxiv

Summary: In most existing RGB-D semantic segmentation methods, complex cross …

Categories: cs.CV

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Posted: May 5, 2025 | Author: jarxiv

Summary: Large language models (LLMs), by performing more reasoning, show enhanced capabilities and …

Categories: cs.CV

Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

Posted: May 5, 2025 | Author: jarxiv

Summary: Morphometric measures derived from spinal cord segmentation, for neurological … affecting the spinal cord, …

Categories: cs.CV

Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing

Posted: May 5, 2025 | Author: jarxiv

Summary: In this paper, the challenge of mapping polygonal buildings from remote sensing imagery …

Categories: cs.CV, cs.LG
