月別アーカイブ: 2025年2月

LP-DETR: Layer-wise Progressive Relations for Object Detection

投稿日: 2025年2月12日作成者: jarxiv

要約このホワイトペーパーでは、マルチスケールリレーションモデリングを通じてDE … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM

投稿日: 2025年2月12日作成者: jarxiv

要約自動亀裂セグメンテーションは、交通安全維持と構造の完全性システムにおけるイ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification

投稿日: 2025年2月12日作成者: jarxiv

要約堅牢なクロスモーダル機能で知られる視覚言語モデルは、さまざまなコンピュータ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

投稿日: 2025年2月12日作成者: jarxiv

要約複雑なセマンティック環境と破損した画像の多様な穴パターンを完了するための入 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

投稿日: 2025年2月12日作成者: jarxiv

要約 Audio-Visuual Speech Speech Septureat … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS | コメントを受け付けていません

Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models

投稿日: 2025年2月12日作成者: jarxiv

要約テキストからイメージ（T2I）拡散モデルは、印象的な画像生成機能を実証して … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

From Pixels to Components: Eigenvector Masking for Visual Representation Learning

投稿日: 2025年2月12日作成者: jarxiv

要約画像の目に見える部分からマスクされた予測は、視覚表現学習のための強力な自己 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

MRAnnotator: multi-Anatomy and many-Sequence MRI segmentation of 44 structures

投稿日: 2025年2月12日作成者: jarxiv

要約このレトロスペクティブ研究では、2つのデータセットで44の構造に注釈を付け … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Multiview Point Cloud Registration Based on Minimum Potential Energy for Free-Form Blade Measurement

投稿日: 2025年2月12日作成者: jarxiv

要約ポイントクラウド登録は、産業測定におけるフリーフォームブレードの再構築に不 … 続きを読む →

カテゴリー: cs.CG, cs.CV | コメントを受け付けていません

Matrix3D: Large Photogrammetry Model All-in-One

投稿日: 2025年2月12日作成者: jarxiv

要約同じモデルを使用して、ポーズ推定、深度予測、新しいビュー合成を含むいくつか … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年2月

LP-DETR: Layer-wise Progressive Relations for Object Detection

FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM

CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models

From Pixels to Components: Eigenvector Masking for Visual Representation Learning

MRAnnotator: multi-Anatomy and many-Sequence MRI segmentation of 44 structures

Multiview Point Cloud Registration Based on Minimum Potential Energy for Free-Form Blade Measurement

Matrix3D: Large Photogrammetry Model All-in-One

最近の投稿

最近のコメント

アーカイブ

カテゴリー