「cs.CV」カテゴリーアーカイブ

Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors

投稿日: 2025年2月12日作成者: jarxiv

要約 3D Gaussian Splatting（3DGS）は、高速トレーニング … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

投稿日: 2025年2月12日作成者: jarxiv

要約前例のないスケールでのトレーニング前のビジョン言語モデルの可能性についての … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Towards Single-Lens Controllable Depth-of-Field Imaging via Depth-Aware Point Spread Functions

投稿日: 2025年2月12日作成者: jarxiv

要約制御可能なディープオブフィールド（DOF）イメージングは、一般に、重く … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV, physics.optics | コメントを受け付けていません

Causal-Informed Contrastive Learning: Towards Bias-Resilient Pre-training under Concept Drift

投稿日: 2025年2月12日作成者: jarxiv

要約最上層データセットによって推進される大規模な対照的なトレーニングの進化は、 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving

投稿日: 2025年2月12日作成者: jarxiv

要約環境とその変化を長期にわたって知覚することは、セマンティクスと動きという2 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LP-DETR: Layer-wise Progressive Relations for Object Detection

投稿日: 2025年2月12日作成者: jarxiv

要約このホワイトペーパーでは、マルチスケールリレーションモデリングを通じてDE … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM

投稿日: 2025年2月12日作成者: jarxiv

要約自動亀裂セグメンテーションは、交通安全維持と構造の完全性システムにおけるイ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification

投稿日: 2025年2月12日作成者: jarxiv

要約堅牢なクロスモーダル機能で知られる視覚言語モデルは、さまざまなコンピュータ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

投稿日: 2025年2月12日作成者: jarxiv

要約複雑なセマンティック環境と破損した画像の多様な穴パターンを完了するための入 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

投稿日: 2025年2月12日作成者: jarxiv

要約 Audio-Visuual Speech Speech Septureat … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Towards Single-Lens Controllable Depth-of-Field Imaging via Depth-Aware Point Spread Functions

Causal-Informed Contrastive Learning: Towards Bias-Resilient Pre-training under Concept Drift

Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving

LP-DETR: Layer-wise Progressive Relations for Object Detection

FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM

CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー