「cs.CV」カテゴリーアーカイブ

Is Large-Scale Pretraining the Secret to Good Domain Generalization?

投稿日: 2025年1月24日作成者: jarxiv

要約マルチソースドメイン一般化（DG）は、複数のソースドメインでトレーニングし … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

On Disentangled Training for Nonlinear Transform in Learned Image Compression

投稿日: 2025年1月24日作成者: jarxiv

要約学習済み画像圧縮 (LIC) は、従来のコーデックと比較して優れたレート歪 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Solving the long-tailed distribution problem by exploiting the synergies and balance of different techniques

投稿日: 2025年1月24日作成者: jarxiv

要約現実世界のデータでは、ロングテールのデータ分布が一般的であるため、経験に基 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Invariance Principle Meets Vicinal Risk Minimization

投稿日: 2025年1月24日作成者: jarxiv

要約深層学習モデルはコンピュータービジョンタスクでは優れていますが、多くの … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

投稿日: 2025年1月24日作成者: jarxiv

要約ディープニューラルネットワーク (DNN) は、さまざまな画像セグメン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models

投稿日: 2025年1月24日作成者: jarxiv

要約既存のゼロショット時間的アクション検出（ZSTAD）メソッドは、目に見えな … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments

投稿日: 2025年1月24日作成者: jarxiv

要約理想的な条件下での単眼深度の推定を改善するためにかなりの努力が払われていま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting

投稿日: 2025年1月24日作成者: jarxiv

要約この論文では、3D ガウススプラッティング (3DGS) を使用した陰的 … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs

投稿日: 2025年1月24日作成者: jarxiv

要約 Apple Vision Pro などの最近の高度な仮想現実 (VR) ヘ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

投稿日: 2025年1月24日作成者: jarxiv

要約複雑な現実世界のシナリオで人々の社会的相互作用を理解することは、しばしば複 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Is Large-Scale Pretraining the Secret to Good Domain Generalization?

On Disentangled Training for Nonlinear Transform in Learned Image Compression

Solving the long-tailed distribution problem by exploiting the synergies and balance of different techniques

Invariance Principle Meets Vicinal Risk Minimization

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models

PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments

3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting

EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

最近の投稿

最近のコメント

アーカイブ

カテゴリー