「cs.CV」カテゴリーアーカイブ

Deep neural network-based detection of counterfeit products from smartphone images

投稿日: 2024年11月7日作成者: jarxiv

要約医薬品やワクチンなどの偽造品や、ファッション性の高いハンドバッグ、時計、宝 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models

投稿日: 2024年11月7日作成者: jarxiv

要約事前学習済み拡散モデル (DM) は、最近、逆問題 (IP) を解く際に広 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models

投稿日: 2024年11月7日作成者: jarxiv

要約雑然とした混雑した人間中心の環境でダイナミックな人々を追跡することは、オク … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Multi-branch Spatio-Temporal Graph Neural Network For Efficient Ice Layer Thickness Prediction

投稿日: 2024年11月7日作成者: jarxiv

要約極地の氷層の時空間パターンを理解することは、氷床のバランスの変化を追跡し、 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction

投稿日: 2024年11月7日作成者: jarxiv

要約 SPAD アレイなどの Quanta イメージセンサーは、数ナノ秒という … 続きを読む →

カテゴリー: 68T45, cs.CV, cs.LG, eess.IV, I.2.10 | コメントを受け付けていません

Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning

投稿日: 2024年11月7日作成者: jarxiv

要約ビデオのキャプションは、ビデオの内容を説明する文章を生成します。既存の方 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

H-POPE: Hierarchical Polling-based Probing Evaluation of Hallucinations in Large Vision-Language Models

投稿日: 2024年11月7日作成者: jarxiv

要約ラージビジョン言語モデル (LVLM) は、テキストと画像の両方を活用す … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation

投稿日: 2024年11月7日作成者: jarxiv

要約テキストからモーションの生成は、コンピュータービジョンにおいて重要なタス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities

投稿日: 2024年11月7日作成者: jarxiv

要約 3D ガウススプラッティング (3DGS) の最近の進歩により、3D 頭 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models

投稿日: 2024年11月7日作成者: jarxiv

要約微調整された視覚言語モデル (VLM) は、画像の特徴とテキスト属性の間の … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Deep neural network-based detection of counterfeit products from smartphone images

DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models

LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models

Multi-branch Spatio-Temporal Graph Neural Network For Efficient Ice Layer Thickness Prediction

bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction

Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning

H-POPE: Hierarchical Polling-based Probing Evaluation of Hallucinations in Large Vision-Language Models

Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation

Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities

RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー