「cs.CV」カテゴリーアーカイブ

A Sociotechnical Lens for Evaluating Computer Vision Models: A Case Study on Detecting and Reasoning about Gender and Emotion

投稿日: 2024年11月22日作成者: jarxiv

要約コンピュータービジョン (CV) テクノロジーの進化の状況において、画像 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.CY, cs.HC | コメントを受け付けていません

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation

投稿日: 2024年11月22日作成者: jarxiv

要約既存のフィードフォワード画像から 3D への手法は、主に 2D マルチビュ … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Enhancing Diagnostic Precision in Gastric Bleeding through Automated Lesion Segmentation: A Deep DuS-KFCM Approach

投稿日: 2024年11月22日作成者: jarxiv

要約内視鏡画像における胃出血のタイムリーかつ正確な分類とセグメント化は、胃合併 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

投稿日: 2024年11月22日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) の最近の進歩により、ビデオ理 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Multimodal Autoregressive Pre-training of Large Vision Encoders

投稿日: 2024年11月22日作成者: jarxiv

要約大規模ビジョンエンコーダの事前トレーニングのための新しい方法を紹介します。 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Adversarial Poisoning Attack on Quantum Machine Learning Models

投稿日: 2024年11月22日作成者: jarxiv

要約量子機械学習 (QML) への関心が高まり、クラウドプロバイダーを通じて … 続きを読む →

カテゴリー: cs.CR, cs.CV, quant-ph | コメントを受け付けていません

Multimodal 3D Brain Tumor Segmentation with Adversarial Training and Conditional Random Field

投稿日: 2024年11月22日作成者: jarxiv

要約神経膠腫の構造の複雑さと大きな個体差により、脳腫瘍を正確にセグメンテーショ … 続きを読む →

カテゴリー: 15-11, cs.CV, eess.IV, I.4.6 | コメントを受け付けていません

Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model

投稿日: 2024年11月22日作成者: jarxiv

要約マルチモーダル言語モデル (MLLM) は現実世界の環境でますます適用され … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation

投稿日: 2024年11月22日作成者: jarxiv

要約動的シーンのリアルなシミュレーションには、さまざまなマテリアル特性を正確に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Revisiting the Integration of Convolution and Attention for Vision Backbone

投稿日: 2024年11月22日作成者: jarxiv

要約コンボリューション (Convs) とマルチヘッドセルフアテンション … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

A Sociotechnical Lens for Evaluating Computer Vision Models: A Case Study on Detecting and Reasoning about Gender and Emotion

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation

Enhancing Diagnostic Precision in Gastric Bleeding through Automated Lesion Segmentation: A Deep DuS-KFCM Approach

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

Multimodal Autoregressive Pre-training of Large Vision Encoders

Adversarial Poisoning Attack on Quantum Machine Learning Models

Multimodal 3D Brain Tumor Segmentation with Adversarial Training and Conditional Random Field

Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model

Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation

Revisiting the Integration of Convolution and Attention for Vision Backbone

最近の投稿

最近のコメント

アーカイブ

カテゴリー