"cs.CV" Category Archive

Imperceptible Protection against Style Imitation from Diffusion Models

Posted: August 29, 2024 | Author: jarxiv

Abstract: Recent advances in diffusion models have greatly improved the fidelity of image generation, but …

Categories: cs.CV

What is YOLOv8: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector

Posted: August 29, 2024 | Author: jarxiv

Abstract: This study presents a detailed analysis of the YOLOv8 object detection model and its …

Categories: cs.CV

Provable Probabilistic Imaging using Score-Based Generative Priors

Posted: August 29, 2024 | Author: jarxiv

Abstract: Estimating high-quality images while quantifying their uncertainty is, for ill-posed inverse …

Categories: cs.CV, cs.LG, eess.IV

microYOLO: Towards Single-Shot Object Detection on Microcontrollers

Posted: August 29, 2024 | Author: jarxiv

Abstract: In this work-in-progress paper, single … on microcontrollers using YOLO …

Categories: cs.AI, cs.CV, cs.LG

GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model

Posted: August 29, 2024 | Author: jarxiv

Abstract: For autonomous driving training, a variety of traffic conditions, weather scenarios, and road types …

Categories: cs.AI, cs.CV

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Posted: August 29, 2024 | Author: jarxiv

Abstract: In this paper, audio- and language-referenced video object segmentation …

Categories: cs.CV

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Posted: August 29, 2024 | Author: jarxiv

Abstract: By distilling knowledge from a large-scale MLLM (l-MLLM), a small-scale …

Categories: cs.CV

SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors

Posted: August 29, 2024 | Author: jarxiv

Abstract: Accurate segmentation of 3D clinical medical images, in the diagnosis and treatment of spinal diseases, …

Categories: cs.CV, eess.IV

Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data

Posted: August 29, 2024 | Author: jarxiv

Abstract: When neuroimaging datasets from multiple sites and scanners are combined, statistical …

Categories: cs.CV

Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones

Posted: August 29, 2024 | Author: jarxiv

Abstract: Gen-Swarms leverages the capabilities of deep generative models, reactive …

Categories: cs.CV, cs.RO
