"cs.CV" Category Archive

What is YOLOv6? A Deep Insight into the Object Detection Model

Posted: December 18, 2024 | Author: jarxiv

Abstract: This work examines the YOLOv6 object detection model in detail, its design …

Categories: cs.CV

Can Generative Models Improve Self-Supervised Representation Learning?

Posted: December 18, 2024 | Author: jarxiv

Abstract: With rapid advances in self-supervised representation learning, leveraging unlabeled data for rich …

Categories: cs.CV, cs.LG

Measurement of Medial Elbow Joint Space using Landmark Detection

Posted: December 18, 2024 | Author: jarxiv

Abstract: Ultrasound imaging of the medial elbow, for identifying ulnar collateral ligament (UCL) injuries early …

Categories: cs.CV

A New Adversarial Perspective for LiDAR-based 3D Object Detection

Posted: December 18, 2024 | Author: jarxiv

Abstract: Autonomous vehicles (AVs), for environmental perception and decision-making in driving scenarios, …

Categories: cs.CV

NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation

Posted: December 18, 2024 | Author: jarxiv

Abstract: Built on top of two popular datasets (R2R and RxR), we …

Categories: cs.CL, cs.CV

Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective

Posted: December 18, 2024 | Author: jarxiv

Abstract: Recent advances in artificial intelligence (AI), especially in foundation models (FMs) and their self-supervised …

Categories: cs.CV

EOGS: Gaussian Splatting for Earth Observation

Posted: December 18, 2024 | Author: jarxiv

Abstract: Gaussian splatting has recently emerged as a powerful alternative to NeRF, …

Categories: cs.CV

SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers

Posted: December 18, 2024 | Author: jarxiv

Abstract: Scalable Vector Graphics (SVG), with their resolution independence and …

Categories: cs.AI, cs.CV, cs.GR

Modality-Inconsistent Continual Learning of Multimodal Large Language Models

Posted: December 18, 2024 | Author: jarxiv

Abstract: In this paper, inconsistent modalities (image, audio, or video) and …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.SD, eess.AS

FunEditor: Achieving Complex Image Edits via Function Aggregation with Diffusion Models

Posted: December 18, 2024 | Author: jarxiv

Abstract: Diffusion models have demonstrated excellent performance on generative tasks, and image editing …

Categories: cs.CV
