「cs.CV」カテゴリーアーカイブ

Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding

投稿日: 2025年3月17日作成者: jarxiv

要約自動手術ワークフロー分析は、教育、研究、臨床的意思決定には重要ですが、注釈 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger

投稿日: 2025年3月17日作成者: jarxiv

要約現在、サンプル固有のバックドア攻撃（SSBA）は、現在のバックドア防御のほ … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV, cs.LG | コメントを受け付けていません

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification

投稿日: 2025年3月17日作成者: jarxiv

要約拡散モデルは、さまざまな画像生成タスクで顕著な進歩を達成しています。ただ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving

投稿日: 2025年3月17日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）は、ドメインに依存しない世界知識と … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models

投稿日: 2025年3月17日作成者: jarxiv

要約セマンティックの透かしの方法により、初期潜在ノイズを変更するだけで、潜在拡 … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV | コメントを受け付けていません

LuSeg: Efficient Negative and Positive Obstacles Segmentation via Contrast-Driven Multi-Modal Feature Fusion on the Lunar

投稿日: 2025年3月17日作成者: jarxiv

要約月の探査ミッションがますます複雑になるにつれて、安全で自律的なローバーベー … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

MTV-Inpaint: Multi-Task Long Video Inpainting

投稿日: 2025年3月17日作成者: jarxiv

要約ビデオの開始には、ビデオ内のローカル領域を変更し、空間的および時間的な一貫 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Category Prompt Mamba Network for Nuclei Segmentation and Classification

投稿日: 2025年3月17日作成者: jarxiv

要約核のセグメンテーションと分類は、腫瘍免疫微小環境分析に不可欠な基盤を提供し … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

投稿日: 2025年3月17日作成者: jarxiv

要約長型のビデオ理解は、ビデオデータの冗長性が高いことと、クエリと関係のある情 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration

投稿日: 2025年3月17日作成者: jarxiv

要約水中環境は、視認性が限られていること、不十分な照明、および画像の構造的特徴 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding

Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification

A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving

Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models

LuSeg: Efficient Negative and Positive Obstacles Segmentation via Contrast-Driven Multi-Modal Feature Fusion on the Lunar

MTV-Inpaint: Multi-Task Long Video Inpainting

Category Prompt Mamba Network for Nuclei Segmentation and Classification

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

AQUA-SLAM: Tightly-Coupled Underwater Acoustic-Visual-Inertial SLAM with Sensor Calibration

最近の投稿

最近のコメント

アーカイブ

カテゴリー