「cs.AI」カテゴリーアーカイブ

Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework

投稿日: 2025年1月20日作成者: jarxiv

要約リモートセンシング変化キャプション (RSICC) は、両時間画像間の変 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

VLSBench: Unveiling Visual Leakage in Multimodal Safety

投稿日: 2025年1月20日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) の安全性に関する懸念は、さま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.CV | コメントを受け付けていません

landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images

投稿日: 2025年1月20日作成者: jarxiv

要約 2D/3D 画像における解剖学的ランドマークの位置特定は、医療画像処理にお … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Universal Actions for Enhanced Embodied Foundation Models

投稿日: 2025年1月20日作成者: jarxiv

要約多様なインターネット規模のデータでのトレーニングは、最近の大規模な基盤モデ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding

投稿日: 2025年1月20日作成者: jarxiv

要約 Tarsier2 は、詳細かつ正確なビデオ説明を生成するために設計された最 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking

投稿日: 2025年1月20日作成者: jarxiv

要約マルチオブジェクト追跡の領域では、ビデオシーケンス内のオブジェクト間の空 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style

投稿日: 2025年1月20日作成者: jarxiv

要約電子商取引の製品背景を生成する最先端の方法は、制作をスケールアップする際に … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

投稿日: 2025年1月20日作成者: jarxiv

要約高解像度拡散モデルを加速するための新しいオートエンコーダーモデルファミ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Bridging Diversity and Uncertainty in Active learning with Self-Supervised Pre-Training

投稿日: 2025年1月20日作成者: jarxiv

要約この研究は、特に自己教師付き事前トレーニング済みモデルのコンテキスト内での … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results

投稿日: 2025年1月20日作成者: jarxiv

要約 2025 年海洋コンピュータビジョン (MaCVi) に関する第 3 回 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework

VLSBench: Unveiling Visual Leakage in Multimodal Safety

landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images

Universal Actions for Enhanced Embodied Foundation Models

Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding

Spatio-temporal Graph Learning on Adaptive Mined Key Frames for High-performance Multi-Object Tracking

Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Bridging Diversity and Uncertainty in Active learning with Self-Supervised Pre-Training

3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results

最近の投稿

最近のコメント

アーカイブ

カテゴリー