Category archive: cs.CV

DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models

Posted: June 4, 2025 | Author: jarxiv

Abstract: With the rapid progress of generative models, the realism of AI-generated images has improved significantly …

Categories: cs.CV

Smartflow: Enabling Scalable Spatiotemporal Geospatial Research

Posted: June 4, 2025 | Author: jarxiv

Abstract: BlackSky, built on open-source tools and technologies, …

Categories: cs.AI, cs.CV

We Should Chart an Atlas of All the World’s Models

Posted: June 4, 2025 | Author: jarxiv

Abstract: Public model repositories now contain millions of models, yet most …

Categories: cs.CL, cs.CV, cs.LG

Adversarial Robustness of AI-Generated Image Detectors in the Real World

Posted: June 4, 2025 | Author: jarxiv

Abstract: Generative artificial intelligence (GenAI) capabilities …

Categories: cs.CV, cs.LG

Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

Posted: June 4, 2025 | Author: jarxiv

Abstract: Diffusion Transformers (DiTs) have achieved breakthroughs in video generation, but this long …

Categories: cs.AI, cs.CV, cs.LG

SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Posted: June 4, 2025 | Author: jarxiv

Abstract: To comprehensively understand real-world 3D scenes, arbitrary or previously …

Categories: cs.CV

Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled Data

Posted: June 4, 2025 | Author: jarxiv

Abstract: In this paper, while reducing reliance on large-scale labeled datasets, source …

Categories: cs.CV

EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models

Posted: June 4, 2025 | Author: jarxiv

Abstract: Text-to-image generation models (e.g., Stable Diffusion) …

Categories: cs.CV

SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification

Posted: June 4, 2025 | Author: jarxiv

Abstract: For ecological monitoring and species identification, fine-grained bird image classification (FBIC) is of great …

Categories: cs.AI, cs.CV

LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM

Posted: June 4, 2025 | Author: jarxiv

Abstract: Modern Gaussian Splatting methods, for real-time photorealistic rendering of 3D scenes, …

Categories: cs.CV

