Archive of posts by "jarxiv"

Deep Learning for Retinal Degeneration Assessment: A Comprehensive Analysis of the MARIO AMD Progression Challenge

Posted: June 4, 2025 | Author: jarxiv

Abstract: The MARIO challenge, held at MICCAI 2024, optical coherence tomography (O …

Categories: cs.AI, cs.CV

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters

Posted: June 4, 2025 | Author: jarxiv

Abstract: Audio-driven human animation has advanced considerably in recent years. However, ( …

Categories: cs.CV

Astrophotography turbulence mitigation via generative models

Posted: June 4, 2025 | Author: jarxiv

Abstract: Photography is the cornerstone of modern astronomy and space research. However, with ground-based telescopes …

Categories: cs.CV, eess.IV

Learning on Model Weights using Tree Experts

Posted: June 4, 2025 | Author: jarxiv

Abstract: The number of publicly available models is growing rapidly, yet most of them are undocumented …

Categories: cs.CV, cs.LG

PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples

Posted: June 4, 2025 | Author: jarxiv

Abstract: We present PartComposer: text-to-image diffusion models …

Categories: cs.CV, cs.GR

DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models

Posted: June 4, 2025 | Author: jarxiv

Abstract: With the rapid advance of generative models, the realism of AI-generated images has improved markedly …

Categories: cs.CV

Smartflow: Enabling Scalable Spatiotemporal Geospatial Research

Posted: June 4, 2025 | Author: jarxiv

Abstract: BlackSky, built on open-source tools and technologies, …

Categories: cs.AI, cs.CV

We Should Chart an Atlas of All the World’s Models

Posted: June 4, 2025 | Author: jarxiv

Abstract: Public model repositories now contain millions of models, yet most …

Categories: cs.CL, cs.CV, cs.LG

Adversarial Robustness of AI-Generated Image Detectors in the Real World

Posted: June 4, 2025 | Author: jarxiv

Abstract: Generative artificial intelligence (GenAI) capabilities …

Categories: cs.CV, cs.LG

Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

Posted: June 4, 2025 | Author: jarxiv

Abstract: Diffusion Transformers (DiT) have achieved breakthroughs in video generation, but this long seq …

Categories: cs.AI, cs.CV, cs.LG
