投稿者「jarxiv」のアーカイブ

Explainable AI-Enhanced Deep Learning for Pumpkin Leaf Disease Detection: A Comparative Analysis of CNN Architectures

投稿日: 2025年4月11日作成者: jarxiv

要約カボチャの葉の病気は、農業の生産性に対する重大な脅威であり、効果的な管理の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Zero-Shot Low-dose CT Denoising via Sinogram Flicking

投稿日: 2025年4月11日作成者: jarxiv

要約多くの低用量のCTイメージング方法は、監視された学習に依存しており、これに … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

投稿日: 2025年4月11日作成者: jarxiv

要約この論文では、知識の蒸留なしで純粋に自己改善に依存して、トレーニングサンプ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos

投稿日: 2025年4月11日作成者: jarxiv

要約 360 {\ deg}ビデオは、ダイナミックな視覚世界を表す有望な媒体とし … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation

投稿日: 2025年4月11日作成者: jarxiv

要約現在の少数のショットセグメンテーションの文献には、クエリとサンプル画像の視 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HoloPart: Generative 3D Part Amodal Segmentation

投稿日: 2025年4月11日作成者: jarxiv

要約 3D部品のアモーダルセグメンテーション – 3D形状を完全で意 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces

投稿日: 2025年4月11日作成者: jarxiv

要約漫画のアバターは、ソーシャルメディア、オンラインチューター、ゲームなど、さ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Taming Data and Transformers for Scalable Audio Generation

投稿日: 2025年4月11日作成者: jarxiv

要約アンビエントサウンドジェネレーターのスケーラビリティは、データ不足、キャプ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians

投稿日: 2025年4月11日作成者: jarxiv

要約デジタルアバターのコミュニティからの関心が高まっているため、コミュニケーシ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

投稿日: 2025年4月11日作成者: jarxiv

要約マルチモーダル信号を通じて世界を効果的に知覚できる汎用モデルの構築は、長年 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Explainable AI-Enhanced Deep Learning for Pumpkin Leaf Disease Detection: A Comparative Analysis of CNN Architectures

Zero-Shot Low-dose CT Denoising via Sinogram Flicking

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos

MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation

HoloPart: Generative 3D Part Amodal Segmentation

GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces

Taming Data and Transformers for Scalable Audio Generation

InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー