投稿者「jarxiv」のアーカイブ

Automated Measurement of Eczema Severity with Self-Supervised Learning

投稿日: 2025年4月22日作成者: jarxiv

要約デジタルカメラから取得した画像を使用した湿疹の自動診断により、個人は回復を … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS’s LLM-CLIP Framework for Image Captioning

投稿日: 2025年4月22日作成者: jarxiv

要約 MILS（Multimodal Iterative LLM Solver） … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.PF | コメントを受け付けていません

DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation

投稿日: 2025年4月22日作成者: jarxiv

要約テキストからイメージ（T2I）拡散モデルの普及により、テキストの説明から高 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Shape-Guided Clothing Warping for Virtual Try-On

投稿日: 2025年4月22日作成者: jarxiv

要約画像ベースのVirtual Try-Onは、ポーズの一貫性を維持しながら、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam

投稿日: 2025年4月22日作成者: jarxiv

要約生態学的健康と保全の取り組みにとって、水生生物多様性の理解と監視が重要です … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation

投稿日: 2025年4月22日作成者: jarxiv

要約デジタルモデリングと人間の顔の再構築は、さまざまなアプリケーションに役立ち … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Revealing the 3D Cosmic Web through Gravitationally Constrained Neural Fields

投稿日: 2025年4月22日作成者: jarxiv

要約弱い重力レンズは、主に宇宙の暗黒物質の重力効果によって引き起こされる銀河形 … 続きを読む →

カテゴリー: astro-ph.CO, cs.CV | コメントを受け付けていません

Diffusion Bridge Models for 3D Medical Image Translation

投稿日: 2025年4月22日作成者: jarxiv

要約拡散テンソルイメージング（DTI）は、人間の脳の微細構造に関する重要な洞察 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

投稿日: 2025年4月22日作成者: jarxiv

要約大規模なマルチモーダルモデル（LMM）は、ビデオフレームを均一に知覚し、本 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

投稿日: 2025年4月22日作成者: jarxiv

要約長いコンテキストマルチモーダル学習のために、フロンティアビジョンモデル（V … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Automated Measurement of Eczema Severity with Self-Supervised Learning

Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS’s LLM-CLIP Framework for Image Captioning

DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation

Shape-Guided Clothing Warping for Virtual Try-On

SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam

Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation

Revealing the 3D Cosmic Web through Gravitationally Constrained Neural Fields

Diffusion Bridge Models for 3D Medical Image Translation

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー