投稿者「jarxiv」のアーカイブ

Comparative Analysis of Machine Learning Models for Lung Cancer Mutation Detection and Staging Using 3D CT Scans

投稿日: 2025年5月29日作成者: jarxiv

要約肺がんは世界中の癌死亡率の主な原因であり、重要な突然変異と病期分類を検出す … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning

投稿日: 2025年5月29日作成者: jarxiv

要約画像セグメンテーションのためのマルチモーダル大規模モデルを活用することは、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

An Effective Training Framework for Light-Weight Automatic Speech Recognition Models

投稿日: 2025年5月29日作成者: jarxiv

要約深い学習における最近の進歩により、計算およびメモリの制約を無視しながら有望 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Adversarially Robust AI-Generated Image Detection for Free: An Information Theoretic Perspective

投稿日: 2025年5月29日作成者: jarxiv

要約人工知能生成画像（AIGI）の急速な進歩により、偽造や誤った情報などの悪意 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Chest Disease Detection In X-Ray Images Using Deep Learning Classification Method

投稿日: 2025年5月29日作成者: jarxiv

要約この作業では、複数の分類モデルのパフォーマンスを調査して、胸部X線画像をC … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

投稿日: 2025年5月29日作成者: jarxiv

要約テキスト間拡散モデルの採用は、信頼性に対する懸念を引き起こし、キャリブレー … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

投稿日: 2025年5月29日作成者: jarxiv

要約画像の復帰は、さまざまなマルチモーダルタスクの品質が向上したトレーニングデ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization

投稿日: 2025年5月29日作成者: jarxiv

要約ニューラルレンダリング方法は、2D画像から3Dシーンを再構築する能力に大き … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image Captions

投稿日: 2025年5月29日作成者: jarxiv

要約密に注釈付きの画像キャプションは、堅牢な視覚系のアラインメントの学習を大幅 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Spatial Knowledge Graph-Guided Multimodal Synthesis

投稿日: 2025年5月29日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）の最近の進歩により、能力が大幅に向 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Comparative Analysis of Machine Learning Models for Lung Cancer Mutation Detection and Staging Using 3D CT Scans

SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning

An Effective Training Framework for Light-Weight Automatic Speech Recognition Models

Adversarially Robust AI-Generated Image Detection for Free: An Information Theoretic Perspective

Chest Disease Detection In X-Ray Images Using Deep Learning Classification Method

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization

Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image Captions

Spatial Knowledge Graph-Guided Multimodal Synthesis

最近の投稿

最近のコメント

アーカイブ

カテゴリー