Archive of posts by "jarxiv"

Deep Learning Based Segmentation of Blood Vessels from H&E Stained Oesophageal Adenocarcinoma Whole-Slide Images

Posted on January 22, 2025 by jarxiv

Abstract: Blood vessels (BV) play an important role in the tumour microenvironment (TME), and cancer …

Categories: cs.CV, eess.IV

Let There Be Light: Robust Lensless Imaging Under External Illumination With Deep Learning

Posted on January 22, 2025 by jarxiv

Abstract: By shifting image formation from analog optics to digital post-processing, lensless cameras …

Categories: cs.CV, eess.IV

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Posted on January 22, 2025 by jarxiv

Abstract: In this paper, taking only screenshots as input and human-like …

Categories: cs.AI, cs.CL, cs.CV, cs.HC

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

Posted on January 22, 2025 by jarxiv

Abstract: Unifying visual understanding and generation within a single autoregressive framework, a novel multi …

Categories: cs.CV

Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops

Posted on January 22, 2025 by jarxiv

Abstract: Prostate cancer (PCa) detection using deep learning (DL) models, during biopsy …

Categories: cs.CV, cs.LG, eess.IV, q-bio.TO

Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2

Posted on January 22, 2025 by jarxiv

Abstract: Radiology, owing to its non-invasive diagnostic capabilities, plays a pivotal role in modern medicine …

Categories: cs.CV

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Posted on January 22, 2025 by jarxiv

Abstract: Large Vision Language Models (LVLMs) …

Categories: cs.CL, cs.CV

DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions

Posted on January 22, 2025 by jarxiv

Abstract: Splatting-based 3D reconstruction methods, 3D Gaussian Splatting …

Categories: cs.AI, cs.CV, cs.GR

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Posted on January 22, 2025 by jarxiv

Abstract: Depth Anything, with its strong generalization ability, in monocular depth estimation …

Categories: cs.AI, cs.CV

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Posted on January 22, 2025 by jarxiv

Abstract: For evaluating foundation models in video understanding, a comprehensive expert-level multi-discipline …

Categories: cs.AI, cs.CL, cs.CV
