投稿者「jarxiv」のアーカイブ

VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models

投稿日: 2025年6月17日作成者: jarxiv

要約ビデオインペインティング拡散トランスモデルを使用して、部分微分方程式（PD … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

UltraZoom: Generating Gigapixel Images from Regular Photos

投稿日: 2025年6月17日作成者: jarxiv

要約ハンドヘルドの電話写真など、さりげなくキャプチャされた入力からオブジェクト … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

投稿日: 2025年6月17日作成者: jarxiv

要約 Vision-Language-action（VLA）モデルの最近の進歩は … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Touch begins where vision ends: Generalizable policies for contact-rich manipulation

投稿日: 2025年6月17日作成者: jarxiv

要約データ駆動型のアプローチは、正確な操作と闘っています。模倣学習には、多く … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value

投稿日: 2025年6月17日作成者: jarxiv

要約拡散モデルは、生成モデリングで顕著な成功を収めています。より安定したトレ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images

投稿日: 2025年6月17日作成者: jarxiv

要約カメラや人間のポーズ情報のない明確な被験者のカジュアルにキャプチャされた画 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

How Much is Enough? The Diminishing Returns of Tokenization Training Data

投稿日: 2025年6月17日作成者: jarxiv

要約自然言語処理における重要な初期ステップであるトークン化は、トークン化アルゴ … 続きを読む →

カテゴリー: cs.CE, cs.CL | コメントを受け付けていません

Improving Surgical Risk Prediction Through Integrating Automated Body Composition Analysis: a Retrospective Trial on Colectomy Surgery

投稿日: 2025年6月17日作成者: jarxiv

要約目的：CTスキャンから術前の体組成メトリックが自動的に抽出されたかどうかを … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Towards a Cascaded LLM Framework for Cost-effective Human-AI Decision-Making

投稿日: 2025年6月17日作成者: jarxiv

要約効果的な人間と意思決定のバランスは、3つの重要な要素をバランスさせます。\ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Foundation Models in Medical Imaging — A Review and Outlook

投稿日: 2025年6月17日作成者: jarxiv

要約ファンデーションモデル（FMS）は、非標識データの大規模なコレクションから … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models

UltraZoom: Generating Gigapixel Images from Regular Photos

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

Touch begins where vision ends: Generalizable policies for contact-rich manipulation

Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value

PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images

How Much is Enough? The Diminishing Returns of Tokenization Training Data

Improving Surgical Risk Prediction Through Integrating Automated Body Composition Analysis: a Retrospective Trial on Colectomy Surgery

Towards a Cascaded LLM Framework for Cost-effective Human-AI Decision-Making

Foundation Models in Medical Imaging — A Review and Outlook

最近の投稿

最近のコメント

アーカイブ

カテゴリー