"cs.CV" Category Archive

MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification

Posted: March 12, 2025 | Author: jarxiv

Summary: Bag-based multiple instance learning (MIL) approaches … whole slide image …

Categories: cs.AI, cs.CV

HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding

Posted: March 12, 2025 | Author: jarxiv

Summary: Despite advances in multimodal large language models (MLLMs), current …

Category: cs.CV

Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models

Posted: March 12, 2025 | Author: jarxiv

Summary: The real-world performance … of deep learning models for medical imaging …

Category: cs.CV

3D Point Cloud Generation via Autoregressive Up-sampling

Posted: March 12, 2025 | Author: jarxiv

Summary: We introduce a pioneering autoregressive generative model for 3D point cloud generation …

Categories: cs.CV, cs.LG

X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction

Posted: March 12, 2025 | Author: jarxiv

Summary: X-ray imaging is indispensable in medical diagnostics, but its use … potential …

Category: cs.CV

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

Posted: March 12, 2025 | Author: jarxiv

Summary: Surface normals are widely used to analyze the geometry of 3D scenes, but L …

Category: cs.CV

ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding

Posted: March 12, 2025 | Author: jarxiv

Summary: In video understanding, video large language models (VideoLLMs) … remarkable …

Categories: cs.CL, cs.CV, cs.MM

CellStyle: Improved Zero-Shot Cell Segmentation via Style Transfer

Posted: March 12, 2025 | Author: jarxiv

Summary: Cell microscopy data are abundant; however, the corresponding segmentation annotations are scarce …

Categories: cs.CV, cs.LG

Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

Posted: March 12, 2025 | Author: jarxiv

Summary: Recent advances in text-to-video diffusion models have, from a single prompt, high …

Categories: cs.AI, cs.CV, cs.LG

Curriculum Direct Preference Optimization for Diffusion and Consistency Models

Posted: March 12, 2025 | Author: jarxiv

Summary: Direct preference optimization (DPO) … reinforcement learning from human feedback (RLHF) …

Categories: cs.AI, cs.CV, cs.LG
