月別アーカイブ: 2025年3月

RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding

投稿日: 2025年3月12日作成者: jarxiv

要約ビデオ理解が可能なマルチモーダルの大手言語モデル（MLLMS）は急速に進ん … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Comparing Satellite Data for Next-Day Wildfire Predictability

投稿日: 2025年3月12日作成者: jarxiv

要約複数の研究が衛星画像を使用して翌日の火災予測を実施しています。 2つの主要 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification

投稿日: 2025年3月12日作成者: jarxiv

要約バッグベースの複数インスタンス学習（MIL）アプローチは、スライド画像全体 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding

投稿日: 2025年3月12日作成者: jarxiv

要約マルチモーダルの大手言語モデル（MLLM）の進歩にもかかわらず、現在のアプ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models

投稿日: 2025年3月12日作成者: jarxiv

要約医療イメージングのためのディープラーニングモデルの実際のパフォーマンスベン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D Point Cloud Generation via Autoregressive Up-sampling

投稿日: 2025年3月12日作成者: jarxiv

要約 3Dポイントクラウド生成向けの先駆的なオートレーフレフな生成モデルを紹介し … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction

投稿日: 2025年3月12日作成者: jarxiv

要約 X線イメージングは、医療診断において不可欠ですが、その使用は潜在的な健 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

投稿日: 2025年3月12日作成者: jarxiv

要約表面正数は3Dシーンのジオメトリを分析するために広く使用されていますが、L … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding

投稿日: 2025年3月12日作成者: jarxiv

要約ビデオ大規模な言語モデル（Videollms）は、ビデオ理解において顕著な … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM | コメントを受け付けていません

CellStyle: Improved Zero-Shot Cell Segmentation via Style Transfer

投稿日: 2025年3月12日作成者: jarxiv

要約細胞顕微鏡データは豊富です。ただし、対応するセグメンテーション注釈は希少 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

月別アーカイブ: 2025年3月

RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding

Comparing Satellite Data for Next-Day Wildfire Predictability

MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification

HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding

Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models

3D Point Cloud Generation via Autoregressive Up-sampling

X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding

CellStyle: Improved Zero-Shot Cell Segmentation via Style Transfer

最近の投稿

最近のコメント

アーカイブ

カテゴリー