月別アーカイブ: 2025年5月

CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks

投稿日: 2025年5月19日作成者: jarxiv

要約評価指標（メタ評価）の評価は、テキストからイメージ（T2I）の生成タスクに … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Resolving the Ambiguity of Complete-to-Partial Point Cloud Registration for Image-Guided Liver Surgery with Patches-to-Partial Matching

投稿日: 2025年5月19日作成者: jarxiv

要約画像誘導肝臓手術では、術前雲として表されることが多い術前と術中のデータの間 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

投稿日: 2025年5月19日作成者: jarxiv

要約ビジョン言語モデル（VLM）は、画像キャプションやビデオ質問の回答などのオ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes

投稿日: 2025年5月19日作成者: jarxiv

要約基礎モデルとして、SAMはコンピュータービジョン内の複数のフィールドに大き … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MARRS: Masked Autoregressive Unit-based Reaction Synthesis

投稿日: 2025年5月19日作成者: jarxiv

要約この作業は、挑戦的なタスクを目的としています。つまり、人間のアクション反応 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Dynamic Base model Shift for Delta Compression

投稿日: 2025年5月19日作成者: jarxiv

要約プレイン式財政パラダイムを備えた変圧器ベースのモデルは、複数のタスク上の微 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

投稿日: 2025年5月19日作成者: jarxiv

要約検証可能な報酬（RLVR）による強化学習は最近、特に数学とプログラミングタ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation

投稿日: 2025年5月19日作成者: jarxiv

要約 Vision-and-Language Navigation（VLN）は、 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis

投稿日: 2025年5月19日作成者: jarxiv

要約このペーパーでは、リモートセンシング画像分析のためのディープラーニング（D … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory

投稿日: 2025年5月19日作成者: jarxiv

要約このペーパーでは、相互情報理論を使用した限られたサンプルの下で、ニューラル … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年5月

CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks

Resolving the Ambiguity of Complete-to-Partial Point Cloud Registration for Image-Guided Liver Surgery with Patches-to-Partial Matching

Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes

MARRS: Masked Autoregressive Unit-based Reaction Synthesis

Dynamic Base model Shift for Delta Compression

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation

reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis

MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory

最近の投稿

最近のコメント

アーカイブ

カテゴリー