月別アーカイブ: 2024年5月

MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification

投稿日: 2024年5月30日作成者: jarxiv

要約 Large Vision Language Model (LVLM) は、 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG, I.4 | コメントを受け付けていません

LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model

投稿日: 2024年5月30日作成者: jarxiv

要約ビデオテキストスポッティングは、ビデオ内のテキストインスタンスのロー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Going beyond compositional generalization, DDPMs can produce zero-shot interpolation

投稿日: 2024年5月30日作成者: jarxiv

要約ノイズ除去拡散確率モデル (DDPM) は画像生成において顕著な機能を示し … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.NE | コメントを受け付けていません

$E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation

投稿日: 2024年5月30日作成者: jarxiv

要約このペーパーは、効率的で表現力豊かで編集可能なデジタルアバターを生成する … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification

投稿日: 2024年5月30日作成者: jarxiv

要約過去 10 年間、コンピュータービジョンでは、さまざまなトレーニングと学 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

投稿日: 2024年5月30日作成者: jarxiv

要約ビデオ言語を理解するタスクは短いビデオクリップに焦点を当てており、多くの … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

REBEL: Reinforcement Learning via Regressing Relative Rewards

投稿日: 2024年5月30日作成者: jarxiv

要約近接ポリシー最適化 (PPO) は、もともと連続制御問題のために開発されま … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning

投稿日: 2024年5月30日作成者: jarxiv

要約デジタル地図を使用したナビゲーションサービスの急増は、ドライバーに大きな … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV, stat.ML | コメントを受け付けていません

Towards Global Glacier Mapping with Deep Learning and Open Earth Observation Data

投稿日: 2024年5月30日作成者: jarxiv

要約地球規模の氷河の正確なマッピングは、気候変動の影響を理解するために不可欠で … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

A study on the adequacy of common IQA measures for medical images

投稿日: 2024年5月30日作成者: jarxiv

要約画質評価 (IQA) は、画像を操作する新しい機械学習アルゴリズムの開発段 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

月別アーカイブ: 2024年5月

MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification

LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model

Going beyond compositional generalization, DDPMs can produce zero-shot interpolation

$E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation

Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

REBEL: Reinforcement Learning via Regressing Relative Rewards

Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning

Towards Global Glacier Mapping with Deep Learning and Open Earth Observation Data

A study on the adequacy of common IQA measures for medical images

最近の投稿

最近のコメント

アーカイブ

カテゴリー