Monthly Archives: April 2024

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark

Posted on April 16, 2024 by jarxiv

Summary: As an alternative to costly expert evaluation, image aesthetic assessment (IAA) …

Categories: cs.AI, cs.CV

TTK is Getting MPI-Ready

Posted on April 16, 2024 by jarxiv

Summary: This system paper … Message Passing Interface (MP …

Categories: cs.CG, cs.CV, cs.DC, cs.LG, cs.MS

Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

Posted on April 16, 2024 by jarxiv

Summary: For semantic taxonomies, including those routinely used in image segmentation, hierarchy is …

Categories: cs.CV

AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception

Posted on April 16, 2024 by jarxiv

Summary: The highly abstract nature of image aesthetics perception (IAP) … current multimodal large …

Categories: cs.CV

Bridging Vision and Language Spaces with Assignment Prediction

Posted on April 16, 2024 by jarxiv

Summary: In this paper, pretrained vision models and large language models (LLMs) …

Categories: cs.CV, cs.LG

In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation

Posted on April 16, 2024 by jarxiv

Summary: We … visual recognition (such as semantic segmentation), low-level image …

Categories: cs.CV

Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding

Posted on April 16, 2024 by jarxiv

Summary: Zero-shot 3D point cloud understanding … 2D Vision-Language M …

Categories: cs.CV

Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap

Posted on April 16, 2024 by jarxiv

Summary: Neural Radiance Fields (NeRFs) … autonomous driving …

Categories: cs.CV, cs.RO

Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI

Posted on April 16, 2024 by jarxiv

Summary: Reconstructing the images a subject observed from fMRI data collected during visual stimulation …

Categories: cs.CV, cs.LG, q-bio.NC

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

Posted on April 16, 2024 by jarxiv

Summary: Zero-shot learning (ZSL) … from known categories to unknown categories … semantic …

Categories: cs.CV
