月別アーカイブ: 2024年7月

OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting

投稿日: 2024年7月16日作成者: jarxiv

要約 3D ガウススプラッティングに依存したフォトリアリスティックな再構成は、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Human-in-the-Loop Visual Re-ID for Population Size Estimation

投稿日: 2024年7月16日作成者: jarxiv

要約コンピュータービジョンベースの再識別 (Re-ID) システムは、大規 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DataDream: Few-shot Guided Dataset Generation

投稿日: 2024年7月16日作成者: jarxiv

要約テキストから画像への拡散モデルは、画像合成において最先端の結果を達成するこ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning

投稿日: 2024年7月16日作成者: jarxiv

要約オブジェクトの 2D ビューからの情報を 3D 表現にエンコードすることは … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

投稿日: 2024年7月16日作成者: jarxiv

要約最近の研究では、複数の品質報酬を伴う強化学習 (RL) を使用すると、テキ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition

投稿日: 2024年7月16日作成者: jarxiv

要約深層学習ベースの物体認識システムは、さまざまな敵対的な摂動によって簡単にだ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Benchmarking Vision Language Models for Cultural Understanding

投稿日: 2024年7月16日作成者: jarxiv

要約基礎モデルと視覚言語の事前トレーニングには、特に高度な視覚言語モデル (V … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

A Dual-Attention Aware Deep Convolutional Neural Network for Early Alzheimer’s Detection

投稿日: 2024年7月16日作成者: jarxiv

要約アルツハイマー病 (AD) は神経変性の主要な形態であり、毎年数百万人が罹 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV, F.2.2, I.2.7 | コメントを受け付けていません

OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting

投稿日: 2024年7月16日作成者: jarxiv

要約この論文では、単一のカメラまたは携帯電話で撮影できる従来の狭視野 (NFo … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

In-Loop Filtering via Trained Look-Up Tables

投稿日: 2024年7月16日作成者: jarxiv

要約インループフィルタリング (ILF) は、画像/ビデオコーディング規格 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

月別アーカイブ: 2024年7月

OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting

Human-in-the-Loop Visual Re-ID for Population Size Estimation

DataDream: Few-shot Guided Dataset Generation

GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition

Benchmarking Vision Language Models for Cultural Understanding

A Dual-Attention Aware Deep Convolutional Neural Network for Early Alzheimer’s Detection

OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting

In-Loop Filtering via Trained Look-Up Tables

最近の投稿

最近のコメント

アーカイブ

カテゴリー