月別アーカイブ: 2023年5月

Long-tailed Visual Recognition via Gaussian Clouded Logit Adjustment

投稿日: 2023年5月22日作成者: jarxiv

要約ディープニューラルネットワークはバランスの取れたデータで大きな成功を収 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Survey of Automatic Plankton Image Recognition: Challenges, Existing Solutions and Future Perspectives

投稿日: 2023年5月22日作成者: jarxiv

要約浮遊生物は水生生態系の重要な構成要素であり、環境の変化に迅速に反応するため … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TaLU: A Hybrid Activation Function Combining Tanh and Rectified Linear Unit to Enhance Neural Networks

投稿日: 2023年5月22日作成者: jarxiv

要約分類における深層学習モデルの適用は、ターゲットオブジェクトの正確な検出に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Generating Visual Spatial Description via Holistic 3D Scene Understanding

投稿日: 2023年5月22日作成者: jarxiv

要約視覚的空間記述 (VSD) は、画像内の特定のオブジェクトの空間関係を説明 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner

投稿日: 2023年5月22日作成者: jarxiv

要約大規模な事前トレーニング済みマルチモーダルモデルは、画像キャプション、画 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes

投稿日: 2023年5月22日作成者: jarxiv

要約人間と動物は物理世界について豊かかつ柔軟な理解を持っており、それによって物 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO, q-bio.NC | コメントを受け付けていません

Dynamic Sparse Training with Structured Sparsity

投稿日: 2023年5月22日作成者: jarxiv

要約ダイナミックスパーストレーニング (DST) 手法は、スパースニュー … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

A One-Class Classifier for the Detection of GAN Manipulated Multi-Spectral Satellite Images

投稿日: 2023年5月22日作成者: jarxiv

要約現在の画像生成モデルによって実現される非常にリアルな画像品質は、多くの学術 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

A Comprehensive Survey on Segment Anything Model for Vision and Beyond

投稿日: 2023年5月22日作成者: jarxiv

要約人工知能 (AI) は、汎用人工知能に向けて進化しています。これは、幅広い … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MaGIC: Multi-modality Guided Image Completion

投稿日: 2023年5月22日作成者: jarxiv

要約バニライメージの補完アプローチは、妥当な生成に使用できる参照情報が限られ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2023年5月

Long-tailed Visual Recognition via Gaussian Clouded Logit Adjustment

Survey of Automatic Plankton Image Recognition: Challenges, Existing Solutions and Future Perspectives

TaLU: A Hybrid Activation Function Combining Tanh and Rectified Linear Unit to Enhance Neural Networks

Generating Visual Spatial Description via Holistic 3D Scene Understanding

Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner

Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes

Dynamic Sparse Training with Structured Sparsity

A One-Class Classifier for the Detection of GAN Manipulated Multi-Spectral Satellite Images

A Comprehensive Survey on Segment Anything Model for Vision and Beyond

MaGIC: Multi-modality Guided Image Completion

最近の投稿

最近のコメント

アーカイブ

カテゴリー