Category archive: cs.AI

RadEdit: stress-testing biomedical vision models via diffusion image editing

Posted: April 4, 2024 | Author: jarxiv

Abstract: Biomedical imaging datasets are often small and biased, so …

Categories: cs.AI, cs.CV

Domain Generalization through Meta-Learning: A Survey

Posted: April 4, 2024 | Author: jarxiv

Abstract: Deep neural networks (DNNs) have revolutionized artificial intelligence, but …

Categories: cs.AI, cs.CV, cs.LG, cs.NE

Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition

Posted: April 4, 2024 | Author: jarxiv

Abstract: Recent studies have shown that, when pre-trained on general visual learning tasks with large-scale data, …

Categories: cs.AI, cs.CV

Text-Driven Image Editing via Learnable Regions

Posted: April 4, 2024 | Author: jarxiv

Abstract: Language has emerged as a natural interface for image editing. In this paper, …

Categories: cs.AI, cs.CV, cs.LG

Enhancing Interpretability of Vertebrae Fracture Grading using Human-interpretable Prototypes

Posted: April 4, 2024 | Author: jarxiv

Abstract: Vertebral fracture grading classifies the severity of vertebral fractures, and in medical imaging diagnosis …

Categories: cs.AI, cs.CV

FlightScope: A Deep Comprehensive Assessment of Aircraft Detection Algorithms in Satellite Imagery

Posted: April 4, 2024 | Author: jarxiv

Abstract: Object detection in remotely sensed satellite imagery, for biophysics and environmental monitoring …

Categories: cs.AI, cs.CV

On the Scalability of Diffusion-based Text-to-Image Generation

Posted: April 4, 2024 | Author: jarxiv

Abstract: Scaling model and data size has been quite successful in the evolution of LLMs. …

Categories: cs.AI, cs.CV, cs.LG

DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets

Posted: April 4, 2024 | Author: jarxiv

Abstract: Vision Transformers (ViTs), across a variety of computer vision …

Categories: cs.AI, cs.CV, cs.LG

ALOHa: A New Measure for Hallucination in Captioning Models

Posted: April 4, 2024 | Author: jarxiv

Abstract: Despite recent advances in multimodal pre-training for visual description, …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Posted: April 4, 2024 | Author: jarxiv

Abstract: We … Visual AutoRegressive modeling (VA …

Categories: cs.AI, cs.CV
