「cs.AI」カテゴリーアーカイブ

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

投稿日: 2024年4月12日作成者: jarxiv

要約テキストから画像への拡散モデルの制御性を高めるために、ControlNet … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Supervised Fine-tuning in turn Improves Visual Foundation Models

投稿日: 2024年4月12日作成者: jarxiv

要約近年、CLIP のような画像テキストトレーニングが視覚基礎モデルの事前ト … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding

投稿日: 2024年4月12日作成者: jarxiv

要約最近、大規模な基礎モデルが注目を集めており、広範なシナリオで優れたパフォー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

投稿日: 2024年4月12日作成者: jarxiv

要約テキストから画像への生成モデルはますます人気が高まっており、一般の人々が利 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MetaCheckGPT — A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models

投稿日: 2024年4月12日作成者: jarxiv

要約大規模言語モデル (LLM) における幻覚は、最近重大な問題になっています … 続きを読む →

カテゴリー: 68T07, 68T50, cs.AI, cs.CL, I.2.7 | コメントを受け付けていません

Gemma: Open Models Based on Gemini Research and Technology

投稿日: 2024年4月12日作成者: jarxiv

要約この作品では、Gemini モデルの作成に使用された研究とテクノロジーから … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability

投稿日: 2024年4月12日作成者: jarxiv

要約 Text-to-Vis は、自然言語処理 (NLP) 分野で新たに登場した … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Semantically-correlated memories in a dense associative model

投稿日: 2024年4月12日作成者: jarxiv

要約私は、相関高密度連想記憶 (CDAM) という名前の新しい連想記憶モデルを … 続きを読む →

カテゴリー: 00A69, 68T01, 68T07, 92B20, cs.AI, cs.LG, cs.NE, I.2, q-bio.NC | コメントを受け付けていません

Learning Strategies For Successful Crowd Navigation

投稿日: 2024年4月11日作成者: jarxiv

要約自律移動ロボットに群衆の中をうまく移動できるように教えることは、困難な課題 … 続きを読む →

カテゴリー: cs.AI, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation

投稿日: 2024年4月11日作成者: jarxiv

要約 Embodied AI コミュニティは、3D 座標、オブジェクト、言語記述 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Supervised Fine-tuning in turn Improves Visual Foundation Models

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

MetaCheckGPT — A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models

Gemma: Open Models Based on Gemini Research and Technology

Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability

Semantically-correlated memories in a dense associative model

Learning Strategies For Successful Crowd Navigation

GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation

最近の投稿

最近のコメント

アーカイブ

カテゴリー