Category archive: "cs.CL"

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Posted: July 4, 2024 | Author: jarxiv

Abstract: The development of large language models (LLMs) has […] of large multimodal models (LMMs) …

Categories: cs.CL, cs.CV

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Posted: July 4, 2024 | Author: jarxiv

Abstract: In this paper, to enjoy the privileges of vision-language models (VLMs), detection, visual question answering ( …

Categories: cs.CL, cs.CV, cs.DB

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Posted: July 4, 2024 | Author: jarxiv

Abstract: A versatile large vision-language model supporting long-context input and output, Inter …

Categories: cs.CL, cs.CV

LLM-Oracle Machines

Posted: July 4, 2024 | Author: jarxiv

Abstract: Contemporary AI applications leverage large language models (LLMs), and their knowledge …

Categories: cs.AI, cs.CL, cs.FL, F.1.1

Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents

Posted: July 3, 2024 | Author: jarxiv

Abstract: As an extension of Clark's conceptualization of grounding in conversation, we […] conversational …

Categories: cs.CL, cs.HC, cs.RO

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

Posted: July 3, 2024 | Author: jarxiv

Abstract: Foundation models that incorporate language, vision, and more recently actions have […] Internet …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO

Natural Language Can Help Bridge the Sim2Real Gap

Posted: July 3, 2024 | Author: jarxiv

Abstract: The main challenge in learning image-conditioned robot policies is […] for low-level control …

Categories: cs.CL, cs.CV, cs.LG, cs.RO, I.2.6

Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

Posted: July 3, 2024 | Author: jarxiv

Abstract: With the proliferation of large language models (LLMs), AI-generated […] on the Internet …

Categories: cs.CL, cs.IR

HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation

Posted: July 3, 2024 | Author: jarxiv

Abstract: As large language models (LLMs) become widely adopted in numerous applications …

Categories: cs.AI, cs.CL

MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification

Posted: July 3, 2024 | Author: jarxiv

Abstract: Multimodal mathematical reasoning in large multimodal models (LMMs) …

Categories: cs.CL
