「cs.AI」カテゴリーアーカイブ

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

投稿日: 2024年7月18日作成者: jarxiv

要約大規模言語モデル (LLM) の目覚ましい進歩を利用して、ロボットナビゲ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.RO | コメントを受け付けていません

Rethinking the Integration of Prediction and Planning in Deep Learning-Based Automated Driving Systems: A Review

投稿日: 2024年7月18日作成者: jarxiv

要約自動運転は、個人、公共、貨物のモビリティに革命をもたらす可能性があります。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MA, cs.RO | コメントを受け付けていません

Subequivariant Reinforcement Learning in 3D Multi-Entity Physical Environments

投稿日: 2024年7月18日作成者: jarxiv

要約 3D 環境におけるマルチエンティティシステムの学習ポリシーは、エンティテ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

The Oscars of AI Theater: A Survey on Role-Playing with Language Models

投稿日: 2024年7月18日作成者: jarxiv

要約この調査では、言語モデルを使用したロールプレイングの急成長分野を調査し、初 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation

投稿日: 2024年7月18日作成者: jarxiv

要約この研究では、シーンのテキストをソース言語 (ヒンディー語など) からター … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

投稿日: 2024年7月18日作成者: jarxiv

要約低遅延音声翻訳の課題は、いくつかの出版物や共有タスクによって示されているよ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

投稿日: 2024年7月18日作成者: jarxiv

要約マルチメディアコンテンツの急速な拡大により、大規模なコレクションから関連 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Struct-X: Enhancing Large Language Models Reasoning with Structured Data

投稿日: 2024年7月18日作成者: jarxiv

要約論理情報とリレーショナル情報が豊富な構造化データには、大規模言語モデル ( … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

投稿日: 2024年7月18日作成者: jarxiv

要約大規模言語モデル (LLM) は、単純な数学問題の処理において目覚ましい進 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models

投稿日: 2024年7月18日作成者: jarxiv

要約マルチエージェントシステムで効果的にコラボレーションするには、エージェン … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Rethinking the Integration of Prediction and Planning in Deep Learning-Based Automated Driving Systems: A Review

Subequivariant Reinforcement Learning in 3D Multi-Entity Physical Environments

The Oscars of AI Theater: A Survey on Role-Playing with Language Models

Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

Struct-X: Enhancing Large Language Models Reasoning with Structured Data

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー