Category archive: cs.AI

Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

Posted: April 24, 2024 | Author: jarxiv

Abstract: LLMs excel at processing text in human conversation, but social …

Categories: cs.AI, cs.RO

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

Posted: April 24, 2024 | Author: jarxiv

Abstract: Simply by merging upcycled Mixture-of-Experts (MoE), instruction-tuned …

Categories: cs.AI, cs.CL, cs.LG, cs.SE

Neuro-Inspired Hierarchical Multimodal Learning

Posted: April 24, 2024 | Author: jarxiv

Abstract: To gain a comprehensive and accurate perception of the real world, from various sources and modalities …

Categories: cs.AI, cs.LG

Aligning LLM Agents by Learning Latent Preference from User Edits

Posted: April 24, 2024 | Author: jarxiv

Abstract: Based on user edits made to the agent's output, language …

Categories: cs.AI, cs.CL, cs.IR, cs.LG

A review of deep learning-based information fusion techniques for multimodal medical image classification

Posted: April 24, 2024 | Author: jarxiv

Abstract: Multimodal medical imaging combines information from various imaging devices …

Categories: cs.AI, cs.CV

Visual Grounding Methods for VQA are Working for the Wrong Reasons!

Posted: April 24, 2024 | Author: jarxiv

Abstract: Existing visual question answering (VQA) methods, producing the right answers for the right reasons …

Categories: cs.AI, cs.CL, cs.CV

Taming Diffusion Probabilistic Models for Character Control

Posted: April 24, 2024 | Author: jarxiv

Abstract: By effectively leveraging motion diffusion probabilistic models, high-quality and diverse character …

Categories: cs.AI, cs.CV, cs.GR

CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method

Posted: April 24, 2024 | Author: jarxiv

Abstract: To meet the demand for high resolution, large pre-trained low-resolution diffusion …

Categories: cs.AI, cs.CV

VT-Former: An Exploratory Study on Vehicle Trajectory Prediction for Highway Surveillance through Graph Isomorphism and Transformer

Posted: April 24, 2024 | Author: jarxiv

Abstract: Improving road safety is essential to Intelligent Transportation Systems (ITS) …

Categories: cs.AI, cs.CV

Deep Models for Multi-View 3D Object Recognition: A Review

Posted: April 24, 2024 | Author: jarxiv

Abstract: Human decision-making often relies on visual information from multiple perspectives or views. …

Categories: cs.AI, cs.CV, cs.LG

