"cs.AI" Category Archive

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders

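Posted on: September 11, 2024 | Author: jarxiv

Abstract: With the rapid progress of large language models (LLMs), natural language processing capabilities have significantly … Continue reading →

Categories: cs.AI, cs.CL, cs.SD, eess.AS | Comments are closed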

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Posted on: September 11, 2024 | Author: jarxiv

Abstract: Models such as GPT-4o, with large language models (LLMs) through speech, … Continue reading →

Categories: cs.AI, cs.CL, cs.SD, eess.AS, I.2.7 | Comments are closed

Insuring Uninsurable Risks from AI: The State as Insurer of Last Resort

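Posted on: September 11, 2024 | Author: jarxiv

Abstract: According to many experts, AI systems sooner or later, including existential risk, uninsurable … Continue reading →

Categories: cs.AI, cs.CY, cs.LG | Comments are closed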

Liability and Insurance for Catastrophic Losses: the Nuclear Power Precedent and Lessons for AI

Posted on: September 11, 2024 | Author: jarxiv

Abstract: As AI systems become more autonomous and capable, according to experts, AI systems … Continue reading →

Categories: cs.AI, cs.CY, cs.LG | Comments are closed

Benchmarking Sub-Genre Classification For Mainstage Dance Music

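Posted on: September 11, 2024 | Author: jarxiv

Abstract: Music classification serves a wide range of applications, and in music information retrieval the most important … Continue reading →

Categories: cs.AI, cs.MM, cs.SD, I.2.1 | Comments are closed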

Geometric-Averaged Preference Optimization for Soft Preference Labels

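Posted on: September 11, 2024 | Author: jarxiv

Abstract: In many algorithms for aligning LLMs with human preferences, human preferences are binary … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments are closed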

HybridFC: A Hybrid Fact-Checking Approach for Knowledge Graphs

Posted on: September 11, 2024 | Author: jarxiv

Abstract: Aimed at predicting the veracity of assertions in knowledge graphs, fact … Continue reading →

Categories: cs.AI, cs.DB, cs.LG | Comments are closed

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Posted on: September 11, 2024 | Author: jarxiv

Abstract: GPT-4o's remarkable multimodal capabilities and interactive experience … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments are closed

Aligning Machine and Human Visual Representations across Abstraction Levels

Posted on: September 11, 2024 | Author: jarxiv

Abstract: Deep neural networks, as models of human behavior in vision tasks, … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments are closed

Extending 6D Object Pose Estimators for Stereo Vision

Posted on: September 11, 2024 | Author: jarxiv

Abstract: Accurately, quickly, and reliably estimating the 6D pose of objects still … Continue reading →

Categories: cs.AI, cs.CV | Comments are closed
