「cs.AI」カテゴリーアーカイブ

Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning

投稿日: 2024年5月31日作成者: jarxiv

要約オフライン強化学習では、分布外 (OOD) という課題が顕著です。これに … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics

投稿日: 2024年5月31日作成者: jarxiv

要約多くの場合、自然言語は、人間がロボットのタスクを指定するための最も簡単で便 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

投稿日: 2024年5月31日作成者: jarxiv

要約不確実性に直面した場合、*情報を探す*能力は基本的に重要です。医療診断や … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Systematic Analysis for Pretrained Language Model Priming for Parameter-Efficient Fine-tuning

投稿日: 2024年5月31日作成者: jarxiv

要約事前トレーニング済み言語モデル (PLM) を下流タスクに適応させるための … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Language Models Need Inductive Biases to Count Inductively

投稿日: 2024年5月31日作成者: jarxiv

要約自然数を定義するペアノの公理という数学的レンズを通して見ても、数えることを … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Text clustering with LLM embeddings

投稿日: 2024年5月31日作成者: jarxiv

要約テキストクラスタリングは、増え続けるデジタルコンテンツを整理するための … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.6 | コメントを受け付けていません

Code Repair with LLMs gives an Exploration-Exploitation Tradeoff

投稿日: 2024年5月31日作成者: jarxiv

要約大規模言語モデル (LLM) を使用してソースコードを繰り返し改善および … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.PL, cs.SE | コメントを受け付けていません

Reasoning about concepts with LLMs: Inconsistencies abound

投稿日: 2024年5月31日作成者: jarxiv

要約知識を要約して抽象的な概念に整理する能力は、学習と推論の鍵となります。多 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Aligning Crowd Feedback via Distributional Preference Reward Modeling

投稿日: 2024年5月31日作成者: jarxiv

要約深層強化学習は、大規模言語モデル (LLM) を人間の好みに合わせるために … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Iterative Feature Boosting for Explainable Speech Emotion Recognition

投稿日: 2024年5月31日作成者: jarxiv

要約音声感情認識 (SER) では、実際の重要性を考慮せずに事前定義された特徴 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS, I.2.1 | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning

Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Systematic Analysis for Pretrained Language Model Priming for Parameter-Efficient Fine-tuning

Language Models Need Inductive Biases to Count Inductively

Text clustering with LLM embeddings

Code Repair with LLMs gives an Exploration-Exploitation Tradeoff

Reasoning about concepts with LLMs: Inconsistencies abound

Aligning Crowd Feedback via Distributional Preference Reward Modeling

Iterative Feature Boosting for Explainable Speech Emotion Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー