Category archive: cs.AI

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Posted: July 11, 2024 | Author: jarxiv

Abstract: In this paper, the effect of extending input length on the capabilities of large language models (LLMs) … Continue reading →

Categories: cs.AI, cs.CL | Comments closed

Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers

Posted: July 11, 2024 | Author: jarxiv

Abstract: In previous work, the MLPs within ReLU Transformers were highly … Continue reading →

Categories: cs.AI, cs.LG | Comments closed

Agent Lumos: Unified and Modular Training for Open-Source Language Agents

Posted: July 11, 2024 | Author: jarxiv

Abstract: For closed-source agents, especially on complex interactive tasks, affordability … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments closed

Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents

Posted: July 11, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), in a wide range of autonomous agent systems, … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments closed

Toto: Time Series Optimized Transformer for Observability

Posted: July 11, 2024 | Author: jarxiv

Abstract: In this technical report, developed by Datadog for time series forecasting, … Continue reading →

Categories: cs.AI, cs.LG | Comments closed

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Posted: July 11, 2024 | Author: jarxiv

Abstract: This study, on aligning large language models (LLMs) with human preferences, … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments closed

Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation

Posted: July 11, 2024 | Author: jarxiv

Abstract: Recent work on dexterous manipulation problems, in particular reorienting an object in the hand, … Continue reading →

Categories: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY | Comments closed

Is Your LLM Outdated? Evaluating LLMs at Temporal Generalization

Posted: July 11, 2024 | Author: jarxiv

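Abstract: The rapid progress of large language models (LLMs), with improvements in language understanding and information processing, … Continue reading →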

Categories: cs.AI, cs.CL | Comments closed

Training on the Test Task Confounds Evaluation and Emergence

Posted: July 11, 2024 | Author: jarxiv

Abstract: Training on the test task, as we call it, in large language model evaluation … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments closed

A Coding-Theoretic Analysis of Hyperspherical Prototypical Learning Geometry

Posted: July 11, 2024 | Author: jarxiv

Abstract: Hyperspherical prototypical learning (HPL), with class prototypes on the unit hypersphere, … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, eess.SP, stat.ML | Comments closed
