Monthly Archives: June 2024

Benchmark Data Contamination of Large Language Models: A Survey

Posted on: June 7, 2024 | Author: jarxiv

Abstract: Large language models such as GPT-4, Claude-3, and Gemini (LL … Continue reading →

Categories: cs.CL | Comments are closed

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs

Posted on: June 7, 2024 | Author: jarxiv

Abstract: As language models (LMs) demonstrate their capabilities across a variety of fields, multi … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed

Transformers need glasses! Information over-squashing in language tasks

Posted on: June 7, 2024 | Author: jarxiv

Abstract: We … most existing frontier large language models (LLMs) … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Posted on: June 7, 2024 | Author: jarxiv

Abstract: To improve the accuracy, efficiency, and robustness of large language models (LLMs), a novel … Continue reading →

Categories: cs.CL | Comments are closed

Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People

Posted on: June 7, 2024 | Author: jarxiv

Abstract: Conversational tone, meaning the manner and attitude with which speakers communicate, effect … Continue reading →

Categories: cs.CL, cs.HC | Comments are closed

What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

Posted on: June 7, 2024 | Author: jarxiv

Abstract: What can large language models learn? By definition, language models (L … Continue reading →

Categories: cs.CL | Comments are closed

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Posted on: June 7, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) applied to code-related applications … Continue reading →

Categories: cs.CL, cs.LG, cs.SE | Comments are closed

Measuring and Addressing Indexical Bias in Information Retrieval

Posted on: June 7, 2024 | Author: jarxiv

Abstract: Information retrieval (IR) systems are designed to deliver relevant content … Continue reading →

Categories: cs.CL, cs.IR | Comments are closed

Don’t Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation

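Posted on: June 7, 2024 | Author: jarxiv

Abstract: Given a source sentence, neural machine translation systems … the probability of the target sentence … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed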

Redundancy-aware Action Spaces for Robot Learning

Posted on: June 7, 2024 | Author: jarxiv

Abstract: In the robot learning literature, joint-space control and task-space control … robot arms … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, cs.RO | Comments are closed
