月別アーカイブ: 2024年6月

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

投稿日: 2024年6月28日作成者: jarxiv

要約自己調整は、確実なモデル機能を確保しながら、人間によるアノテーションのコス … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

LiveBench: A Challenging, Contamination-Free LLM Benchmark

投稿日: 2024年6月28日作成者: jarxiv

要約ベンチマークからのテストデータが新しいモデルのトレーニングセットに入る … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Jump Starting Bandits with LLM-Generated Prior Knowledge

投稿日: 2024年6月28日作成者: jarxiv

要約私たちは、大規模言語モデル (LLM) をコンテキストマルチアームバン … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

WebCanvas: Benchmarking Web Agents in Online Environments

投稿日: 2024年6月28日作成者: jarxiv

要約 Web エージェントが実際に役立つためには、ユーザーインターフェイスとコ … 続きを読む →

カテゴリー: 68T50, cs.AI, cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

CHESS: Contextual Harnessing for Efficient SQL Synthesis

投稿日: 2024年6月28日作成者: jarxiv

要約自然言語の質問を SQL クエリ (テキストから SQL) に変換するため … 続きを読む →

カテゴリー: cs.AI, cs.DB, cs.LG | コメントを受け付けていません

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

投稿日: 2024年6月28日作成者: jarxiv

要約大規模言語モデル (LLM) は、ジェイルブレイクや、さらには悪意のない微 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language

投稿日: 2024年6月28日作成者: jarxiv

要約ヘイトスピーチは社会の調和に重大な脅威をもたらします。過去 2 年間で、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?

投稿日: 2024年6月28日作成者: jarxiv

要約モデル編集の問題は、言語モデルが時間の経過とともに世界に関する新しい事実を … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space

投稿日: 2024年6月28日作成者: jarxiv

要約最新の生成モデルは、トレーニングデータの基礎となる抽象概念を識別して操作 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

The Remarkable Robustness of LLMs: Stages of Inference?

投稿日: 2024年6月28日作成者: jarxiv

要約隣接するレイヤーを削除および交換することにより、大規模言語モデルの顕著な堅 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年6月

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Jump Starting Bandits with LLM-Generated Prior Knowledge

WebCanvas: Benchmarking Web Agents in Online Environments

CHESS: Contextual Harnessing for Efficient SQL Synthesis

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language

Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?

Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space

The Remarkable Robustness of LLMs: Stages of Inference?

最近の投稿

最近のコメント

アーカイブ

カテゴリー