Archive of posts by "jarxiv"

Improving large language models with concept-aware fine-tuning

Posted: June 10, 2025 | Author: jarxiv

Summary: Large language models (LLMs) have become the foundation of modern AI. However, next …

Categories: cs.AI, cs.CL, cs.LG

Beyond Numeric Rewards: In-Context Dueling Bandits with LLM Agents

Posted: June 10, 2025 | Author: jarxiv

Summary: In-context reinforcement learning (ICRL) is, for reinforcement learning (RL) in the era of foundation models, …

Categories: cs.AI, cs.CL, cs.LG

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Posted: June 10, 2025 | Author: jarxiv

Summary: Large language models (LLMs) have demonstrated significant improvements in contextual understanding …

Categories: cs.CL

Introspective Growth: Automatically Advancing LLM Expertise in Technology Judgment

Posted: June 10, 2025 | Author: jarxiv

Summary: Large language models (LLMs) increasingly show signs of conceptual understanding, but …

Categories: cs.CL, cs.CY, cs.DL, cs.IR

MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs

Posted: June 10, 2025 | Author: jarxiv

Summary: For language models deployed in real-world systems, it is often the case that new or corrected …

Categories: cs.CL, cs.LG

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Posted: June 10, 2025 | Author: jarxiv

Summary: Multimodal foundation models such as Gemini and ChatGPT …

Categories: cs.CL, eess.AS

Quantum Graph Transformer for NLP Sentiment Classification

Posted: June 10, 2025 | Author: jarxiv

Summary: Quantum machine learning, particularly in domains where complex, structured data are important, more …

Categories: cs.CL, quant-ph

Statistical Hypothesis Testing for Auditing Robustness in Language Models

Posted: June 10, 2025 | Author: jarxiv

Summary: The output of a large language model (LLM) system under arbitrary interventions, such as input perturbations, …

Categories: cs.CL

Language Models over Canonical Byte-Pair Encodings

Posted: June 10, 2025 | Author: jarxiv

Summary: Modern language models, via deterministic tokenizers such as byte-pair encoding, …

Categories: cs.CL, cs.FL, cs.LG

General-Reasoner: Advancing LLM Reasoning Across All Domains

Posted: June 10, 2025 | Author: jarxiv

Summary: In enhancing the reasoning capabilities of large language models (LLMs), reinforcement learning (RL) has recently …

Categories: cs.CL
