"cs.CL" Category Archive

Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities

Posted: May 22, 2025 | Author: jarxiv

Abstract: The first comprehensive study of memorization in multilingual large language models (MLLMs) …

Categories: cs.AI, cs.CL

Probing Semantic Routing in Large Mixture-of-Expert Models

Posted: May 22, 2025 | Author: jarxiv

Abstract: Over the past year, large (>100B-parameter) mixture-of-experts (MoE) models …

Categories: cs.AI, cs.CL, cs.LG

DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning

Posted: May 22, 2025 | Author: jarxiv

Abstract: Large language models (LLMs), through extensive training on massive datasets, …

Categories: cs.AI, cs.CL, cs.LG

Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses

Posted: May 22, 2025 | Author: jarxiv

Abstract: Large language models (LLMs), from chatbots to agentic systems, …

Categories: cs.AI, cs.CL, cs.CR, cs.LG

dMel: Speech Tokenization made Simple

Posted: May 22, 2025 | Author: jarxiv

Abstract: Large language models leverage self-supervised pretraining on vast text data …

Categories: cs.AI, cs.CL, cs.SD, eess.AS

Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval

Posted: May 22, 2025 | Author: jarxiv

Abstract: Large language models (LLMs) are known to be vulnerable to jailbreaking attacks …

Categories: cs.AI, cs.CL, cs.CR, cs.LG

SWE-smith: Scaling Data for Software Engineering Agents

Posted: May 22, 2025 | Author: jarxiv

Abstract: Despite recent progress in language models (LMs) for software engineering, …

Categories: cs.AI, cs.CL, cs.SE

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Posted: May 22, 2025 | Author: jarxiv

Abstract: Human cognition typically does not rely strictly on discrete linguistic tokens but on abstract …

Categories: cs.AI, cs.CL

Large Language Models as Computable Approximations to Solomonoff Induction

Posted: May 22, 2025 | Author: jarxiv

Abstract: To explain the empirical success of rapidly advancing large language models (LLMs), …

Categories: cs.AI, cs.CL, cs.LG

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Posted: May 22, 2025 | Author: jarxiv

Abstract: Because of the high computational cost and latency of large language models such as GPT-4, in clinical settings …

Categories: cs.AI, cs.CL
