「cs.CL」カテゴリーアーカイブ

OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation

投稿日: 2025年4月21日作成者: jarxiv

要約大規模な言語モデル（LLM）の一般的な能力が改善され、エージェントアプリケ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

投稿日: 2025年4月21日作成者: jarxiv

要約脱獄攻撃に対するLLMの堅牢性は、ユーザーが安全対策を回避し、モデル能力を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence

投稿日: 2025年4月21日作成者: jarxiv

要約オープンソースインテリジェンスは、領土制御の評価を通知できる非構造化された … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.6 | コメントを受け付けていません

Understanding Epistemic Language with a Language-augmented Bayesian Theory of Mind

投稿日: 2025年4月21日作成者: jarxiv

要約これらの信念を直接観察することはできませんが、人々は他人の信念についての主 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Only Send What You Need: Learning to Communicate Efficiently in Federated Multilingual Machine Translation

投稿日: 2025年4月21日作成者: jarxiv

要約 Federated Learning（FL）は、複数のクライアントがグロー … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Scaling sparse feature circuit finding for in-context learning

投稿日: 2025年4月21日作成者: jarxiv

要約スパース自動エンコーダー（SAE）は、大規模な言語モデルのアクティベーショ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

投稿日: 2025年4月21日作成者: jarxiv

要約ペイウォール、ライセンス、著作権規則は、多くの場合、科学的知識の広範な普及 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations

投稿日: 2025年4月21日作成者: jarxiv

要約人間は効率的な言語学習者であり、本質的に社会的な生き物です。私たちの言語 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

投稿日: 2025年4月21日作成者: jarxiv

要約強化学習（RL）は、大規模な言語モデルの推論能力を強化するための強力なパラ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

投稿日: 2025年4月21日作成者: jarxiv

要約生成AI（2020-2023）の「Act I」と呼ばれる可能性のある大規模 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence

Understanding Epistemic Language with a Language-augmented Bayesian Theory of Mind

Only Send What You Need: Learning to Communicate Efficiently in Federated Multilingual Machine Translation

Scaling sparse feature circuit finding for in-context learning

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

最近の投稿

最近のコメント

アーカイブ

カテゴリー