「cs.CL」カテゴリーアーカイブ

Gaussian mixture models as a proxy for interacting language models

投稿日: 2025年6月4日作成者: jarxiv

要約大規模言語モデル（LLM）は、多くの場面で人間の能力や行動と一致する能力を … 続きを読む →

カテゴリー: 62R07, cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems

投稿日: 2025年6月4日作成者: jarxiv

要約 Text-to-SQL システムは、クエリを実行可能な SQL コードに自 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?

投稿日: 2025年6月4日作成者: jarxiv

要約近年の自然言語処理における目覚ましい改善は、主に文脈ニューラル言語モデルの … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective

投稿日: 2025年6月4日作成者: jarxiv

要約強化学習(RL)は、複雑な長鎖思考(long-CoT)推論において大規模言 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

On the class of coding optimality of human languages and the origins of Zipf’s law

投稿日: 2025年6月4日作成者: jarxiv

要約ここでは、符号化システムの最適性に関する新しいクラスを提示する。そのクラス … 続きを読む →

カテゴリー: cs.CL, physics.soc-ph | コメントを受け付けていません

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

投稿日: 2025年6月4日作成者: jarxiv

要約最近の大規模言語モデル(LLM)は、オンライン強化学習(RL)の恩恵を受け … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization

投稿日: 2025年6月4日作成者: jarxiv

要約長いコンテキストの大規模言語モデル（LLM）は、KVキャッシュの大きなメモ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Literary Evidence Retrieval via Long-Context Language Models

投稿日: 2025年6月4日作成者: jarxiv

要約現代のロングコンテクスト言語モデルは、文学的フィクションをどの程度理解して … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human?

投稿日: 2025年6月4日作成者: jarxiv

要約文法誤り訂正（GEC）における自動評価メトリクスの目標の1つは、人間の嗜好 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Beyond Text Compression: Evaluating Tokenizers Across Scales

投稿日: 2025年6月4日作成者: jarxiv

要約トークナイザーの選択は言語モデルの性能に大きな影響を与えるが、トークナイザ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Gaussian mixture models as a proxy for interacting language models

Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?

Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective

On the class of coding optimality of human languages and the origins of Zipf’s law

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization

Literary Evidence Retrieval via Long-Context Language Models

Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human?

Beyond Text Compression: Evaluating Tokenizers Across Scales

最近の投稿

最近のコメント

アーカイブ

カテゴリー