Archive of posts by "jarxiv"

Towards Better Instruction Following Retrieval Models

Posted: May 28, 2025 | Author: jarxiv

Summary: For modern information retrieval (IR) models trained only on standard pairs, explicit user … Continue reading →

Categories: cs.CL, cs.IR | Comments are closed

ANCHOLIK-NER: A Benchmark Dataset for Bangla Regional Named Entity Recognition

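Posted: May 28, 2025 | Author: jarxiv

Summary: Named entity recognition (NER) for regional dialects, especially for languages like Bangla that are low- … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed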

Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs

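Posted: May 28, 2025 | Author: jarxiv

Summary: Large language models (LLMs), from the probability distribution over the vocabulary at each decoding step, … Continue reading →

Categories: cs.CL | Comments are closed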

One-shot Entropy Minimization

Posted: May 28, 2025 | Author: jarxiv

Summary: We trained 13,440 large language models, and for entropy minimization … Continue reading →

Categories: cs.CL | Comments are closed

Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication

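Posted: May 28, 2025 | Author: jarxiv

Summary: Conversational breakdowns in close relationships are deeply shaped by personal histories and emotional context … Continue reading →

Categories: cs.CL | Comments are closed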

Thinking beyond the anthropomorphic paradigm benefits LLM research

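Posted: May 28, 2025 | Author: jarxiv

Summary: Anthropomorphism, or the attribution of human traits to technology, in those with advanced technical expertise … Continue reading →

Categories: cs.CL | Comments are closed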

Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance

Posted: May 28, 2025 | Author: jarxiv

Summary: In large language models (LLMs), a latent … that may differ from the input and output languages … Continue reading →

Categories: cs.CL | Comments are closed

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Posted: May 28, 2025 | Author: jarxiv

Summary: Memory, in AI … underpinning large language model (LLM)-based agents … Continue reading →

Categories: cs.CL | Comments are closed

Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion

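Posted: May 28, 2025 | Author: jarxiv

Summary: Diffusion language models offer parallel token generation and inherent bidirectionality, and autoregressive … Continue reading →

Categories: cs.CL | Comments are closed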

Beyond ‘Aha!’: Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

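Posted: May 28, 2025 | Author: jarxiv

Summary: Large reasoning models (LRMs) already possess a latent capacity for long chain-of-thought reasoning … Continue reading →

Categories: cs.CL | Comments are closed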

