「cs.CL」カテゴリーアーカイブ

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation

投稿日: 2025年2月17日作成者: jarxiv

要約最近の生成大規模な言語モデル（LLMS）は、英語以外の言語で顕著なパフォー … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A Critical Look At Tokenwise Reward-Guided Text Generation

投稿日: 2025年2月17日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、人間のフィードバック（RLHF）からのい … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders

投稿日: 2025年2月17日作成者: jarxiv

要約線形再発性ニューラルネットワーク（線形RNN）は、シーケンスモデリングのた … 続きを読む →

カテゴリー: cs.CL, cs.FL, cs.LG | コメントを受け付けていません

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

投稿日: 2025年2月17日作成者: jarxiv

要約現代の言語モデルは、数兆個のトークンで構成される大規模で構造化されていない … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Agentic Verification for Ambiguous Query Disambiguation

投稿日: 2025年2月17日作成者: jarxiv

要約この作業では、検索された世代（RAG）におけるクエリを曖昧にしているという … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Enhancing Multilingual LLM Pretraining with Model-Based Data Selection

投稿日: 2025年2月17日作成者: jarxiv

要約データセットのキュレーションは、強力な大規模な言語モデル（LLM）パフォー … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models

投稿日: 2025年2月17日作成者: jarxiv

要約第二言語（ESL）としての英語学習者は、テキストの理解を妨げる不明な単語に … 続きを読む →

カテゴリー: cs.CL, cs.HC | コメントを受け付けていません

Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction

投稿日: 2025年2月17日作成者: jarxiv

要約大規模な言語モデル（LLMS）の最近の進捗状況により、タスク固有のデータセ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

投稿日: 2025年2月17日作成者: jarxiv

要約大規模な言語モデル（LLM）は、さまざまな言語タスクにわたって顕著な能力を … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Prediction hubs are context-informed frequent tokens in LLMs

投稿日: 2025年2月17日作成者: jarxiv

要約ハブネス、少数のポイントの傾向は、他のポイントの不均衡な数の最近隣人の中に … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation

A Critical Look At Tokenwise Reward-Guided Text Generation

DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Agentic Verification for Ambiguous Query Disambiguation

Enhancing Multilingual LLM Pretraining with Model-Based Data Selection

Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models

Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Prediction hubs are context-informed frequent tokens in LLMs

最近の投稿

最近のコメント

アーカイブ

カテゴリー