"cs.CL" Category Archive

LLM-based MOFs Synthesis Condition Extraction using Few-Shot Demonstrations

Posted: February 26, 2025 | Author: jarxiv

Abstract: Extracting metal-organic framework (MOF) synthesis routes from the literature, for desired …

Categories: cs.AI, cs.CL

AMPO: Active Multi-Preference Optimization

Posted: February 26, 2025 | Author: jarxiv

Abstract: Multi-preference optimization, with entire sets of helpful and undesired responses …

Categories: cs.AI, cs.CL, cs.LG

Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models

Posted: February 26, 2025 | Author: jarxiv

Abstract: Values are central drivers of individual and collective perception, cognition, and behavior. …

Categories: cs.AI, cs.CL

AgentRM: Enhancing Agent Generalization with Reward Modeling

Posted: February 26, 2025 | Author: jarxiv

Abstract: Existing LLM-based agents, with strong performance on held-in tasks …

Categories: cs.AI, cs.CL, cs.LG

Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models

Posted: February 26, 2025 | Author: jarxiv

Abstract: Training large language models (LLMs) with reinforcement learning feedback …

Categories: cs.AI, cs.CL, cs.LG, econ.GN, q-fin.EC

TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning

Posted: February 26, 2025 | Author: jarxiv

Abstract: Reasoning is a fundamental capability of large language models (LLMs), for understanding complex problems …

Categories: cs.AI, cs.CL

QuantMoE-Bench: Examining Post-Training Quantization for Mixture-of-Experts

Posted: February 26, 2025 | Author: jarxiv

Abstract: Mixture-of-Experts (MoE) is a promising way to scale the learning capacity of large language models. …

Categories: cs.AI, cs.CL, cs.LG

Disambiguate First Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing

Posted: February 26, 2025 | Author: jarxiv

Abstract: Handling ambiguity and underspecification, particularly in text-to-SQL semantic parsing …

Categories: cs.AI, cs.CL

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Posted: February 26, 2025 | Author: jarxiv

Abstract: The recent DeepSeek-R1 release, in large language models (LLMs), …

Categories: cs.AI, cs.CL, cs.SE

FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response

Posted: February 26, 2025 | Author: jarxiv

Abstract: Large language models (LLMs) have substantial potential for common-sense reasoning. …

Categories: cs.AI, cs.CL
