Category archive: cs.LG

Towards Quantifying the Hessian Structure of Neural Networks

Posted on May 6, 2025 by jarxiv

Summary: Empirical studies report that the Hessian matrix of neural networks (NNs) is block …

Categories: cs.LG, math.OC, stat.ML

Impact of Noisy Supervision in Foundation Model Learning

Posted on May 6, 2025 by jarxiv

Summary: Foundation models are typically pre-trained on large-scale datasets, and tuning …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing

Posted on May 6, 2025 by jarxiv

Summary: With the growing use of large language models (LLMs) for text generation, AI …

Categories: cs.AI, cs.CL, cs.HC, cs.LG

JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings

Posted on May 6, 2025 by jarxiv

Summary: Unsupervised contrastive learning has become a hot research topic in natural language processing …

Categories: cs.AI, cs.CL, cs.LG

RM-R1: Reward Modeling as Reasoning

Posted on May 6, 2025 by jarxiv

Summary: Reward modeling, especially through reinforcement learning from human feedback (RLHF), …

Categories: cs.AI, cs.CL, cs.LG

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Posted on May 6, 2025 by jarxiv

Summary: Chain-of-thought (CoT) reasoning in large language models (LLMs) is latent …

Categories: cs.AI, cs.CL, cs.LG

Unveiling the Mechanisms of Explicit CoT Training: How CoT Enhances Reasoning Generalization

Posted on May 6, 2025 by jarxiv

Summary: For the training of large language models (LLMs), explicit chain-of-thought (CoT) …

Categories: cs.AI, cs.CL, cs.LG

Bielik v3 Small: Technical Report

Posted on May 6, 2025 by jarxiv

Summary: A series of parameter-efficient generative text models optimized for Polish language processing …

Categories: 68T50, cs.AI, cs.CL, cs.LG, I.2.7

EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning

Posted on May 6, 2025 by jarxiv

Summary: Recent advances in reinforcement learning (RL) for large language models (LLMs), multi-objective tasks …

Categories: cs.AI, cs.CL, cs.LG

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Posted on May 6, 2025 by jarxiv

Summary: Training effective AI agents for multi-turn interactions requires …

Categories: cs.AI, cs.CL, cs.LG
