「cs.LG」カテゴリーアーカイブ

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

投稿日: 2025年5月26日作成者: jarxiv

要約 DPOから蒸留まで、訓練後の言語モデル（LLM）は、行動を改良し、新しいス … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

From Lists to Emojis: How Format Bias Affects Model Alignment

投稿日: 2025年5月26日作成者: jarxiv

要約この論文では、人間のフィードバック（RLHF）からの強化学習における形式バ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

投稿日: 2025年5月26日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、高度な推論、長型のコンテンツ生成、および … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

投稿日: 2025年5月26日作成者: jarxiv

要約強化学習（RL）は、バイナリ検証信号を通じて自己改善を可能にすることにより … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

投稿日: 2025年5月26日作成者: jarxiv

要約ディープラーニングは多くのドメインで顕著な成功を収めていますが、歴史的に表 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Fourier-Based 3D Multistage Transformer for Aberration Correction in Multicellular Specimens

投稿日: 2025年5月26日作成者: jarxiv

要約高解像度の組織イメージングは、分解能とコントラストを分解するサンプル誘 … 続きを読む →

カテゴリー: cs.AI, cs.LG, eess.IV, physics.bio-ph, q-bio.QM | コメントを受け付けていません

An Example Safety Case for Safeguards Against Misuse

投稿日: 2025年5月26日作成者: jarxiv

要約 AI誤用セーフガードの既存の評価は、実際の決定に接続することがしばしば困難 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Towards Copyright Protection for Knowledge Bases of Retrieval-augmented Language Models via Reasoning

投稿日: 2025年5月26日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、ドメイン固有の知識で応答を補うために、検 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.IR, cs.LG | コメントを受け付けていません

Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons

投稿日: 2025年5月26日作成者: jarxiv

要約多くの好みの誘発アルゴリズムは、異なる属性を持つ命題論理式またはアイテムに … 続きを読む →

カテゴリー: cs.AI, cs.FL, cs.LG, cs.SY, eess.SY | コメントを受け付けていません

On the Impact of the Utility in Semivalue-based Data Valuation

投稿日: 2025年5月26日作成者: jarxiv

要約 Semivalueベースのデータ評価は、協同ゲーム理論の直感を使用して、各 … 続きを読む →

カテゴリー: cs.AI, cs.GT, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

From Lists to Emojis: How Format Bias Affects Model Alignment

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

Fourier-Based 3D Multistage Transformer for Aberration Correction in Multicellular Specimens

An Example Safety Case for Safeguards Against Misuse

Towards Copyright Protection for Knowledge Bases of Retrieval-augmented Language Models via Reasoning

Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons

On the Impact of the Utility in Semivalue-based Data Valuation

最近の投稿

最近のコメント

アーカイブ

カテゴリー