投稿者「jarxiv」のアーカイブ

Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving

投稿日: 2025年1月29日作成者: jarxiv

要約大規模な言語モデル（LLM）は多くの自然言語タスクで優れていますが、特に象 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Accelerated Training through Iterative Gradient Propagation Along the Residual Path

投稿日: 2025年1月29日作成者: jarxiv

要約深い学習の礎であるにもかかわらず、バックプロパゲーションは、非常に深いモデ … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Solving Roughly Forced Nonlinear PDEs via Misspecified Kernel Methods and Neural Networks

投稿日: 2025年1月29日作成者: jarxiv

要約ガウスプロセス（GPS）またはニューラルネットワーク（NNS）を使用して、 … 続きを読む →

カテゴリー: cs.LG, cs.NA, math.NA | コメントを受け付けていません

Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction

投稿日: 2025年1月29日作成者: jarxiv

要約人間のフィードバック（RLHF）や直接選好最適化（DPO）からの強化学習な … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning

投稿日: 2025年1月29日作成者: jarxiv

要約最大エンゴロピー強化学習を通じて学習したポリシーの一般化と堅牢性の特性は、 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Convergence of two-timescale gradient descent ascent dynamics: finite-dimensional and mean-field perspectives

投稿日: 2025年1月29日作成者: jarxiv

要約ツータイムスケール勾配降下（GDA）は、MIN-MAXゲームでNASH平衡 … 続きを読む →

カテゴリー: cs.LG, cs.NA, math.NA, math.OC | コメントを受け付けていません

CoRe-Net: Co-Operational Regressor Network with Progressive Transfer Learning for Blind Radar Signal Restoration

投稿日: 2025年1月29日作成者: jarxiv

要約実世界のレーダー信号は、センサーノイズ、エコー、干渉、意図的な詰まり、タイ … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Scanning Trojaned Models Using Out-of-Distribution Samples

投稿日: 2025年1月29日作成者: jarxiv

要約深いニューラルネットワークでのトロイの木馬（バックドア）のスキャンは、実世 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Refusal in LLMs is an Affine Function

投稿日: 2025年1月29日作成者: jarxiv

要約アクティベーションに直接介入することにより、言語モデルの動作を操縦するため … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation

投稿日: 2025年1月29日作成者: jarxiv

要約パーソナライズされたニュースの見出しの世代は、ユーザーが好みに合わせて調整 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving

Accelerated Training through Iterative Gradient Propagation Along the Residual Path

Solving Roughly Forced Nonlinear PDEs via Misspecified Kernel Methods and Neural Networks

Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction

Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning

Convergence of two-timescale gradient descent ascent dynamics: finite-dimensional and mean-field perspectives

CoRe-Net: Co-Operational Regressor Network with Progressive Transfer Learning for Blind Radar Signal Restoration

Scanning Trojaned Models Using Out-of-Distribution Samples

Refusal in LLMs is an Affine Function

Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation

最近の投稿

最近のコメント

アーカイブ

カテゴリー