「cs.LG」カテゴリーアーカイブ

On the Role of Speech Data in Reducing Toxicity Detection Bias

投稿日: 2025年5月19日作成者: jarxiv

要約テキスト毒性検出システムは、人口統計グループに言及しているサンプルに不均衡 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

投稿日: 2025年5月19日作成者: jarxiv

要約モデルベースの強化学習（RL）は、ほとんどのモデルのないRLアルゴリズムを … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Prototype Augmented Hypernetworks for Continual Learning

投稿日: 2025年5月19日作成者: jarxiv

要約継続的な学習（CL）は、事前の知識を忘れることなく一連のタスクを学ぶことを … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion

投稿日: 2025年5月19日作成者: jarxiv

要約時間依存の部分微分方程式（PDE）の演算子学習は、近年急速な進歩を遂げてお … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection

投稿日: 2025年5月19日作成者: jarxiv

要約最新のニューラルネットワークは、多くの場合、すべての入力に対してすべてのニ … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE | コメントを受け付けていません

EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions

投稿日: 2025年5月19日作成者: jarxiv

要約このペーパーでは、スマートホーム環境でのマルチセッションの自然言語の相互作 … 続きを読む →

カテゴリー: cs.AI, cs.HC, cs.LG | コメントを受け付けていません

Mergenetic: a Simple Evolutionary Model Merging Library

投稿日: 2025年5月19日作成者: jarxiv

要約モデルのマージにより、既存のモデルの機能を新しいモデルに組み合わせることが … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE | コメントを受け付けていません

Exploratory Diffusion Model for Unsupervised Reinforcement Learning

投稿日: 2025年5月19日作成者: jarxiv

要約監視されていない強化学習（URL）は、報酬のない環境で多様な状態またはスキ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

投稿日: 2025年5月19日作成者: jarxiv

要約優先データセットは、人間のフィードバック（RLHF）からの強化学習を備えた … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Automatic Reward Shaping from Confounded Offline Data

投稿日: 2025年5月19日作成者: jarxiv

要約人工知能の重要なタスクは、不明な環境でエージェントを制御するための効果的な … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

On the Role of Speech Data in Reducing Toxicity Detection Bias

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Prototype Augmented Hypernetworks for Continual Learning

TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion

MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection

EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions

Mergenetic: a Simple Evolutionary Model Merging Library

Exploratory Diffusion Model for Unsupervised Reinforcement Learning

HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

Automatic Reward Shaping from Confounded Offline Data

最近の投稿

最近のコメント

アーカイブ

カテゴリー