「cs.LG」カテゴリーアーカイブ

A Critical Look At Tokenwise Reward-Guided Text Generation

投稿日: 2025年2月17日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、人間のフィードバック（RLHF）からのい … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders

投稿日: 2025年2月17日作成者: jarxiv

要約線形再発性ニューラルネットワーク（線形RNN）は、シーケンスモデリングのた … 続きを読む →

カテゴリー: cs.CL, cs.FL, cs.LG | コメントを受け付けていません

Enhancing Multilingual LLM Pretraining with Model-Based Data Selection

投稿日: 2025年2月17日作成者: jarxiv

要約データセットのキュレーションは、強力な大規模な言語モデル（LLM）パフォー … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

From Markov to Laplace: How Mamba In-Context Learns Markov Chains

投稿日: 2025年2月17日作成者: jarxiv

要約トランスベースの言語モデルはこれまでAI革命を推進してきましたが、その計算 … 続きを読む →

カテゴリー: cs.AI, cs.IT, cs.LG, math.IT | コメントを受け付けていません

Is Deep Learning finally better than Decision Trees on Tabular Data?

投稿日: 2025年2月17日作成者: jarxiv

要約表形式データは、多くの実際のアプリケーションでの汎用性と使いやすさのために … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

投稿日: 2025年2月17日作成者: jarxiv

要約数学的問題のための自動化された正式な証明生成で最先端の（SOTA）パフォー … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Dynamic Reinforcement Learning for Actors

投稿日: 2025年2月17日作成者: jarxiv

要約この論文で提案されている動的強化学習（動的RL）は、各瞬間にアクター（アク … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE | コメントを受け付けていません

Do Large Language Models Reason Causally Like Us? Even Better?

投稿日: 2025年2月17日作成者: jarxiv

要約因果推論は、知性のコアコンポーネントです。大規模な言語モデル（LLM）は … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Forget the Data and Fine-Tuning! Just Fold the Network to Compress

投稿日: 2025年2月17日作成者: jarxiv

要約モデル折りたたみを導入します。これは、層全体で構造的に類似したニューロンを … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Shield Synthesis for LTL Modulo Theories

投稿日: 2025年2月17日作成者: jarxiv

要約近年、機械学習（ML）モデルは、さまざまなドメインで顕著な成功を収めていま … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.LO, cs.RO | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

A Critical Look At Tokenwise Reward-Guided Text Generation

DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders

Enhancing Multilingual LLM Pretraining with Model-Based Data Selection

From Markov to Laplace: How Mamba In-Context Learns Markov Chains

Is Deep Learning finally better than Decision Trees on Tabular Data?

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

Dynamic Reinforcement Learning for Actors

Do Large Language Models Reason Causally Like Us? Even Better?

Forget the Data and Fine-Tuning! Just Fold the Network to Compress

Shield Synthesis for LTL Modulo Theories

最近の投稿

最近のコメント

アーカイブ

カテゴリー