「cs.LG」カテゴリーアーカイブ

HR-Bandit: Human-AI Collaborated Linear Recourse Bandit

投稿日: 2024年10月21日作成者: jarxiv

要約人間の医師は、患者がより効果的な治療を受けるために症状を修正できるようにす … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

投稿日: 2024年10月21日作成者: jarxiv

要約大規模言語モデル (LLM) の計算コストが高いため、量子化、スパース化、 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens

投稿日: 2024年10月21日作成者: jarxiv

要約言語モデルは多くの場合、トレーニングデータセット内の過去のトークンが与え … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Harnessing Causality in Reinforcement Learning With Bagged Decision Times

投稿日: 2024年10月21日作成者: jarxiv

要約袋詰めされた決定時間を持つ問題のクラスに対する強化学習 (RL) を検討し … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning

投稿日: 2024年10月21日作成者: jarxiv

要約大規模言語モデル (LLM) は、強化学習 (RL) タスクの報酬関数の設 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating the Accuracy-Robustness Tradeoff

投稿日: 2024年10月21日作成者: jarxiv

要約逆問題は、破損または摂動された測定値から目に見えないデータを再構築すること … 続きを読む →

カテゴリー: cs.LG, eess.SP | コメントを受け付けていません

Decomposing The Dark Matter of Sparse Autoencoders

投稿日: 2024年10月21日作成者: jarxiv

要約スパースオートエンコーダ (SAE) は、言語モデルのアクティベーション … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Self-supervised contrastive learning performs non-linear system identification

投稿日: 2024年10月21日作成者: jarxiv

要約自己教師あり学習 (SSL) アプローチは、多くのタスクや領域で大きな成功 … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus

投稿日: 2024年10月21日作成者: jarxiv

要約自然言語推論 (NLI) は、文のペアの含意関係を認識するタスクであり、自 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

投稿日: 2024年10月21日作成者: jarxiv

要約モデル回答の検証可能性を確保することは、質問応答 (QA) ドメインにおけ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

HR-Bandit: Human-AI Collaborated Linear Recourse Bandit

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens

Harnessing Causality in Reinforcement Learning With Bagged Decision Times

A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning

Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating the Accuracy-Robustness Tradeoff

Decomposing The Dark Matter of Sparse Autoencoders

Self-supervised contrastive learning performs non-linear system identification

A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

最近の投稿

最近のコメント

アーカイブ

カテゴリー