「cs.LG」カテゴリーアーカイブ

Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior

投稿日: 2024年8月2日作成者: jarxiv

要約私たちは未知のオブジェクトの再配置のタスクに焦点を当てます。このタスクでは … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

Sparks of Quantum Advantage and Rapid Retraining in Machine Learning

投稿日: 2024年8月2日作成者: jarxiv

要約量子コンピューティングの出現は、古典的なコンピュータよりも効率的に複雑な問 … 続きを読む →

カテゴリー: cs.ET, cs.LG, quant-ph, stat.ML | コメントを受け付けていません

End-to-End Reinforcement Learning of Koopman Models for Economic Nonlinear Model Predictive Control

投稿日: 2024年8月2日作成者: jarxiv

要約 (経済的) 非線形モデル予測制御 ((e)NMPC) には、十分に正確で計 … 続きを読む →

カテゴリー: cs.LG, cs.SY, eess.SY | コメントを受け付けていません

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

投稿日: 2024年8月2日作成者: jarxiv

要約スパースオートエンコーダ (SAE) は、言語モデル (LM) のアクテ … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Improving Retrieval for RAG based Question Answering Models on Financial Documents

投稿日: 2024年8月2日作成者: jarxiv

要約正確な応答を生成する際の大規模言語モデル (LLM) の有効性は、特に検索 … 続きを読む →

カテゴリー: cs.CL, cs.IR, cs.LG, q-fin.GN | コメントを受け付けていません

ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

投稿日: 2024年8月2日作成者: jarxiv

要約自己注意は大規模言語モデル (LLM) の重要なコンポーネントですが、長い … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Generalization in Neural Networks: A Broad Survey

投稿日: 2024年8月2日作成者: jarxiv

要約このペーパーでは、(1) サンプル、(2) ディストリビューション、(3) … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Learning Backdoors for Mixed Integer Linear Programs with Contrastive Learning

投稿日: 2024年8月2日作成者: jarxiv

要約現実世界の問題の多くは、混合整数線形計画法 (MILP) として効率的にモ … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.OC | コメントを受け付けていません

Dataset Distillation for Offline Reinforcement Learning

投稿日: 2024年8月2日作成者: jarxiv

要約オフライン強化学習では、多くの場合、ポリシーをトレーニングできる高品質のデ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Enhancing Stability for Large Models Training in Constrained Bandwidth Networks

投稿日: 2024年8月2日作成者: jarxiv

要約数十億のパラメータを使用して非常に大規模な言語モデルをトレーニングすること … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior

Sparks of Quantum Advantage and Rapid Retraining in Machine Learning

End-to-End Reinforcement Learning of Koopman Models for Economic Nonlinear Model Predictive Control

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

Improving Retrieval for RAG based Question Answering Models on Financial Documents

ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

Generalization in Neural Networks: A Broad Survey

Learning Backdoors for Mixed Integer Linear Programs with Contrastive Learning

Dataset Distillation for Offline Reinforcement Learning

Enhancing Stability for Large Models Training in Constrained Bandwidth Networks

最近の投稿

最近のコメント

アーカイブ

カテゴリー