「cs.LG」カテゴリーアーカイブ

Real-Time Device Reach Forecasting Using HLL and MinHash Data Sketches

投稿日: 2025年2月21日作成者: jarxiv

要約ユーザーが指定したターゲティング属性に基づいて、適切な数のテレビ（デバイス … 続きを読む →

カテゴリー: 60G25, cs.AI, cs.DB, cs.LG, I.5.3 | コメントを受け付けていません

Ray-Tracing for Conditionally Activated Neural Networks

投稿日: 2025年2月21日作成者: jarxiv

要約このホワイトペーパーでは、専門家（MOE）層の複数の混合物の階層構造を組み … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Rapid Word Learning Through Meta In-Context Learning

投稿日: 2025年2月21日作成者: jarxiv

要約人間は、いくつかの実例から新しい単語を迅速に学び、次に新しい文脈で体系的か … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Optimizing Model Selection for Compound AI Systems

投稿日: 2025年2月21日作成者: jarxiv

要約 Self RefineやMulti-Agent Debateなどの複数のL … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.MA | コメントを受け付けていません

The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity

投稿日: 2025年2月21日作成者: jarxiv

要約この論文では、回路の複雑さフレームワークを使用して、MAMBAおよび状態空 … 続きを読む →

カテゴリー: cs.AI, cs.CC, cs.CL, cs.LG | コメントを受け付けていません

Large Language Model Confidence Estimation via Black-Box Access

投稿日: 2025年2月21日作成者: jarxiv

要約モデルの応答に対する不確実性または自信を推定することは、応答だけでなく、モ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

投稿日: 2025年2月21日作成者: jarxiv

要約低メモリの要件と速い収束を備えた大規模な言語モデル（LLMS）の効率的なオ … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

投稿日: 2025年2月21日作成者: jarxiv

要約サンプリングベースの検索は、テスト時間計算を利用するための単純なパラダイム … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

投稿日: 2025年2月21日作成者: jarxiv

要約投機的なサンプリングは、ドラフト – ヴェイロ化メカニズムを利 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

投稿日: 2025年2月21日作成者: jarxiv

要約大規模な言語モデル（LLM）は、長いシーケンスの処理において顕著な可能性を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DC, cs.LG, cs.PF | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Real-Time Device Reach Forecasting Using HLL and MinHash Data Sketches

Ray-Tracing for Conditionally Activated Neural Networks

Rapid Word Learning Through Meta In-Context Learning

Optimizing Model Selection for Compound AI Systems

The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity

Large Language Model Confidence Estimation via Black-Box Access

Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

最近の投稿

最近のコメント

アーカイブ

カテゴリー