「cs.LG」カテゴリーアーカイブ

Distributed Mixture-of-Agents for Edge Inference with Large Language Models

投稿日: 2024年12月31日作成者: jarxiv

要約 Mixture-of-Agents (MoA) は、大規模言語モデル (L … 続きを読む →

カテゴリー: cs.CL, cs.DC, cs.IT, cs.LG, cs.NI, math.IT | コメントを受け付けていません

About rectified sigmoid function for enhancing the accuracy of Physics-Informed Neural Networks

投稿日: 2024年12月31日作成者: jarxiv

要約この記事では、物理的問題を解決するための 1 つの隠れ層と修正された活性化 … 続きを読む →

カテゴリー: (Primary), 65M99, cs.AI, cs.LG, cs.NA, I.2.1, math.NA, physics.comp-ph | コメントを受け付けていません

Towards Empirical Interpretation of Internal Circuits and Properties in Grokked Transformers on Modular Polynomials

投稿日: 2024年12月31日作成者: jarxiv

要約 Grokking は、遅延一般化の謎を明らかにするために積極的に研究されて … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Scaling Capability in Token Space: An Analysis of Large Vision Language Model

投稿日: 2024年12月31日作成者: jarxiv

要約スケーリング機能は、パラメーターの数とトレーニングデータのサイズに関して … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Geometric-Averaged Preference Optimization for Soft Preference Labels

投稿日: 2024年12月31日作成者: jarxiv

要約 LLM を人間の好みに合わせるためのアルゴリズムの多くは、人間の好みが二値 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Enhancing Annotated Bibliography Generation with LLM Ensembles

投稿日: 2024年12月31日作成者: jarxiv

要約この研究は、大規模言語モデル (LLM) アンサンブルを通じて注釈付き参考 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Causal-aware Graph Neural Architecture Search under Distribution Shifts

投稿日: 2024年12月31日作成者: jarxiv

要約グラフ NAS は、グラフとアーキテクチャ間の相関関係を活用して GNN … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Hedging Is Not All You Need: A Simple Baseline for Online Learning Under Haphazard Inputs

投稿日: 2024年12月31日作成者: jarxiv

要約エッジデバイスからのデータなど、無計画なストリーミングデータを処理する … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection

投稿日: 2024年12月31日作成者: jarxiv

要約多変量時系列 (MTS) 異常検出は、相互に関連する複数の時系列で構成され … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Nash CoT: Multi-Path Inference with Preference Equilibrium

投稿日: 2024年12月31日作成者: jarxiv

要約思考連鎖 (CoT) は、複雑な推論タスクにおける大規模言語モデル (LL … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.GT, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Distributed Mixture-of-Agents for Edge Inference with Large Language Models

About rectified sigmoid function for enhancing the accuracy of Physics-Informed Neural Networks

Towards Empirical Interpretation of Internal Circuits and Properties in Grokked Transformers on Modular Polynomials

Scaling Capability in Token Space: An Analysis of Large Vision Language Model

Geometric-Averaged Preference Optimization for Soft Preference Labels

Enhancing Annotated Bibliography Generation with LLM Ensembles

Causal-aware Graph Neural Architecture Search under Distribution Shifts

Hedging Is Not All You Need: A Simple Baseline for Online Learning Under Haphazard Inputs

Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection

Nash CoT: Multi-Path Inference with Preference Equilibrium

最近の投稿

最近のコメント

アーカイブ

カテゴリー