Monthly Archives: March 2025

Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate

Posted on March 6, 2025 by jarxiv

Abstract: Reinforcement learning provides a mathematical framework for learning-based control. Its success …

Categories: cs.LG, math.OC

On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning

Posted on March 6, 2025 by jarxiv

Abstract: On a continuous domain for the data prediction task of (multimodal) self-supervised representation learning …

Categories: cs.LG, stat.ML

Graph-Augmented LSTM for Forecasting Sparse Anomalies in Graph-Structured Time Series

Posted on March 6, 2025 by jarxiv

Abstract: Detecting anomalies in time series data is an important task in many domains. …

Categories: cs.LG

Towards Understanding Distilled Reasoning Models: A Representational Approach

Posted on March 6, 2025 by jarxiv

Abstract: This paper concerns how model distillation relates to the development of reasoning capabilities in large language models (LLMs) …

Categories: cs.LG

Opportunistic Routing in Wireless Communications via Learnable State-Augmented Policies

Posted on March 6, 2025 by jarxiv

Abstract: This paper deals with large-scale wireless communication networks and packet-based information …

Categories: cs.LG, eess.SP

Constrained Gaussian Wasserstein Optimal Transport with Commutative Covariance Matrices

Posted on March 6, 2025 by jarxiv

Abstract: Optimal transport has found widespread applications in signal processing and machine learning …

Categories: cs.IT, cs.LG, math.IT

PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning

Posted on March 6, 2025 by jarxiv

Abstract: Traffic classification is essential for cybersecurity, but encrypted traffic …

Categories: cs.CR, cs.LG

Personalize Your LLM: Fake it then Align it

Posted on March 6, 2025 by jarxiv

Abstract: Personalizing large language models (LLMs), with respect to user experience, …

Categories: cs.LG

Unified Mind Model: Reimagining Autonomous Agents in the LLM Era

Posted on March 6, 2025 by jarxiv

Abstract: Large language models (LLMs) have recently, across domains, tasks, and languages (ChatGPT …

Categories: cs.AI, cs.CL

Visualising Policy-Reward Interplay to Inform Zeroth-Order Preference Optimisation of Large Language Models

Posted on March 6, 2025 by jarxiv

Abstract: Fine-tuning LLMs with first-order methods such as backpropagation is computationally intensive …

Categories: cs.CL

