"cs.LG" Category Archive

Preble: Efficient Distributed Prompt Scheduling for LLM Serving

Posted: October 4, 2024 | Author: jarxiv

Summary: Prompts to large language models (LLMs) have evolved beyond simple user questions …

Categories: cs.DC, cs.LG

Data Similarity-Based One-Shot Clustering for Multi-Task Hierarchical Federated Learning

Posted: October 4, 2024 | Author: jarxiv

Summary: For a hierarchical federated learning setting in which users work on learning different tasks, we …

Categories: cs.IT, cs.LG, cs.NI, eess.SP, math.IT

OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?

Posted: October 4, 2024 | Author: jarxiv

Summary: Out-of-distribution (OOD) generalization is difficult because distribution shifts appear in many forms. Numerous …

Categories: cs.LG

An Online Automatic Modulation Classification Scheme Based on Isolation Distributional Kernel

Posted: October 4, 2024 | Author: jarxiv

Summary: Automatic modulation classification (AMC) is an important technology in modern non-cooperative communication networks …

Categories: cs.LG

ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

Posted: October 4, 2024 | Author: jarxiv

Summary: By integrating long histories of experience into decision-making, intelligent embodied agents …

Categories: cs.LG

Forecasting Smog Clouds With Deep Learning

Posted: October 4, 2024 | Author: jarxiv

Summary: In this proof-of-concept study, between two locations, nitrogen dioxide (NO2), ozone (O3), (fine …

Categories: cs.LG

How to Train Long-Context Language Models (Effectively)

Posted: October 4, 2024 | Author: jarxiv

Summary: To make effective use of long-context information, we … language models (LMs) …

Categories: cs.CL, cs.LG

Jailbreaking LLMs with Arabic Transliteration and Arabizi

Posted: October 4, 2024 | Author: jarxiv

Summary: This study concerns the potential vulnerabilities of large language models (LLMs) to "jailbreak" attacks …

Categories: cs.CL, cs.LG

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization

Posted: October 4, 2024 | Author: jarxiv

Summary: Reinforcement learning from human feedback (RLHF) … language models to human preferences …

Categories: cs.CL, cs.LG

Grounding Large Language Models In Embodied Environment With Imperfect World Models

Posted: October 4, 2024 | Author: jarxiv

Summary: Despite widespread success across various applications, large language models …

Categories: cs.CL, cs.LG, cs.RO
