「cs.LG」カテゴリーアーカイブ

Can GPT Improve the State of Prior Authorization via Guideline Based Automated Question Answering?

投稿日: 2024年10月28日作成者: jarxiv

要約健康保険会社には、事前承認 (PA) と呼ばれる定義されたプロセスがありま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead

投稿日: 2024年10月28日作成者: jarxiv

要約低ランク適応 (LoRA) を使用して大規模言語モデル (LLM) を微調 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DC, cs.LG | コメントを受け付けていません

Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) は、さまざまなドメインの多くのタスクで顕著な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

MILES: Making Imitation Learning Easy with Self-Supervision

投稿日: 2024年10月28日作成者: jarxiv

要約模倣学習におけるデータ収集には、多くの場合、強化学習を組み込んだ手法の場合 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) の微調整は、事前トレーニングされたモデルを下 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Enhancing Resilience and Scalability in Travel Booking Systems: A Microservices Approach to Fault Tolerance, Load Balancing, and Service Discovery

投稿日: 2024年10月28日作成者: jarxiv

要約このペーパーでは、スケーラブルで信頼性の高い航空予約システムの開発における … 続きを読む →

カテゴリー: cs.AI, cs.CE, cs.LG, cs.SE | コメントを受け付けていません

Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions

投稿日: 2024年10月28日作成者: jarxiv

要約オフライン環境での好みに基づく強化学習 (PBRL) は、チャットボットな … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

On Designing Effective RL Reward at Training Time for LLM Reasoning

投稿日: 2024年10月28日作成者: jarxiv

要約報酬モデルは、LLM の推論能力を向上させるためにますます重要になっていま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Adversarial Environment Design via Regret-Guided Diffusion Models

投稿日: 2024年10月28日作成者: jarxiv

要約環境変化に強いエージェントをトレーニングすることは、深層強化学習 (RL) … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Automated Rewards via LLM-Generated Progress Functions

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) には、さまざまなタスクにわたって広範なドメイ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Can GPT Improve the State of Prior Authorization via Guideline Based Automated Question Answering?

Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead

Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension

MILES: Making Imitation Learning Easy with Self-Supervision

Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs

Enhancing Resilience and Scalability in Travel Booking Systems: A Microservices Approach to Fault Tolerance, Load Balancing, and Service Discovery

Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions

On Designing Effective RL Reward at Training Time for LLM Reasoning

Adversarial Environment Design via Regret-Guided Diffusion Models

Automated Rewards via LLM-Generated Progress Functions

最近の投稿

最近のコメント

アーカイブ

カテゴリー