「cs.CL」カテゴリーアーカイブ

Mini-batch Coresets for Memory-efficient Training of Large Language Models

投稿日: 2024年10月11日作成者: jarxiv

要約より大きなミニバッチを使用してトレーニングすると、収束率が向上し、優れたパ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

投稿日: 2024年10月11日作成者: jarxiv

要約大規模言語モデル (LLM) は、機械翻訳 (MT) の品質を合理的に向上 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models

投稿日: 2024年10月11日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、さまざまなタスクにわたっ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.MM | コメントを受け付けていません

$\textbf{PLUM}$: Improving Code LMs with Execution-Guided On-Policy Preference Learning Driven By Synthetic Test Cases

投稿日: 2024年10月11日作成者: jarxiv

要約優先学習は、正しいコードと間違ったコードを区別するようにモデルが明示的にト … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.PL, cs.SE | コメントを受け付けていません

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions

投稿日: 2024年10月11日作成者: jarxiv

要約ツール学習により、大規模言語モデル (LLM) はツールを呼び出して外部環 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

PaliGemma: A versatile 3B VLM for transfer

投稿日: 2024年10月11日作成者: jarxiv

要約 PaliGemma は、SigLIP-So400m ビジョンエンコーダと … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs

投稿日: 2024年10月11日作成者: jarxiv

要約この論文では、視覚情報がモデルの内部常識知識と矛盾する、マルチモーダル大規 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Agent S: An Open Agentic Framework that Uses Computers Like a Human

投稿日: 2024年10月11日作成者: jarxiv

要約 Agent S は、グラフィカルユーザーインターフェイス (GUI) … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

投稿日: 2024年10月11日作成者: jarxiv

要約既存のマルチモーダル検索ベンチマークは、モデルが外部のテキスト知識を取得し … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

投稿日: 2024年10月11日作成者: jarxiv

要約コードは、その精度と精度により、大規模な言語モデルの数学的推論能力を強化す … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Mini-batch Coresets for Memory-efficient Training of Large Language Models

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models

$\textbf{PLUM}$: Improving Code LMs with Execution-Guided On-Policy Preference Learning Driven By Synthetic Test Cases

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions

PaliGemma: A versatile 3B VLM for transfer

Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs

Agent S: An Open Agentic Framework that Uses Computers Like a Human

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

最近の投稿

最近のコメント

アーカイブ

カテゴリー