投稿者「jarxiv」のアーカイブ

LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models

投稿日: 2025年6月16日作成者: jarxiv

要約エキスパート（MOE）の混合物は、最近、継続的なマルチモーダル学習のための … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Improving Causal Interventions in Amnesic Probing with Mean Projection or LEACE

投稿日: 2025年6月16日作成者: jarxiv

要約健忘環境は、モデルの挙動に関する特定の言語情報の影響を調べるために使用され … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

LLMs for Sentence Simplification: A Hybrid Multi-Agent prompting Approach

投稿日: 2025年6月16日作成者: jarxiv

要約このペーパーでは、複雑な文章を論理的で単純化した文のシーケンスに変換すると … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Configurable Preference Tuning with Rubric-Guided Synthetic Data

投稿日: 2025年6月16日作成者: jarxiv

要約直接選好最適化（DPO）を支えるなど、AIアライメントの人間のフィードバッ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

The Cambrian Explosion of Mixed-Precision Matrix Multiplication for Quantized Deep Learning Inference

投稿日: 2025年6月16日作成者: jarxiv

要約 Deep Learning（DL）の最近の進歩により、FP16、BF16、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

VM14K: First Vietnamese Medical Benchmark

投稿日: 2025年6月16日作成者: jarxiv

要約医療ベンチマークは、英語を話す非英語を話すコミュニティのヘルスケアにおける … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Entropy Controllable Direct Preference Optimization

投稿日: 2025年6月16日作成者: jarxiv

要約大規模な言語モデル（LLM）の訓練後、人間のフィードバック（RLHF）から … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Table-R1: Region-based Reinforcement Learning for Table Understanding

投稿日: 2025年6月16日作成者: jarxiv

要約テーブルは、構造化された列列相互作用のために言語モデルのユニークな課題を提 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

DART: Distilling Autoregressive Reasoning to Silent Thought

投稿日: 2025年6月16日作成者: jarxiv

要約チェーンオブテーブ（COT）の推論は、複雑なタスクの解決において大規模な言 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

投稿日: 2025年6月16日作成者: jarxiv

要約ディープリサーチエージェントは、LLMベースのエージェントの顕著なカテゴリ … 続きを読む →

カテゴリー: cs.CL, cs.IR | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models

Improving Causal Interventions in Amnesic Probing with Mean Projection or LEACE

LLMs for Sentence Simplification: A Hybrid Multi-Agent prompting Approach

Configurable Preference Tuning with Rubric-Guided Synthetic Data

The Cambrian Explosion of Mixed-Precision Matrix Multiplication for Quantized Deep Learning Inference

VM14K: First Vietnamese Medical Benchmark

Entropy Controllable Direct Preference Optimization

Table-R1: Region-based Reinforcement Learning for Table Understanding

DART: Distilling Autoregressive Reasoning to Silent Thought

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

最近の投稿

最近のコメント

アーカイブ

カテゴリー