「cs.LG」カテゴリーアーカイブ

Efficient distributed representations with linear-time attention scores normalization

投稿日: 2024年10月31日作成者: jarxiv

要約注意スコア行列 ${\rm SoftMax}(XY^T)$ は、オブジェク … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

投稿日: 2024年10月31日作成者: jarxiv

要約大規模言語モデル (LLM) および大規模マルチモーダルモデル (LMM … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Linear Adversarial Concept Erasure

投稿日: 2024年10月31日作成者: jarxiv

要約テキストデータに基づいてトレーニングされた最新のニューラルモデルは、直 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback

投稿日: 2024年10月31日作成者: jarxiv

要約自然言語記述からの高密度報酬の自動合成は、強化学習 (RL) における有望 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.RO | コメントを受け付けていません

Controlling Language and Diffusion Models by Transporting Activations

投稿日: 2024年10月31日作成者: jarxiv

要約大規模な生成モデルの機能が向上し、その導入がますます広範囲に行われるように … 続きを読む →

カテゴリー: 49Q22, 68T07, cs.AI, cs.CL, cs.CV, cs.LG, I.2.6 | コメントを受け付けていません

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models

投稿日: 2024年10月31日作成者: jarxiv

要約言語モデル (LM) 表現にはどのような潜在的な機能がエンコードされていま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Don’t Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification

投稿日: 2024年10月31日作成者: jarxiv

要約最先端のエクストリームマルチラベルテキスト分類 (XMTC) モデルは … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Benchmarking Agentic Workflow Generation

投稿日: 2024年10月31日作成者: jarxiv

要約大規模言語モデル (LLM) は、幅広いタスクを処理する優れた能力を備えて … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.LG, cs.MA | コメントを受け付けていません

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

投稿日: 2024年10月31日作成者: jarxiv

要約大規模言語モデル (LLM) 推論の計算上の課題は、特にプロンプトの長 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning

投稿日: 2024年10月31日作成者: jarxiv

要約インコンテキスト学習は、大規模言語モデル (LLM) が追加のトレーニング … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Efficient distributed representations with linear-time attention scores normalization

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

Linear Adversarial Concept Erasure

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback

Controlling Language and Diffusion Models by Transporting Activations

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models

Don’t Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification

Benchmarking Agentic Workflow Generation

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー