「cs.LG」カテゴリーアーカイブ

An Aspect Extraction Framework using Different Embedding Types, Learning Models, and Dependency Structure

投稿日: 2025年3月6日作成者: jarxiv

要約エンティティの特定の特徴に関連するセンチメント表現に細粒の洞察を提供する能 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency

投稿日: 2025年3月6日作成者: jarxiv

要約チェーンオブシュート（COT）は、大規模な言語モデル（LLM）の推論パフォ … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention

投稿日: 2025年3月6日作成者: jarxiv

要約大規模な言語モデル（LLM）は、長いコンテキストを処理する際の注意メカニズ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Unveiling Simplicities of Attention: Adaptive Long-Context Head Identification

投稿日: 2025年3月6日作成者: jarxiv

要約長いコンテキストを処理する能力は、多くの自然言語処理タスクにとって重要です … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction

投稿日: 2025年3月6日作成者: jarxiv

要約類推的な推論は概念的な抽象化に依存していますが、大規模な言語モデル（LLM … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition

投稿日: 2025年3月6日作成者: jarxiv

要約最新のディープラーニングモデルは、多くの場合、全体的なパフォーマンスが高い … 続きを読む →

カテゴリー: cs.CL, cs.LG, eess.AS | コメントを受け付けていません

Effective LLM Knowledge Learning via Model Generalization

投稿日: 2025年3月6日作成者: jarxiv

要約大規模な言語モデル（LLM）は、広範な世界知識を含む膨大な文書で訓練されて … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Improving LLM Safety Alignment with Dual-Objective Optimization

投稿日: 2025年3月6日作成者: jarxiv

要約大規模な言語モデル（LLM）の既存のトレーニング時間安全アライメント手法は … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.LG | コメントを受け付けていません

Online Scheduling for LLM Inference with KV Cache Constraints

投稿日: 2025年3月6日作成者: jarxiv

要約トレーニングされたモデルがユーザープロンプトに応じて一度に1つの単語を生成 … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.OC | コメントを受け付けていません

LLMs can be Dangerous Reasoners: Analyzing-based Jailbreak Attack on Large Language Models

投稿日: 2025年3月6日作成者: jarxiv

要約大規模な言語モデル（LLMS）の急速な発展は、さまざまなタスクにわたって大 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

An Aspect Extraction Framework using Different Embedding Types, Learning Models, and Dependency Structure

From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency

PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention

Unveiling Simplicities of Attention: Adaptive Long-Context Head Identification

Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction

CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition

Effective LLM Knowledge Learning via Model Generalization

Improving LLM Safety Alignment with Dual-Objective Optimization

Online Scheduling for LLM Inference with KV Cache Constraints

LLMs can be Dangerous Reasoners: Analyzing-based Jailbreak Attack on Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー