「cs.LG」カテゴリーアーカイブ

NeMo-Inspector: A Visualization Tool for LLM Generation Analysis

投稿日: 2025年5月5日作成者: jarxiv

要約大規模言語モデル（LLM）を新しいタスクに適応させ、その全体的な能力を向上 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias

投稿日: 2025年5月5日作成者: jarxiv

要約言語認識タスクは自然言語処理（NLP）の基本であり、大規模言語モデル（LL … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Llama-Nemotron: Efficient Reasoning Models

投稿日: 2025年5月5日作成者: jarxiv

要約 Llama-Nemotronシリーズは、卓越した推論能力、推論効率、オープ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Attack and defense techniques in large language models: A survey and new perspectives

投稿日: 2025年5月5日作成者: jarxiv

要約大規模言語モデル（LLM）は、多くの自然言語処理タスクの中心的存在となって … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG | コメントを受け付けていません

Towards the Resistance of Neural Network Watermarking to Fine-tuning

投稿日: 2025年5月5日作成者: jarxiv

要約本稿では、ディープニューラルネットワーク(DNN)に所有者情報を埋め込むた … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Competition Dynamics Shape Algorithmic Phases of In-Context Learning

投稿日: 2025年5月5日作成者: jarxiv

要約文脈内学習（In-Context Learning: ICL）は、大規模言 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

ICLR: In-Context Learning of Representations

投稿日: 2025年5月5日作成者: jarxiv

要約最近の研究では、事前学習データによって指定された意味論が、大規模言語モデル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity

投稿日: 2025年5月5日作成者: jarxiv

要約ミームのクラスタリングは、毒性検出、バイラリティモデリング、タイピングのた … 続きを読む →

カテゴリー: cs.CL, cs.IR, cs.LG, cs.MM | コメントを受け付けていません

MoDeGPT: Modular Decomposition for Large Language Model Compression

投稿日: 2025年5月5日作成者: jarxiv

要約大規模言語モデル(LLM)は、様々なタスクにおいて卓越した性能を発揮するこ … 続きを読む →

カテゴリー: (Primary), cs.CL, cs.LG, I.2.7, stat.ML | コメントを受け付けていません

FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning

投稿日: 2025年5月5日作成者: jarxiv

要約大規模言語モデル(LLM)のファインチューニングはタスク適応に不可欠である … 続きを読む →

カテゴリー: cs.CL, cs.DC, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

NeMo-Inspector: A Visualization Tool for LLM Generation Analysis

How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias

Llama-Nemotron: Efficient Reasoning Models

Attack and defense techniques in large language models: A survey and new perspectives

Towards the Resistance of Neural Network Watermarking to Fine-tuning

Competition Dynamics Shape Algorithmic Phases of In-Context Learning

ICLR: In-Context Learning of Representations

Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity

MoDeGPT: Modular Decomposition for Large Language Model Compression

FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning

最近の投稿

最近のコメント

アーカイブ

カテゴリー