月別アーカイブ: 2025年4月

LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams

投稿日: 2025年4月25日作成者: jarxiv

要約長いコンテキストの理解は、特に音声ベースの要素、高い冗長性、および不均一な … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure

投稿日: 2025年4月25日作成者: jarxiv

要約オートレーフレフな言語モデルは、さまざまなシナリオで優れたパフォーマンスを … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

PicPersona-TOD : A Dataset for Personalizing Utterance Style in Task-Oriented Dialogue with Image Persona

投稿日: 2025年4月25日作成者: jarxiv

要約タスク指向のダイアログ（TOD）システムは、自然言語の相互作用を通じてユー … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?

投稿日: 2025年4月25日作成者: jarxiv

要約非常に低リソース（XLR）言語には、NLPモデルのトレーニングにはかなりの … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Creating Targeted, Interpretable Topic Models with LLM-Generated Text Augmentation

投稿日: 2025年4月25日作成者: jarxiv

要約トピックモデリングやクラスタリングなどの監視されていない機械学習手法は、政 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation

投稿日: 2025年4月25日作成者: jarxiv

要約透かしは、大規模な言語モデル（LLM）で誤った情報と闘い、知的財産を保護す … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Transferable text data distillation by trajectory matching

投稿日: 2025年4月25日作成者: jarxiv

要約大規模な言語モデル（LLM）の領域では、大規模なモデルのサイズが大きくなる … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Probabilistic Subspace Manifolds for Contextual Inference in Large Language Models

投稿日: 2025年4月25日作成者: jarxiv

要約トークンの埋め込みを学習した多様体にわたって確率分布として表すことで、より … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Not All Data Are Unlearned Equally

投稿日: 2025年4月25日作成者: jarxiv

要約 Machine Ulearningは、訓練されたモデルから特定のデータポイ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

投稿日: 2025年4月25日作成者: jarxiv

要約潜在的なセマンティクスを獲得する機能は、言語モデルのパフォーマンスを決定す … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2025年4月

LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams

OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure

PicPersona-TOD : A Dataset for Personalizing Utterance Style in Task-Oriented Dialogue with Image Persona

Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?

Creating Targeted, Interpretable Topic Models with LLM-Generated Text Augmentation

Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation

Transferable text data distillation by trajectory matching

Probabilistic Subspace Manifolds for Contextual Inference in Large Language Models

Not All Data Are Unlearned Equally

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

最近の投稿

最近のコメント

アーカイブ

カテゴリー