月別アーカイブ: 2025年2月

Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs

投稿日: 2025年2月21日作成者: jarxiv

要約大規模な言語モデルは、微調整を通じてタスク固有のアプリケーションで顕著な能 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

投稿日: 2025年2月21日作成者: jarxiv

要約低メモリの要件と速い収束を備えた大規模な言語モデル（LLMS）の効率的なオ … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs

投稿日: 2025年2月21日作成者: jarxiv

要約 Multi-Head Latent Atterness（MLA）は、Key … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Revealing and Mitigating Over-Attention in Knowledge Editing

投稿日: 2025年2月21日作成者: jarxiv

要約大規模な言語モデルは、幅広いタスクで優れたパフォーマンスを実証していますが … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

投稿日: 2025年2月21日作成者: jarxiv

要約サンプリングベースの検索は、テスト時間計算を利用するための単純なパラダイム … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

投稿日: 2025年2月21日作成者: jarxiv

要約投機的なサンプリングは、ドラフト – ヴェイロ化メカニズムを利 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Interpretable Text Embeddings and Text Similarity Explanation: A Primer

投稿日: 2025年2月21日作成者: jarxiv

要約テキストの埋め込みモデルとテキスト埋め込みモデルは、多くのAIおよびNLP … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

投稿日: 2025年2月21日作成者: jarxiv

要約大規模な言語モデル（LLM）は、長いシーケンスの処理において顕著な可能性を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DC, cs.LG, cs.PF | コメントを受け付けていません

An Open-Source Tool for Mapping War Destruction at Scale in Ukraine using Sentinel-1 Time Series

投稿日: 2025年2月21日作成者: jarxiv

要約詳細な戦争影響評価へのアクセスは、人道的組織が影響を受ける集団を効果的に支 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Single-image Reflectance and Transmittance Estimation from Any Flatbed Scanner

投稿日: 2025年2月21日作成者: jarxiv

要約フラットベッドスキャナーは、高解像度の単像材料キャプチャのための有望なデバ … 続きを読む →

カテゴリー: (Primary), 68U05, 68U10, cs.AI, cs.CV, cs.GR, cs.LG, I.2.6 | コメントを受け付けていません

月別アーカイブ: 2025年2月

Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs

Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs

Revealing and Mitigating Over-Attention in Knowledge Editing

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Interpretable Text Embeddings and Text Similarity Explanation: A Primer

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

An Open-Source Tool for Mapping War Destruction at Scale in Ukraine using Sentinel-1 Time Series

Single-image Reflectance and Transmittance Estimation from Any Flatbed Scanner

最近の投稿

最近のコメント

アーカイブ

カテゴリー