Category archive: cs.LG

Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models

Posted: March 4, 2025 | Author: jarxiv

Abstract: Large language models (LLMs) have recently achieved remarkable success across a variety of tasks …

Categories: cs.LG

Efficient Learning Under Density Shift in Incremental Settings Using Cramér-Rao-Based Regularization

Posted: March 4, 2025 | Author: jarxiv

Abstract: With the continuous surge in data volume and velocity, the machine learning challenges that exist at the algorithmic level …

Categories: cs.LG, stat.ML

Towards Graph Foundation Models: A Study on the Generalization of Positional and Structural Encodings

Posted: March 4, 2025 | Author: jarxiv

Abstract: Positional and structural encodings (PSE) in graph neural …

Categories: cs.LG

Scintillation pulse characterization with spectrum-inspired temporal neural networks: case studies on particle detector signals

Posted: March 4, 2025 | Author: jarxiv

Abstract: Scintillator-based particle detectors in high-energy physics and astroparticle physics …

Categories: cs.LG, physics.data-an, physics.ins-det

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation

Posted: March 4, 2025 | Author: jarxiv

Abstract: Large language models (LLMs) trained on massive corpora demonstrate remarkable capabilities …

Categories: cs.LG

Exact Certification of (Graph) Neural Networks Against Label Poisoning

Posted: March 4, 2025 | Author: jarxiv

Abstract: For machine learning models, label flipping, i.e., performance-degrading …

Categories: cs.CR, cs.LG

The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs

Posted: March 4, 2025 | Author: jarxiv

Abstract: Multimodal large language models (MLLMs) have demonstrated impressive capabilities. …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

OLMoE: Open Mixture-of-Experts Language Models

Posted: March 4, 2025 | Author: jarxiv

Abstract: OLMoE, using a sparse Mixture-of-Experts (MoE) …

Categories: cs.AI, cs.CL, cs.LG

A Closer Look at Machine Unlearning for Large Language Models

Posted: March 4, 2025 | Author: jarxiv

Abstract: Large language models (LLMs) and the privacy and legal concerns raised by sensitive …

Categories: cs.AI, cs.CL, cs.LG

HORAE: A Domain-Agnostic Modeling Language for Automating Multimodal Service Regulation

Posted: March 4, 2025 | Author: jarxiv

Abstract: Artificial intelligence is rapidly permeating the field of service regulation. In this paper, diverse domains …

Categories: cs.AI, cs.CL, cs.LG
