「cs.LG」カテゴリーアーカイブ

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models

投稿日: 2024年12月24日作成者: jarxiv

要約活性化の希薄性とは、活性化出力の中に寄与度が低い要素がかなり存在することを … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

投稿日: 2024年12月24日作成者: jarxiv

要約大規模言語モデル (LLM) による安全性リスクに対処する 1 つの方法は … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Tracking the Feature Dynamics in LLM Training: A Mechanistic Study

投稿日: 2024年12月24日作成者: jarxiv

要約トレーニングのダイナミクスと機能の進化を理解することは、大規模言語モデル … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

ResearchTown: Simulator of Human Research Community

投稿日: 2024年12月24日作成者: jarxiv

要約大規模言語モデル (LLM) は科学分野で顕著な可能性を示していますが、根 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Evaluation of Bio-Inspired Models under Different Learning Settings For Energy Efficiency in Network Traffic Prediction

投稿日: 2024年12月24日作成者: jarxiv

要約携帯電話トラフィック予測は、ネットワークオペレータが効率的にリソースを割 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning

投稿日: 2024年12月24日作成者: jarxiv

要約マスクされたオートエンコーダ (MAE) は最近、自己教師あり視覚表現学習 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV, eess.SP | コメントを受け付けていません

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

投稿日: 2024年12月24日作成者: jarxiv

要約最近、Linformer や Mamba などのアーキテクチャが、トランス … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

PC Agent: While You Sleep, AI Works — A Cognitive Journey into Digital World

投稿日: 2024年12月24日作成者: jarxiv

要約研究資料の整理、レポートの下書き、明日に必要なプレゼンテーションの作成など … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks

投稿日: 2024年12月24日作成者: jarxiv

要約最近、広範な一般知識と強力な推論能力を備えた大規模言語モデル (LLM) … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Mirage: A Multi-Level Superoptimizer for Tensor Programs

投稿日: 2024年12月24日作成者: jarxiv

要約テンソルプログラム用の初のマルチレベルスーパーオプティマイザーである … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.PL | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Tracking the Feature Dynamics in LLM Training: A Mechanistic Study

ResearchTown: Simulator of Human Research Community

Evaluation of Bio-Inspired Models under Different Learning Settings For Energy Efficiency in Network Traffic Prediction

The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

PC Agent: While You Sleep, AI Works — A Cognitive Journey into Digital World

CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks

Mirage: A Multi-Level Superoptimizer for Tensor Programs

最近の投稿

最近のコメント

アーカイブ

カテゴリー