Monthly Archives: March 2025

Model Assembly Learning with Heterogeneous Layer Weight Merging

Posted on: March 28, 2025 by jarxiv

Abstract: Model merging, by combining the parameters of multiple models, additional …

Categories: cs.AI, cs.CL, cs.LG

TSKANMixer: Kolmogorov-Arnold Networks with MLP-Mixer Model for Time Series Forecasting

Posted on: March 28, 2025 by jarxiv

Abstract: Time series forecasting, in diverse fields such as economics, energy, healthcare, and traffic management, …

Categories: cs.AI, cs.LG

COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing

Posted on: March 28, 2025 by jarxiv

Abstract: With the rapid growth of digital communication, in multilingual communities, code-mixing, …

Categories: cs.AI, cs.CL

Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base

Posted on: March 28, 2025 by jarxiv

Abstract: The widespread adoption of Internet of Things (IoT) devices, particularly distributed denial-of-service …

Categories: cs.AI, cs.CR, cs.NI

LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning

Posted on: March 28, 2025 by jarxiv

Abstract: In recent years, as for large language models (LLMs), significant advances in natural language processing (NLP) …

Categories: cs.AI, cs.CL

Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience

Posted on: March 28, 2025 by jarxiv

Abstract: With the surge in generative AI workloads, while containing operational costs, GPUs and specialized …

Categories: 68U01, cs.AI, cs.PF

Elementwise Layer Normalization

Posted on: March 28, 2025 by jarxiv

Abstract: A recent paper proposes Dynamic Tanh (DyT) as a drop-in replacement for layer normalization …

Categories: cs.AI, cs.CL, cs.LG

Outlier dimensions favor frequent tokens in language model

Posted on: March 28, 2025 by jarxiv

Abstract: Last-layer outlier dimensions, that is, extreme activations for the majority of inputs …

Categories: cs.AI, cs.CL, I.2.7

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Posted on: March 28, 2025 by jarxiv

Abstract: Alignment of large language models (LLMs), in applications, safe …

Categories: cs.AI, cs.CL

ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

Posted on: March 28, 2025 by jarxiv

Abstract: Large reasoning models (LRMs) exhibit remarkable reasoning abilities, but mainly parametric …

Categories: cs.AI, cs.CL
