Category Archives: cond-mat.dis-nn

A replica analysis of under-bagging

Posted on April 16, 2024 by jarxiv

Abstract: A common ensemble learning method for training classifiers from imbalanced data … Continue reading →

Categories: cond-mat.dis-nn, cond-mat.stat-mech, cs.LG, stat.ML | Comments are closed

Rotation-equivariant Graph Neural Networks for Learning Glassy Liquids Representations

Posted on April 15, 2024 by jarxiv

Abstract: The difficult problem of relating the static structure of glassy liquids to their dynamics … Continue reading →

Categories: cond-mat.dis-nn, cond-mat.soft, cs.LG | Comments are closed

A Dynamical Model of Neural Scaling Laws

Posted on April 15, 2024 by jarxiv

Abstract: On a variety of tasks, the performance of neural networks … Continue reading →

Categories: cond-mat.dis-nn, cs.LG, stat.ML | Comments are closed

Grokking as the Transition from Lazy to Rich Training Dynamics

Posted on April 12, 2024 by jarxiv

Abstract: We … the training loss of a neural network far … than its test loss … Continue reading →

Categories: cond-mat.dis-nn, cs.LG, stat.ML | Comments are closed

Neural population geometry and optimal coding of tasks with shared latent structure

Posted on April 12, 2024 by jarxiv

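Abstract: Humans and animals recognize latent structure in their environment and use that information to efficiently … the world … Continue reading →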

Categories: cond-mat.dis-nn, cond-mat.stat-mech, cs.LG, cs.NE, q-bio.NC | Comments are closed

Mapping of attention mechanisms to a generalized Potts model

Posted on April 5, 2024 by jarxiv

Abstract: Transformers are neural networks that have revolutionized natural language processing and machine learning … Continue reading →

Categories: cond-mat.dis-nn, cond-mat.stat-mech, cs.CL, stat.ML | Comments are closed

X-LoRA: Mixture of Low-Rank Adapter Experts, a Flexible Framework for Large Language Models with Applications in Protein Mechanics and Molecular Design

Posted on April 2, 2024 by jarxiv

Abstract: We … a deep, layer-wise, token-level approach based on low-rank adaptation (LoRA) … Continue reading →

Categories: cond-mat.dis-nn, cond-mat.soft, cs.AI, cs.CL, cs.LG, q-bio.QM | Comments are closed

Robustness of the Random Language Model

Posted on March 25, 2024 by jarxiv

Abstract: The Random Language Model (De Giuli 2019) … human language and computer … Continue reading →

Categories: cond-mat.dis-nn, cs.CL | Comments are closed

Asymptotic generalization error of a single-layer graph convolutional network

Posted on March 21, 2024 by jarxiv

Abstract: Graph convolutional networks have shown practical promise, but as a function of the number of samples … Continue reading →

Categories: cond-mat.dis-nn, cs.LG | Comments are closed

The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold

Posted on March 20, 2024 by jarxiv

Abstract: We … for analyzing the trajectories of deep networks' predictions during training … Continue reading →