"cs.LG" Category Archive

Is Adversarial Training with Compressed Datasets Effective?

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Dataset condensation (DC), from a larger dataset, small, synthetic data …

Categories: cs.LG

Understanding Virtual Nodes: Oversquashing and Node Heterogeneity

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Message-passing neural networks (MPNNs), in a variety of applications …

Categories: cs.LG

Learning Coarse-Grained Dynamics on Graph

Posted on: April 8, 2025 | Author: jarxiv

Abstract: A graph neural network (GNN) non-Markovian modeling framework …

Categories: cond-mat.dis-nn, cs.LG, cs.NA, math.NA

Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Diffusion models, particularly when generating high-quality samples through iterative denoising, exceptional …

Categories: cs.LG, cs.NA, math.NA, math.ST, stat.ML, stat.TH

A Llama walks into the ‘Bar’: Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Legal reasoning tasks, owing to domain-specific knowledge and the complexity of the reasoning process, large …

Categories: cs.AI, cs.CL, cs.LG, I.2.1

Towards Visual Text Grounding of Multimodal Large Language Model

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Despite the existing advances in multimodal large language models (MLLMs), particularly …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

Differential Transformer

Posted on: April 8, 2025 | Author: jarxiv

Abstract: The Transformer tends to overallocate attention to irrelevant context. …

Categories: cs.CL, cs.LG

Mixture-of-Personas Language Models for Population Simulation

Posted on: April 8, 2025 | Author: jarxiv

Abstract: With advances in large language models (LLMs), LLMs in social science research and machine learning model …

Categories: cs.CL, cs.LG

DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Linear recurrent neural networks (linear RNNs), for sequence modeling …

Categories: cs.CL, cs.FL, cs.LG

State Tuning: State-based Test-Time Scaling on RWKV-7

Posted on: April 8, 2025 | Author: jarxiv

Abstract: Test-time scaling has emerged as a prominent research direction in machine learning, …

Categories: cs.CL, cs.LG
