"cs.LG" Category Archive

Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning

Posted on: July 26, 2024 | Author: jarxiv

Abstract: The emergence of quantum reinforcement learning (QRL), particularly built on variational quantum circuits (VQC) …

Categories: cs.AI, cs.DC, cs.LG, cs.NE, quant-ph

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Autoregressive large language models (LLMs) have shown impressive performance on language tasks …

Categories: cs.AI, cs.CL, cs.LG

ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Vision Transformers (ViTs) have shown excellent performance, and …

Categories: cs.AI, cs.LG

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) have shown excellent performance on language tasks …

Categories: cs.AI, cs.CL, cs.LG

Exploring Scaling Trends in LLM Robustness

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Language model capabilities, as model size and training data are scaled, …

Categories: cs.AI, cs.CL, cs.CR, cs.LG, I.2.7

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Central to enabling intelligent agentic behavior in foundation models …

Categories: cs.AI, cs.CL, cs.LG

LoRA-Pro: Are Low-Rank Adapters Properly Optimized?

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Low-rank adaptation, also known as LoRA, expresses the original matrix through two low-rank matrices …

Categories: cs.AI, cs.CL, cs.LG

Network Inversion of Convolutional Neural Nets

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Neural networks are powerful tools across a wide range of applications …

Categories: cs.CV, cs.LG

HVM-1: Large-scale video models pretrained with nearly 5000 hours of human-like video data

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Using the spatiotemporal masked autoencoder (ST-MAE) algorithm, we …

Categories: cs.CV, cs.LG, cs.NE, q-bio.NC

3D Diffuser Actor: Policy Diffusion with 3D Scene Representations

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Diffusion policies learn robot action distributions conditioned on the state of the robot and the environment …

Categories: cs.AI, cs.CV, cs.LG, cs.RO
