「cs.AR」カテゴリーアーカイブ

Introducing Instruction-Accurate Simulators for Performance Estimation of Autotuning Workloads

投稿日: 2025年5月20日作成者: jarxiv

要約機械学習（ML）ワークロードの加速には、最適化スペースが大きいため、効率的 … 続きを読む →

カテゴリー: cs.AR, cs.LG | コメントを受け付けていません

GPU Performance Portability needs Autotuning

投稿日: 2025年5月16日作成者: jarxiv

要約 LLMSが複雑になるにつれて、最先端のパフォーマンスを達成するには、アルゴ … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.PL | コメントを受け付けていません

Scaling Laws for Floating Point Quantization Training

投稿日: 2025年5月14日作成者: jarxiv

要約低精度トレーニングは、トレーニングと下流の推論コストの両方を削減するための … 続きを読む →

カテゴリー: cs.AR, cs.CL, cs.LG | コメントを受け付けていません

MINIMALIST: switched-capacitor circuits for efficient in-memory computation of gated recurrent units

投稿日: 2025年5月14日作成者: jarxiv

要約再発性ニューラルネットワーク（RNN）は、特に埋め込まれたエッジコンピュー … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.LG, eess.SP | コメントを受け付けていません

ApproXAI: Energy-Efficient Hardware Acceleration of Explainable AI using Approximate Computing

投稿日: 2025年5月13日作成者: jarxiv

要約説明可能な人工知能（XAI）は、最適化の問題として解釈可能性をフレーミング … 続きを読む →

カテゴリー: cs.AI, cs.AR | コメントを受け付けていません

LightNobel: Improving Sequence Length Limitation in Protein Structure Prediction Model via Adaptive Activation Quantization

投稿日: 2025年5月12日作成者: jarxiv

要約 Alphafold2やESMFoldなどのタンパク質構造予測モデル（PPM … 続きを読む →

カテゴリー: B.7, cs.AI, cs.AR, cs.ET, cs.LG | コメントを受け付けていません

Assessing Tenstorrent’s RISC-V MatMul Acceleration Capabilities

投稿日: 2025年5月12日作成者: jarxiv

要約大規模な言語モデル（LLMS）サービスとしての生成AIの需要の増加により、 … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.PF | コメントを受け付けていません

TransAxx: Efficient Transformers with Approximate Computing

投稿日: 2025年5月8日作成者: jarxiv

要約変圧器アーキテクチャによって最近導入されたVision Transfran … 続きを読む →

カテゴリー: cs.AR, cs.LG | コメントを受け付けていません

Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition

投稿日: 2025年5月8日作成者: jarxiv

要約セキュリティの強化や認可された会場への非接触アクセスなど、いくつかのアプリ … 続きを読む →

カテゴリー: cs.AR, cs.CV, eess.IV | コメントを受け付けていません

Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration

投稿日: 2025年5月8日作成者: jarxiv

要約リアルタイムで正確な顔の検出と公共の場所での認識に特化した費用対効果の高い … 続きを読む →

カテゴリー: cs.AR, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

「cs.AR」カテゴリーアーカイブ

Introducing Instruction-Accurate Simulators for Performance Estimation of Autotuning Workloads

GPU Performance Portability needs Autotuning

Scaling Laws for Floating Point Quantization Training

MINIMALIST: switched-capacitor circuits for efficient in-memory computation of gated recurrent units

ApproXAI: Energy-Efficient Hardware Acceleration of Explainable AI using Approximate Computing

LightNobel: Improving Sequence Length Limitation in Protein Structure Prediction Model via Adaptive Activation Quantization

Assessing Tenstorrent’s RISC-V MatMul Acceleration Capabilities

TransAxx: Efficient Transformers with Approximate Computing

Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition

Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration

最近の投稿

最近のコメント

アーカイブ

カテゴリー