「cs.PF」カテゴリーアーカイブ

Assessing Tenstorrent’s RISC-V MatMul Acceleration Capabilities

投稿日: 2025年5月12日作成者: jarxiv

要約大規模な言語モデル（LLMS）サービスとしての生成AIの需要の増加により、 … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.PF | コメントを受け付けていません

CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

投稿日: 2025年5月5日作成者: jarxiv

要約大規模言語モデルは、様々なタスクにおいて目覚ましい成功を収めているが、推論 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.PF | コメントを受け付けていません

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

投稿日: 2025年5月2日作成者: jarxiv

要約量子化は、大規模な言語モデル（LLM）推論を加速できます。 INT8の量子 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.PF | コメントを受け付けていません

NSFlow: An End-to-End FPGA Framework with Scalable Dataflow Architecture for Neuro-Symbolic AI

投稿日: 2025年4月30日作成者: jarxiv

要約 Neuro-Symbolic AI（NSAI）は、AIシステムの透明性、推 … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.LG, cs.PF | コメントを受け付けていません

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

投稿日: 2025年4月22日作成者: jarxiv

要約大規模な言語モデル（LLM）は、長いシーケンスと複雑な推論タスクの処理にお … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DC, cs.LG, cs.PF | コメントを受け付けていません

Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS’s LLM-CLIP Framework for Image Captioning

投稿日: 2025年4月22日作成者: jarxiv

要約 MILS（Multimodal Iterative LLM Solver） … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.PF | コメントを受け付けていません

The Transient Cost of Learning in Queueing Systems

投稿日: 2025年4月8日作成者: jarxiv

要約キューイングシステムは、通信ネットワーク、ヘルスケア、サービスシステムなど … 続きを読む →

カテゴリー: cs.DS, cs.LG, cs.PF, math.PR | コメントを受け付けていません

Performance Modeling of Data Storage Systems using Generative Models

投稿日: 2025年4月7日作成者: jarxiv

要約システムの高精度モデリングは、産業データ解析の主要分野のひとつである。シス … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.PF | コメントを受け付けていません

NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices

投稿日: 2025年4月7日作成者: jarxiv

要約 Neural Radiance Fields（NeRF）は、3D再構成にお … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG, cs.MM, cs.PF | コメントを受け付けていません

A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers

投稿日: 2025年4月7日作成者: jarxiv

要約分類器の性能を正しく理解することは、様々なシナリオにおいて不可欠である。し … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.PF | コメントを受け付けていません

「cs.PF」カテゴリーアーカイブ

Assessing Tenstorrent’s RISC-V MatMul Acceleration Capabilities

CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

NSFlow: An End-to-End FPGA Framework with Scalable Dataflow Architecture for Neuro-Symbolic AI

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS’s LLM-CLIP Framework for Image Captioning

The Transient Cost of Learning in Queueing Systems

Performance Modeling of Data Storage Systems using Generative Models

NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices

A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers

最近の投稿

最近のコメント

アーカイブ

カテゴリー