CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning

要約

小型デバイスは、ソフトデッドラインで定期的な専用タスクを実行するために、IoT およびスマートシティアプリケーションで頻繁に使用されます。
この研究は、小型デバイス上の定期的なタスクのための効率的な電源管理方法を導き出す方法の開発に焦点を当てています。
まず、小型デバイスで使用される既存の Linux 組み込みメソッドの制限を検討します。
Linux の組み込みソリューションで管理するのが難しい 3 つの典型的なワークロード/システムパターンを示します。
我々は、3 つのシステムパターンが存在する場合でも効果的な DVFS ガバナーを導出する、時間エンコーディングを備えた強化学習ベースの手法を開発しました。
派生ガバナーは、組み込み Linux メカニズムと同じパフォーマンスカウンターを 1 つだけ使用し、ワークロードの明示的なタスクモデルを必要としません。
Nvidia Jetson Nano ボードにプロトタイプシステムを実装し、2 つの自己設計アプリケーションと 4 つのベンチマークアプリケーションを含む 6 つのアプリケーションで実験しました。
さまざまな期限制約の下で、私たちのアプローチは、パフォーマンス要件に適応し、エネルギー節約において組み込み Linux アプローチを上回る DVFS ガバナーを迅速に導き出すことができます。
Mibench ワークロードでは、パフォーマンススラックが 0.04 秒から 0.4 秒の範囲にあるため、提案された方法はオンデマンドと比較して 3% ～ 11% 多くのエネルギーを節約できます。
テストされた AudioReg および FaceReg アプリケーションでは、5% ～ 14% の省エネ改善が見られました。
私たちは、カーネル内の量子化ニューラルネットワークエンジンの実装をオープンソース化しました。
コードベースは https://github.com/coladog/tinyagent にあります。

要約(オリジナル)

Small devices are frequently used in IoT and smart-city applications to perform periodic dedicated tasks with soft deadlines. This work focuses on developing methods to derive efficient power-management methods for periodic tasks on small devices. We first study the limitations of the existing Linux built-in methods used in small devices. We illustrate three typical workload/system patterns that are challenging to manage with Linux’s built-in solutions. We develop a reinforcement-learning-based technique with temporal encoding to derive an effective DVFS governor even with the presence of the three system patterns. The derived governor uses only one performance counter, the same as the built-in Linux mechanism, and does not require an explicit task model for the workload. We implemented a prototype system on the Nvidia Jetson Nano Board and experimented with it with six applications, including two self-designed and four benchmark applications. Under different deadline constraints, our approach can quickly derive a DVFS governor that can adapt to performance requirements and outperform the built-in Linux approach in energy saving. On Mibench workloads, with performance slack ranging from 0.04 s to 0.4 s, the proposed method can save 3% – 11% more energy compared to Ondemand. AudioReg and FaceReg applications tested have 5%- 14% energy-saving improvement. We have open-sourced the implementation of our in-kernel quantized neural network engine. The codebase can be found at: https://github.com/coladog/tinyagent.

arxiv情報

著者	Ti Zhou,Man Lin
発行日	2023-09-07 15:28:03+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー