Architecturing Binarized Neural Networks for Traffic Sign Recognition

要約

交通標識は、道路の安全と交通の流れの管理をサポートするため、自動運転のビジョンシステムに不可欠な要素です。
ディープラーニングの使用は、畳み込みニューラルネットワーク (CNN) を使用して高精度の結果が得られるため、交通標識の分類でよく知られていますが (最先端の技術は 99.46\%)、二値化ニューラルネットワーク (BNN) についてはほとんど知られていません。
CNN と比較して、BNN はモデルのサイズを縮小し、畳み込み演算を簡素化し、自動運転のコンテキストで現れる計算量が制限され、エネルギーに制約のあるデバイスで有望な結果を示しています。
この作業は、構成層の特性を研究することにより、BNN を構築するためのボトムアップアプローチを提示します。
これらの構成層 (2 値化された畳み込み層、最大プーリング、バッチ正規化、全結合層) は、トレーニングにドイツの交通標識認識ベンチマーク (GTSRB) を使用して、カーネルサイズ、フィルター数、ニューロンのさまざまな値をさまざまに組み合わせて調査されます。
.
その結果、GTSRB で $90\%$ (最大は $96.45\%$) を超え、平均が $80\%$ (最大は $88.99\%$) を超える BNN アーキテクチャを提案します。
テスト用の中国のデータセット。
これらのアーキテクチャのパラメータ数は、100k から 2M 未満までさまざまです。
この論文に付随する資料は、https://github.com/apostovan21/BinarizedNeuralNetwork で公開されています。

要約(オリジナル)

Traffic signs support road safety and managing the flow of traffic, hence are an integral part of any vision system for autonomous driving. While the use of deep learning is well-known in traffic signs classification due to the high accuracy results obtained using convolutional neural networks (CNNs) (state of the art is 99.46\%), little is known about binarized neural networks (BNNs). Compared to CNNs, BNNs reduce the model size and simplify convolution operations and have shown promising results in computationally limited and energy-constrained devices which appear in the context of autonomous driving. This work presents a bottom-up approach for architecturing BNNs by studying characteristics of the constituent layers. These constituent layers (binarized convolutional layers, max pooling, batch normalization, fully connected layers) are studied in various combinations and with different values of kernel size, number of filters and of neurons by using the German Traffic Sign Recognition Benchmark (GTSRB) for training. As a result, we propose BNNs architectures which achieve more than $90\%$ for GTSRB (the maximum is $96.45\%$) and an average greater than $80\%$ (the maximum is $88.99\%$) considering also the Belgian and Chinese datasets for testing. The number of parameters of these architectures varies from 100k to less than 2M. The accompanying material of this paper is publicly available at https://github.com/apostovan21/BinarizedNeuralNetwork.

arxiv情報

著者	Andreea Postovan,Mădălina Eraşcu
発行日	2023-03-27 08:46:31+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Architecturing Binarized Neural Networks for Traffic Sign Recognition

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー