More precise edge detections

要約

画像エッジ検出 (ED) は、コンピュータービジョンの基本タスクです。
ED アルゴリズムのパフォーマンスは CNN ベースのモデルの導入により大幅に改善されましたが、現在のモデルは、特に低い誤差許容距離しか許可されない場合、依然として満足のいく精度率に悩まされています。
したがって、より正確な予測のためのモデルアーキテクチャにはまだ調査が必要です。
一方で、人間が提供する避けられないノイズのトレーニングデータは、入力がエッジマップそのものであっても、不満足なモデル予測につながる可能性があるため、これも改善が必要です。
この論文では、カスケードスキッピング密度ブロック (CSDB) を使用して、より正確な ED モデルを示します。
私たちのモデルは、いくつかのデータセット、特に平均精度率 (AP) において最先端 (SOTA) 予測を取得しており、これは広範な実験によって確認されています。
さらに、私たちのモデルにはダウンサンプル操作が含まれておらず、広く信じられている操作が必要ではないことが証明されています。
また、トレーニング用のデータ拡張に関する新しい修正が採用されており、これによりノイズのないデータをモデルのトレーニングに使用できるようになり、エッジマップ自体で予測するモデルのパフォーマンスが向上します。

要約(オリジナル)

Image Edge detection (ED) is a base task in computer vision. While the performance of the ED algorithm has been improved greatly by introducing CNN-based models, current models still suffer from unsatisfactory precision rates especially when only a low error toleration distance is allowed. Therefore, model architecture for more precise predictions still needs an investigation. On the other hand, the unavoidable noise training data provided by humans would lead to unsatisfactory model predictions even when inputs are edge maps themselves, which also needs improvement. In this paper, more precise ED models are presented with cascaded skipping density blocks (CSDB). Our models obtain state-of-the-art(SOTA) predictions in several datasets, especially in average precision rate (AP), which is confirmed by extensive experiments. Moreover, our models do not include down-sample operations, demonstrating those widely believed operations are not necessary. Also, a novel modification on data augmentation for training is employed, which allows noiseless data to be employed in model training and thus improves the performance of models predicting on edge maps themselves.

arxiv情報

著者	Hao Shu,Guo-Ping Qiu
発行日	2024-07-29 13:24:55+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

More precise edge detections

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー