SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation

要約

2Dヒートマップベースのアプローチは、高性能のために何年もの間、人間のポーズ推定（HPE）を支配してきました。
ただし、2Dヒートマップベースの方法における長年の量子化誤差の問題は、いくつかのよく知られた欠点につながります。1）低解像度入力のパフォーマンスが制限されている。
2）特徴マップの解像度を向上させてローカリゼーションの精度を高めるには、コストのかかる複数のアップサンプリングレイヤーが必要です。
3）量子化誤差を低減するために追加の後処理が採用されています。
これらの問題に対処するために、HPEを水平座標と垂直座標の2つの分類タスクとして再定式化する\textit{SimCC}と呼ばれるまったく新しいスキームを検討することを目指しています。
提案されたSimCCは、各ピクセルをいくつかのビンに均一に分割するため、\emph{sub-pixel}のローカリゼーション精度と低い量子化誤差を実現します。
その恩恵を受けて、SimCCは追加の改良後処理を省略し、特定の設定でアップサンプリングレイヤーを除外できるため、HPEのパイプラインがよりシンプルで効果的になります。
COCO、CrowdPose、およびMPIIデータセットで実施された広範な実験は、SimCCがヒートマップベースの対応物よりも、特に低解像度の設定で大幅に優れていることを示しています。

要約(オリジナル)

The 2D heatmap-based approaches have dominated Human Pose Estimation (HPE) for years due to high performance. However, the long-standing quantization error problem in the 2D heatmap-based methods leads to several well-known drawbacks: 1) The performance for the low-resolution inputs is limited; 2) To improve the feature map resolution for higher localization precision, multiple costly upsampling layers are required; 3) Extra post-processing is adopted to reduce the quantization error. To address these issues, we aim to explore a brand new scheme, called \textit{SimCC}, which reformulates HPE as two classification tasks for horizontal and vertical coordinates. The proposed SimCC uniformly divides each pixel into several bins, thus achieving \emph{sub-pixel} localization precision and low quantization error. Benefiting from that, SimCC can omit additional refinement post-processing and exclude upsampling layers under certain settings, resulting in a more simple and effective pipeline for HPE. Extensive experiments conducted over COCO, CrowdPose, and MPII datasets show that SimCC outperforms heatmap-based counterparts, especially in low-resolution settings by a large margin.

arxiv情報

著者	Yanjie Li,Sen Yang,Peidong Liu,Shoukui Zhang,Yunxiao Wang,Zhicheng Wang,Wankou Yang,Shu-Tao Xia
発行日	2022-07-05 16:13:26+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー