Semantic Scene Segmentation for Robotics

要約

包括的なシーンの理解は、ロボットの自律性を可能にする重要な要素です。
セマンティックセグメンテーションは、シーンを理解する重要なタスクの 1 つであり、自動運転、家庭用サービスロボット、ラストワンマイル配送など、いくつかのロボットアプリケーションにとって極めて重要です。
セマンティックセグメンテーションは、画像の各ピクセルにセマンティッククラスラベルが割り当てられるシーン表現を提供することを目的とした高密度予測タスクです。
したがって、セマンティックセグメンテーションでは、背景を含むすべてのシーン要素のオブジェクトカテゴリ、位置、および形状を組み込んだ完全なシーンコンテキストが考慮されます。
長年にわたって、セマンティックセグメンテーション用の多数のアルゴリズムが提案されてきました。
しかし、最近の深層学習の進歩と、計算能力の向上および大規模なラベル付きデータセットの利用可能性が相まって、セマンティックセグメンテーションは大幅に進歩しました。
この章では、セマンティックセグメンテーションのタスクを紹介し、このタスクに対処するために長年にわたって提案されてきた深層学習技術を紹介します。
まず、セマンティックセグメンテーションのタスクを定義し、それを他の密接に関連するシーン理解の問題と対比します。
セマンティックセグメンテーションと一般的に使用される損失関数のためのさまざまなアルゴリズムとアーキテクチャについて詳しく説明します。
さらに、セマンティックセグメンテーションで使用されるデータセット、ベンチマーク、メトリクスの概要を示します。
この章の最後では、この分野におけるさらなる研究の課題と機会について説明します。

要約(オリジナル)

Comprehensive scene understanding is a critical enabler of robot autonomy. Semantic segmentation is one of the key scene understanding tasks which is pivotal for several robotics applications including autonomous driving, domestic service robotics, last mile delivery, amongst many others. Semantic segmentation is a dense prediction task that aims to provide a scene representation in which each pixel of an image is assigned a semantic class label. Therefore, semantic segmentation considers the full scene context, incorporating the object category, location, and shape of all the scene elements, including the background. Numerous algorithms have been proposed for semantic segmentation over the years. However, the recent advances in deep learning combined with the boost in the computational capacity and the availability of large-scale labeled datasets have led to significant advances in semantic segmentation. In this chapter, we introduce the task of semantic segmentation and present the deep learning techniques that have been proposed to address this task over the years. We first define the task of semantic segmentation and contrast it with other closely related scene understanding problems. We detail different algorithms and architectures for semantic segmentation and the commonly employed loss functions. Furthermore, we present an overview of datasets, benchmarks, and metrics that are used in semantic segmentation. We conclude the chapter with a discussion of challenges and opportunities for further research in this area.

arxiv情報

著者	Juana Valeria Hurtado,Abhinav Valada
発行日	2024-01-15 10:52:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Semantic Scene Segmentation for Robotics

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー