Image semantic segmentation of indoor scenes: A survey

Authors

R. Velastegui, M. Tatarchenko, S. Karaoğlu and T. Gevers

Abstract

This survey provides a comprehensive evaluation of deep learning-based segmentation architectures. It covers a wide range of models, from traditional ones like FCN and PSPNet to more modern approaches like SegFormer and FAN. In addition to assessing the methods in terms of segmentation accuracy, we propose to evaluate them in terms of temporal consistency and corruption vulnerability. Most existing surveys on semantic segmentation focus on outdoor datasets. In contrast, this survey focuses on indoor scenarios to enhance the applicability of segmentation methods in this specific domain. Furthermore, our evaluation includes a performance analysis of the methods in prevalent real-world segmentation scenarios that pose particular challenges. These complex situations involve scenes affected by diverse forms of noise, blur corruptions, camera movements, and optical aberrations, among other factors. By jointly exploring segmentation accuracy, temporal consistency, and corruption vulnerability in challenging real-world situations, our survey offers insights that go beyond existing surveys, facilitating the understanding and development of better image segmentation methods for indoor scenes.

Bibtex

@article{velastegui24cviu_semseg,
      title={Image semantic segmentation of indoor scenes: A survey},
      author={Ronny Velastegui and Maxim Tatarchenko and Sezer Karaoglu and Theo Gevers},
      year={2024},
      journal={Computer Vision and Image Understanding}
}