/
/
/
Integrating RGB and DSM Data for Enhanced Building Segmentation in UAV Images

Integrating RGB and DSM Data for Enhanced Building Segmentation in UAV Images

Original Research ArticleDec 1, 2025Online First Articles https://doi.org/10.55003/cast.2025.265709

Abstract

Accurate building segmentation in unmanned aerial vehicle (UAV) orthophotos remains a significant challenge due to the visual similarity between buildings and non-target elements such as trees, roads, and background clutter. This study proposes an enhanced segmentation method—referred to as RGB-DSM-IMP (M3)—which integrates RGB imagery, Digital Surface Model (DSM) data, and a novel background removal preprocessing step. The Mask Region-Based Convolutional Neural Network (Mask R-CNN) framework was employed to evaluate three segmentation strategies: a baseline model using only RGB imagery, a second model combining RGB imagery with DSM data, and the proposed model that incorporates both data types along with preprocessing. All models were trained and tested on drone-acquired images representing a variety of building types and environmental conditions. Performance was evaluated using precision, recall, F1-score, average precision (AP), mean intersection over union (mIoU), and mean average precision (mAP). The enhanced model achieved the highest results across all metrics, with an average F1-score of 0.74, mIoU of 0.74, and mAP of 0.63. These findings highlight the benefit of integrating elevation data to enhance spatial differentiation and demonstrate the effectiveness of background removal in reducing misclassifications caused by visually similar objects. In addition, the method maintained a practical inference time per image, supporting its real-world applicability. Overall, the study demonstrates that combining height-based information with strategic preprocessing significantly improves the accuracy and robustness of building segmentation in complex aerial imagery.

background removal
building segmentation
Digital Surface Model (DSM)
Mask R-CNN
UAV Imagery

How to Cite

Khiewwan, K. ., Asavasuthirakul, D. ., & Chimlek, S. . (2025). Integrating RGB and DSM Data for Enhanced Building Segmentation in UAV Images. Current Applied Science and Technology, e0265709. https://doi.org/10.55003/cast.2025.265709

References

  • Al-Najjar, H. A. H., Kalantar, B., Pradhan, B., Saeidi, V., Halin, A. A., Ueda, N., & Mansor, S. (2019). Land cover classification from fused DSM and UAV images using convolutional neural networks. Remote Sensing, 11(12), 1-18. https://doi.org/10.3390/rs11121461
  • Amo-Boateng, M., Sey, N., Amproche, A., & Domfeh, M. (2022). Instance segmentation scheme for roofs in rural areas based on Mask R-CNN. Egyptian Journal of Remote Sensing and Space Science, 25, 569-577.
  • Boonpook, W., Tan, Y., & Xu, B. (2020). Deep learning-based multi-feature semantic segmentation in building extraction from images of UAV photogrammetry. International Journal of Remote Sensing, 42(1), 1-19. https://doi.org/10.1080/01431161.2020.1788742
  • Chea, C., Saengprachatanarug, K., Posom, J., Wongphati, M., & Taira, E. (2019). Sugarcane canopy detection using high spatial resolution UAS images and digital surface model. Engineering and Applied Science Research, 46(4), 312-317.
  • Chen, J., Wang, G., Luo, L., Gong, W., & Cheng, Z. (2021). Building area estimation in drone aerial images based on Mask R-CNN. IEEE Geoscience and Remote Sensing Letters, 18(5), 891-894. https://doi.org/10.1109/LGRS.2020.2988326

Author Information

Kanokwan Khiewwan

Department of Computer Science and Technology, Faculty of Science, Naresuan University, Phitsanulok, Thailand

Duangduen Asavasuthirakul

Drone AI Solutions Co., Ltd., Kamphaeng Phet, Thailand

Sutasinee Chimlek

Department of Computer Science and Technology, Faculty of Science, Naresuan University, Phitsanulok, Thailand

About this Article

Journal

Online First Articles

Type of Manuscript

Original Research Article

Published

1 December 2025