Associate professor
Supervisor of Master's Candidates
Hits:
Affiliation of Author(s):计算机科学与工程学院
Journal:2023 4th International Conference on Computer Vision, Image and Deep Learning
Funded by:自选课题
Key Words:remote sensing; lightweight; CNN; semantic segmentation
Abstract:Semantic segmentation, a fundamental task in computer vision, has developed rapidly in recent years. Semantic segmentation of remote sensing urban scene images, utilized in tasks such as land cover mapping, urban change detection, environmental preservation, and economic assessment, has also received much attention. In order to extract global semantic features, recent research has focused on combining transformers with CNN, which have great potential in global information modeling, for semantic segmentation models. However, the hybrid method of CNN and Vision Transformer still suffers from low latency. And the MLP of the visual transformer results in many parameters. To address these issues, we propose a lightweight pure CNN UNet(LPCUNet) model, which introduces a sizeable convolutional kernel to capture the global context. Moreover, a simple fusion module dynamically fuses local and global features. Extensive experiments show that our proposed method achieves state-of-the-art performance while faster latency. More precisely, the LPCUNet model demonstrated impressive performance with 83.3% and 86.4% mean Intersection over Union (mIoU) scores on the Vaihingen and Potsdam datasets respectively.
Co-author:武辉铸,鲍学良,仲昭昊
First Author:Kevin Liu
Indexed by:Essay collection
Page Number:2
Translation or Not:no
Date of Publication:2023-05-14