Associate Professor
Supervisor of Master's Candidates
Affiliation of Author(s):School of Computer Science and Engineering
Journal:IET Computer Vision
Funded by:Other Projects
Key Words:computational complexity, convolutional neural nets
Abstract:Due to the large computational and GPU memory cost of semantic segmentation, some works focus on designing lightweight models to achieve a good trade-off between computational cost and accuracy. A common approach is to combine a CNN with a vision transformer. However, these methods ignore the contextual information of multiple receptive fields, and existing methods often fail to recover the detailed information lost during the downsampling of multi-scale features. To address these issues, we propose AG Self-Attention, which consists of Enhanced Atrous Self-Attention (EASA) and Gate Attention. AG Self-Attention adds the contextual information of multiple receptive fields into the global semantic feature. Specifically, Enhanced Atrous Self-Attention uses weight-shared atrous convolutions with different atrous rates to gather contextual information under several specific receptive fields. Gate Attention introduces a gating mechanism to inject detailed information into the global semantic feature and to filter that information by producing "fusion" and "update" gates. To validate our approach, we conduct extensive experiments on common semantic segmentation datasets, including ADE20K, COCO-Stuff, PASCAL Context, and Cityscapes, showing that our method achieves state-of-the-art performance and a good trade-off between computational cost and accuracy.
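The record does not include the paper's code, so the following PyTorch sketch is only an illustrative reading of the two mechanisms the abstract names. The module names (EnhancedAtrousSelfAttention, GateAttention), the atrous rates (1, 2, 4), the feature resolutions, and the exact gating formulas are all assumptions chosen to match the abstract's wording, not the authors' implementation.

# Illustrative sketch only: module names, atrous rates, and gate formulas
# are assumptions based on the abstract, not the paper's official code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class EnhancedAtrousSelfAttention(nn.Module):
    """One shared 3x3 kernel applied at several atrous (dilation) rates,
    then fused, so each rate contributes a different receptive field."""
    def __init__(self, channels, rates=(1, 2, 4)):  # rates are assumed
        super().__init__()
        self.rates = rates
        # A single convolution whose weights are reused at every rate.
        self.shared = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.fuse = nn.Conv2d(channels * len(rates), channels, 1)

    def forward(self, x):
        outs = []
        for r in self.rates:
            # Weight sharing: same kernel, only the dilation changes;
            # padding=r keeps the spatial size constant for a 3x3 kernel.
            outs.append(F.conv2d(x, self.shared.weight, padding=r, dilation=r))
        return self.fuse(torch.cat(outs, dim=1))

class GateAttention(nn.Module):
    """Injects a high-resolution detail feature into the global semantic
    feature through learned 'fusion' and 'update' gates (assumed form)."""
    def __init__(self, channels):
        super().__init__()
        self.fusion_gate = nn.Conv2d(channels * 2, channels, 1)
        self.update_gate = nn.Conv2d(channels * 2, channels, 1)

    def forward(self, semantic, detail):
        # Detail maps are usually higher resolution; match spatial sizes.
        detail = F.interpolate(detail, size=semantic.shape[2:],
                               mode="bilinear", align_corners=False)
        pair = torch.cat([semantic, detail], dim=1)
        f = torch.sigmoid(self.fusion_gate(pair))  # how much detail to mix in
        u = torch.sigmoid(self.update_gate(pair))  # how much to update semantics
        fused = f * detail + (1 - f) * semantic
        return u * fused + (1 - u) * semantic

if __name__ == "__main__":
    sem = torch.randn(1, 64, 16, 16)  # global semantic feature (assumed shape)
    det = torch.randn(1, 64, 64, 64)  # high-resolution detail feature
    out = GateAttention(64)(EnhancedAtrousSelfAttention(64)(sem), det)
    print(out.shape)  # torch.Size([1, 64, 16, 16])

Under these assumptions, the sigmoid gates let the network suppress noisy detail per pixel rather than adding it unconditionally, which is one plausible way to realize the "filter detailed information" behavior the abstract describes.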
Co-author:高延生,李海,仲昭昊,赵宏伟
First Author:Kevin Liu
Indexed by:Journal paper
Page Number:1
ISSN No.:1751-9632
Translation or Not:no
Date of Publication:2023-08-08