Stepwise Feature Fusion: Local Guides Global



Wang, Jinfeng, Huang, Qiming, Tang, Feilong, Meng, Jia ORCID: 0000-0003-3455-205X, Su, Jionglong and Song, Sifan ORCID: 0000-0002-7940-650X
(2022) Stepwise Feature Fusion: Local Guides Global. .

[img] Text
2022_SSFormer.pdf - Author Accepted Manuscript

Download (697kB) | Preview

Abstract

Colonoscopy, currently the most efficient and recognized colon polyp detection technology, is necessary for early screening and prevention of colorectal cancer. However, due to the varying size and complex morphological features of colonic polyps as well as the indistinct boundary between polyps and mucosa, accurate segmentation of polyps is still challenging. Deep learning has become popular for accurate polyp segmentation tasks with excellent results. However, due to the structure of polyps image and the varying shapes of polyps, it is easy for existing deep learning models to overfit the current dataset. As a result, the model may not process unseen colonoscopy data. To address this, we propose a new state-of-the-art model for medical image segmentation, the SSFormer, which uses a pyramid Transformer encoder to improve the generalization ability of models. Specifically, our proposed Progressive Locality Decoder can be adapted to the pyramid Transformer backbone to emphasize local features and restrict attention dispersion. The SSFormer achieves state-of-the-art performance in both learning and generalization assessment.

Item Type: Conference or Workshop Item (Unspecified)
Uncontrolled Keywords: Deep learning, Generalization, Polyp segmentation
Divisions: Faculty of Science and Engineering > School of Electrical Engineering, Electronics and Computer Science
Depositing User: Symplectic Admin
Date Deposited: 02 Nov 2023 10:19
Last Modified: 03 Nov 2023 08:41
DOI: 10.1007/978-3-031-16437-8_11
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3176576