Computer Vision Faculty Publications

TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature Grafting

Muhammad Hamza Sharif, Mohamed Bin Zayed University of Artificial Intelligence
Dmitry Demidov, Mohamed Bin Zayed University of Artificial Intelligence
Asif Hanif, Mohamed Bin Zayed University of Artificial Intelligence
Mohammad Yaqub, Mohamed Bin Zayed University of Artificial Intelligence
Min Xu, Mohamed Bin Zayed University of Artificial Intelligence

Document Type

Conference Proceeding

Publication Title

BMVC 2022 - 33rd British Machine Vision Conference Proceedings

Abstract

High-resolution images are preferable in the medical imaging domain because they significantly improve the diagnostic capability of the underlying method. In particular, high resolution substantially improves automatic image segmentation. However, most existing deep learning-based techniques for medical image segmentation are optimized for input images with small spatial dimensions and perform poorly on high-resolution images. To address this shortcoming, we propose a parallel-in-branch architecture called TransResNet, which incorporates a Transformer and a CNN in parallel to extract features from multi-resolution images independently. In TransResNet, we introduce the Cross Grafting Module (CGM), which generates grafted features, enriched in both global semantic and low-level spatial details, by combining the feature maps from the Transformer and CNN branches through fusion and a self-attention mechanism. Moreover, we use these grafted features in the decoding process, increasing the information flow for better prediction of the segmentation mask. Extensive experiments on ten datasets demonstrate that TransResNet achieves either state-of-the-art or competitive results on several segmentation tasks, including skin lesion, retinal vessel, and polyp segmentation. The source code and pre-trained models are available at https://github.com/Sharifmhamza/TransResNet.

Publication Date

11-24-2022

Keywords

Computer vision, Deep learning, Diagnosis, Grafting (chemical), Image enhancement, Semantic Segmentation, Semantics

Comments

Open access version available in BMVC

Uploaded: 28 May 2024

Recommended Citation

M. Sharif et al., "TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature Grafting," BMVC 2022 - 33rd British Machine Vision Conference Proceedings, Nov 2022.

Included in

Artificial Intelligence and Robotics Commons
