Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse Annotations

Yufan Liu, Ziyang Wang*, Tianxiang Chen, Zi Ye

*Corresponding author for this work

Research output: Chapter in Book/Published conference outputConference publication

1 Citation (Scopus)

Abstract

Deep learning has significantly advanced the field of medical image segmentation but typically relies on extensive, densely annotated datasets, which are both costly and time-consuming to prepare. In response to the need for reducing annotation efforts, this study investigates a novel supervision approach named Semi-Scribble Supervised Learning, which utilizes a combination of semi-supervised (SSL) and weakly-supervised learning (WSL) techniques. This approach leverages both a large volume of unlabeled data and a smaller set of sparsely annotated, scribble-based labels. We introduce the Quadruple-Consistency Vision Transformer (4C-ViT), which capitalizes on the recent success of Vision Transformers in capturing intricate image features. Specifically, the proposed 4C-ViT employs an advanced consistency training strategy that incorporates quadruple perturbations at both the data and network levels, enhancing the network's robustness and performance. The efficacy of 4C-ViT is demonstrated on a publicly available MRI cardiac segmentation benchmark, where it outperforms other baseline methods across several evaluation metrics. The proposed 4C-ViT, alongside all baseline methods and the challenging yet realistic dataset, is made public available at https://github.com/ziyangwang007/CVSSL-MIS.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 2024
PublisherIEEE
Pages2101-2107
Number of pages7
ISBN (Electronic)9798350349399
DOIs
Publication statusE-pub ahead of print - 27 Sept 2024
Event31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates
Duration: 27 Oct 202430 Oct 2024

Publication series

NameProceedings - International Conference on Image Processing (ICIP)
ISSN (Print)1522-4880
ISSN (Electronic)2381-8549

Conference

Conference31st IEEE International Conference on Image Processing, ICIP 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period27/10/2430/10/24

Keywords

  • Medical Image Segmentation
  • Semi-Supervised Learning
  • Vision Transformer
  • Weakly-Supervised Learning

Fingerprint

Dive into the research topics of 'Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse Annotations'. Together they form a unique fingerprint.

Cite this