Dealing with Unreliable Annotations: A Noise-Robust Network for Semantic Segmentation through A Transformer-Improved Encoder and Convolution Decoder

Ziyang Wang*, Irina Voiculescu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)
1 Downloads (Pure)

Abstract

Conventional deep learning methods have shown promising results in the medical domain when trained on accurate ground truth data. Pragmatically, due to constraints like lack of time or annotator inexperience, the ground truth data obtained from clinical environments may not always be impeccably accurate. In this paper, we investigate whether the presence of noise in ground truth data can be mitigated. We propose an innovative and efficient approach that addresses the challenge posed by noise in segmentation labels. Our method consists of four key components within a deep learning framework. First, we introduce a Vision Transformer-based modified encoder combined with a convolution-based decoder for the segmentation network, capitalizing on the recent success of self-attention mechanisms. Second, we consider a public CT spine segmentation dataset and devise a preprocessing step to generate (and even exaggerate) noisy labels, simulating real-world clinical situations. Third, to counteract the influence of noisy labels, we incorporate an adaptive denoising learning strategy (ADL) into the network training. Finally, we demonstrate through experimental results that the proposed method achieves noise-robust performance, outperforming existing baseline segmentation methods across multiple evaluation metrics.

Original languageEnglish
Article number7966
Number of pages13
JournalApplied Sciences (Switzerland)
Volume13
Issue number13
DOIs
Publication statusPublished - 7 Jul 2023

Bibliographical note

Publisher Copyright:
© 2023 by the authors.
Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Data Access Statement

The dataset used in this study is public available at http://spineweb.digitalimaginggroup.ca/

Keywords

  • computed tomography
  • image segmentation
  • noisy label
  • Vision Transformer

Fingerprint

Dive into the research topics of 'Dealing with Unreliable Annotations: A Noise-Robust Network for Semantic Segmentation through A Transformer-Improved Encoder and Convolution Decoder'. Together they form a unique fingerprint.

Cite this