TY - GEN
T1 - An Uncertainty-Aware Transformer for MRI Cardiac Semantic Segmentation via Mean Teachers
AU - Wang, Ziyang
AU - Zheng, Jian-Qing
AU - Voiculescu, Irina
PY - 2022/7/25
Y1 - 2022/7/25
N2 - Deep learning methods have shown promising performance in medical image semantic segmentation. The cost of high-quality annotations, however, is still high and hard to access as clinicians are pressed for time. In this paper, we propose to utilize the power of Vision Transformer (ViT) with a semi-supervised framework for medical image semantic segmentation. The framework consists of a student model and a teacher model, where the student model learns from image feature information and helps teacher model to update parameters. The consistency of the inference of unlabeled data between the student model and teacher model is studied, so the whole framework is set to minimize segmentation supervision loss and consistency semi-supervision loss. To improve the semi-supervised performance, an uncertainty estimation scheme is introduced to enable the student model to learn from only reliable inference data during consistency loss calculation. The approach of filtering inconclusive images via an uncertainty value and the weighted sum of two losses in the training process is further studied. In addition, ViT is selected and properly developed as a backbone for the semi-supervised framework under the concern of long-range dependencies modeling. Our proposed method is tested with a variety of evaluation methods on a public benchmarking MRI dataset. The results of the proposed method demonstrate competitive performance against other state-of-the-art semi-supervised algorithms as well as several segmentation backbones.
AB - Deep learning methods have shown promising performance in medical image semantic segmentation. The cost of high-quality annotations, however, is still high and hard to access as clinicians are pressed for time. In this paper, we propose to utilize the power of Vision Transformer (ViT) with a semi-supervised framework for medical image semantic segmentation. The framework consists of a student model and a teacher model, where the student model learns from image feature information and helps teacher model to update parameters. The consistency of the inference of unlabeled data between the student model and teacher model is studied, so the whole framework is set to minimize segmentation supervision loss and consistency semi-supervision loss. To improve the semi-supervised performance, an uncertainty estimation scheme is introduced to enable the student model to learn from only reliable inference data during consistency loss calculation. The approach of filtering inconclusive images via an uncertainty value and the weighted sum of two losses in the training process is further studied. In addition, ViT is selected and properly developed as a backbone for the semi-supervised framework under the concern of long-range dependencies modeling. Our proposed method is tested with a variety of evaluation methods on a public benchmarking MRI dataset. The results of the proposed method demonstrate competitive performance against other state-of-the-art semi-supervised algorithms as well as several segmentation backbones.
KW - Image semantic segmentation
KW - Semi-supervised learning
KW - Vision transformer
UR - http://www.scopus.com/inward/record.url?scp=85135922319&partnerID=8YFLogxK
UR - https://link.springer.com/chapter/10.1007/978-3-031-12053-4_37
U2 - 10.1007/978-3-031-12053-4_37
DO - 10.1007/978-3-031-12053-4_37
M3 - Conference publication
AN - SCOPUS:85135922319
SN - 9783031120527
T3 - Lecture Notes in Computer Science (LNCS)
SP - 494
EP - 507
BT - Medical Image Understanding and Analysis
A2 - Yang, Guang
A2 - Aviles-Rivero, Angelica
A2 - Roberts, Michael
A2 - Schönlieb, Carola-Bibiane
PB - Springer
T2 - 26th Annual Conference on Medical Image Understanding and Analysis, MIUA 2022
Y2 - 27 July 2022 through 29 July 2022
ER -