Abstract
Due to the lack of quality annotation in medical imaging community, semi-supervised learning methods are highly valued in image semantic segmentation tasks. In this paper, an advanced consistency-aware pseudo-label-based self-ensembling approach is presented to fully utilize the power of Vision Transformer (ViT) and Convolutional Neural Network (CNN) in semi-supervised learning. Our proposed framework consists of a feature-learning module which is enhanced by ViT and CNN mutually, and a guidance module which is robust for consistency-aware purposes. The pseudo labels are inferred and utilized recurrently and separately by views of CNN and ViT in the feature-learning module to expand the data set and are beneficial to each other. Meanwhile, a perturbation scheme is designed for the feature-learning module, and averaging network weight is utilized to develop the guidance module. By doing so, the framework combines the feature-learning strength of CNN and ViT, strengthens the performance via dual-view co-training, and enables consistency-aware supervision in a semi-supervised manner. A topological exploration of all alternative supervision modes with CNN and ViT are detailed validated, demonstrating the most promising performance and specific setting of our method on semi-supervised medical image segmentation tasks. Experimental results show that the proposed method achieves state-of-the-art performance on a public benchmark data set with a variety of metrics. The code is publicly available (https://github.com/ziyangwang007/CV-SSL-MIS ).
| Original language | English |
|---|---|
| Title of host publication | Computer Vision – ECCV 2022 Workshops. |
| Subtitle of host publication | Tel Aviv, Israel, October 23-27, 2022 Proceedings, Part VII |
| Editors | Leonid Karlinsky, Tomer Michaeli, Ko Nishino |
| Pages | 424-441 |
| Number of pages | 18 |
| Volume | 13807 |
| ISBN (Electronic) | 9783031250828 |
| DOIs | |
| Publication status | Published - 12 Feb 2023 |
| Event | Workshops held at the 17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel Duration: 23 Oct 2022 → 27 Oct 2022 |
Publication series
| Name | Lecture Notes in Computer Science (LNCS) |
|---|---|
| Volume | 13807 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | Workshops held at the 17th European Conference on Computer Vision, ECCV 2022 |
|---|---|
| Country/Territory | Israel |
| City | Tel Aviv |
| Period | 23/10/22 → 27/10/22 |
Bibliographical note
Publisher Copyright:© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.