TY - JOUR
T1 - MixSegNet
T2 - Fusing multiple mixed-supervisory signals with multiple views of networks for mixed-supervised medical image segmentation
AU - Wang, Ziyang
AU - Yang, Chen
PY - 2024/7
Y1 - 2024/7
N2 - Deep learning has driven remarkable advancements in medical image segmentation. The requirement for comprehensive annotations, however, poses a significant challenge due to the labor-intensive and expensive nature of expert annotation. Addressing this challenge, we introduce a multiple mixed-supervisory signals learning (MSL) strategy, MixSegNet, that synergistically harnesses the benefits of Fully-Supervised (FSL), Weakly-Supervised (WSL), and Semi-Supervised Learning (SSL). This approach enables the utilization of various data-efficient annotations for network training, promoting efficient medical image segmentation within realistic clinical scenarios. MixSegNet concurrently trains networks with a combination of limited dense labels, a larger proportion of cost-efficient sparse labels, and unlabeled data. The networks utilized in this system comprise Vision Transformer (ViT) and Convolutional Neural Networks (CNN), which work together via an effective strategy including network self-ensembling and label dynamic-ensembling. This strategy adeptly handles the training challenges arising from datasets with limited or absent supervisory signals. We validated MixSegNet on a public Magnetic Resonance Imaging (MRI) cardiac segmentation benchmark dataset. It demonstrated superior performance compared to 21 other SSL or WSL baseline methods under similar labeling-cost conditions, as supported by comprehensive evaluation metrics, and slightly outperform classical FSL methods. The code for MixSegNet, all baseline methods, and the data pre-processing techniques with the datasets for different annotation situations are available at https://github.com/ziyangwang007/MixSegNet.
AB - Deep learning has driven remarkable advancements in medical image segmentation. The requirement for comprehensive annotations, however, poses a significant challenge due to the labor-intensive and expensive nature of expert annotation. Addressing this challenge, we introduce a multiple mixed-supervisory signals learning (MSL) strategy, MixSegNet, that synergistically harnesses the benefits of Fully-Supervised (FSL), Weakly-Supervised (WSL), and Semi-Supervised Learning (SSL). This approach enables the utilization of various data-efficient annotations for network training, promoting efficient medical image segmentation within realistic clinical scenarios. MixSegNet concurrently trains networks with a combination of limited dense labels, a larger proportion of cost-efficient sparse labels, and unlabeled data. The networks utilized in this system comprise Vision Transformer (ViT) and Convolutional Neural Networks (CNN), which work together via an effective strategy including network self-ensembling and label dynamic-ensembling. This strategy adeptly handles the training challenges arising from datasets with limited or absent supervisory signals. We validated MixSegNet on a public Magnetic Resonance Imaging (MRI) cardiac segmentation benchmark dataset. It demonstrated superior performance compared to 21 other SSL or WSL baseline methods under similar labeling-cost conditions, as supported by comprehensive evaluation metrics, and slightly outperform classical FSL methods. The code for MixSegNet, all baseline methods, and the data pre-processing techniques with the datasets for different annotation situations are available at https://github.com/ziyangwang007/MixSegNet.
KW - Medical image segmentation
KW - Mixed-supervised learning
KW - Vision transformer
UR - http://www.scopus.com/inward/record.url?scp=85185831854&partnerID=8YFLogxK
UR - https://www.sciencedirect.com/science/article/pii/S0952197624002173?via%3Dihub
U2 - 10.1016/j.engappai.2024.108059
DO - 10.1016/j.engappai.2024.108059
M3 - Article
AN - SCOPUS:85185831854
SN - 0952-1976
VL - 133
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
IS - A
M1 - 108059
ER -