Introduction

Background

CT liver vessel segmentation is essential for 3D visualization, path planning, and guidance in interventional liver surgery [28, 29]. However, vessels and the liver background show similar intensity values on CT images owing to their similar enhancement characteristics. Vessels are curvy and twisted, occlude one another, and are sometimes seriously distorted by liver tumors. Due to this intensity similarity and the complex structure of liver vessels, accurate liver vessel segmentation remains challenging. It still relies heavily on doctors' manual segmentation, which is hugely time-consuming and subject to the experience and skills of the experts [5].

Therefore, automatic vessel segmentation has triggered a broad discussion in the community. Even though some deep learning methods have achieved great success on organ segmentation tasks, they do not perform well in vessel segmentation because of the considerable variation in vessel structure and the imbalance between background and vessels. Most recent works are designed as variants of FCN [20], U-net [26], and V-net [22]. They rely heavily on convolution layers, which integrate multi-scale local information to obtain passable results. Yet the limited receptive field of convolution provides neither long-range dependencies nor sufficient global features, so such methods can hardly distinguish variable vessel margins accurately or segment minor vessels. Therefore, developing a liver vessel segmentation method that adds long-range dependencies and utilizes global spatial features is necessary.

Related work

Current liver vessel segmentation methods can be roughly classified into traditional region-based methods, edge-based methods, and deep learning-based methods. As region-based methods do not perform well in vessel segmentation, we review the most related work in the latter two categories. Since we use the transformer model as our backbone, we also review the newest work related to it. For a more comprehensive literature survey, refer to [7].

Traditional methods

Edge-based methods can be further divided into image filtering and enhancement algorithms and tracking-based algorithms [23]. Filtering and enhancement algorithms first filter the volume to reduce noise, then enhance the vessels by applying image gradients or multi-scale high-order derivatives, particularly the second derivatives of the angiographic images, to extract high-frequency information [16, 21]. Besides, Pamulapati et al. [24] introduced a vessel segmentation method based on a medial-axis enhancement filter. Tracking-based algorithms rely on predefined vessel models and track the minimum-cost path. Friman et al. [9] proposed to track many hypothetical vessel trajectories simultaneously, which improved the results in low-contrast conditions. Cetin et al. [3] and Cetin and Unal [2] presented tubular structure segmentation methods that utilized a second-order tensor from directional intensity measurements and a higher-order, cylindrical flux-based tensor to construct the vascular structure.

Deep learning-based methods

Most deep learning-based liver vessel segmentation works rely on CNN-based architectures, specifically U-net [26] and its variants, with a few attempts based on FCN [20] and V-net [22]. Chronologically, early-stage vessel segmentation methods, such as retinal vessel segmentation, were based on 2D approaches; later, as the targets shifted to 3D images, 3D methods became mainstream. Fu et al. [10] and Li et al. [18] proposed segmentation methods for retinal vessels in 2D images. These methods can handle small objects in 2D slices; however, vessel segmentation in the liver, brain, or lung is a volumetric task, and most 2D methods cannot transfer directly to 3D images because they omit essential information about spatial continuity along the Z-axis. Therefore, current state-of-the-art solutions for liver vessel segmentation focus on 2D multi-path (2.5D) and 3D methods. Kitrungrotsakul et al. [15] proposed a 2.5D method with three DenseNets sharing kernels, fed with patches resampled from the three planes (sagittal, coronal, and transverse) of the IRCADb dataset. Çiçek et al. [6] extended UNet from 2D images to volumes, fusing multi-scale 3D convolutional features into 3D-UNet. To employ 3D representations of liver vessel features, Huang et al. [12] proposed a 3D-UNet variant that worked well for this problem, and their evaluation of the incomplete IRCADb annotations further improved the results. Yu et al. [33] added a residual module to the 3D-UNet to provide more residual features, and Xu et al. [31] employed a 3D-FCN framework for this task. However, a reasonably supervised deep network has to be trained on a large dataset with high-quality labels, and the noisy labels of current datasets hurt model performance. Lately, Yan et al. [32] proposed fusing self-attention into 3D U-net, a notable attempt that improved segmentation details.

Vision transformers and 2D swin transformer

The self-attention mechanism allows transformers to dynamically extract the important features of word sequences and learn their long-range dependencies. This notion has recently been extended to computer vision through the vision transformer (ViT) [8], which targets image recognition. Taking 2D image patches with positional embeddings as input and pre-trained on large classical datasets, ViT achieved results comparable to CNN-based methods. In medical image tasks, more recent methods such as [4, 34] enjoy the benefits of both CNNs and transformers; Chen et al. [4] first utilized CNNs to extract low-level local features and transformers to capture global interactions. Based on the shifted-window mechanism, Liu et al. [19] proposed the Swin transformer, which learns hierarchical object concepts at different scales by appropriately downsampling feature maps, achieving state-of-the-art semantic segmentation. Inspired by it, Swin-Unet [1] first employed hierarchical transformer blocks in an integrated encoder and decoder to build a U-shaped architecture, improving on TransUNet's results on medical multi-organ segmentation tasks. For 3D segmentation, Karimi et al. [14] tentatively replaced 3D convolutional operators with transformers as the backbone, first splitting the local volume block into 3D patches and embedding them into a 1D sequence through ViT's self-attention design. Compared to these methods, our IBIMHAV-Net inherits the advantages of convolution in encoding precise spatial information and uses inductive biased self-attention in hierarchical representations, which helps to overcome the connectivity and variance issues of liver vessel segmentation.

Proposed method

Motivated by the existing 2D swin-transformer [1, 19] and past vision transformer attempts [4, 8, 11], we propose a transformer-based architecture for volumetric liver vessel segmentation that better utilizes global features and long-range dependencies. The main advantages and contributions of the proposed method are as follows:

1. We propose a network architecture that expands the swin transformer to 3D and combines convolution and self-attention to play to their respective strengths. For self-attention, global spatial information is encoded by the embedding, and long-range dependencies are captured by our designed 3D transformer block. For convolution, the multi-scale convolutions in the local feature path and the downsampling/upsampling layers encode precise local information and capture hierarchical resolution features.

2. We introduce voxel-wise rather than patch-wise embedding as the initial transformer input to fully utilize volumetric information, which turns volumetric prediction into sequence-to-sequence prediction over hierarchical resolution features.

3. We propose the inductive biased multi-head attention (IB-MSA), which changes the positional embedding: it learns a biased positional embedding initialized with an absolute 1-dimensional embedding in the transformer blocks, dramatically improving liver vessel segmentation results.

Methodology

The proposed method starts with dataset preprocessing. Then we introduce the architecture of our framework, the Inductive BIased Multi-Head Attention Vessel Net (IBIMHAV-Net), including the details of our 3D transformer design and the inductive biased multi-head attention mechanism. Finally, we describe the post-processing, which removes some discrete inaccurate results.

Fig. 1 Supplement of the vessel mask used in the training set

Fig. 2 Effect of our pre-processing: (A) the original image, (B) before pre-processing, (C) the CT after pre-processing

Preprocessing

Preprocessing plays an essential role and affects the segmentation results significantly [12].

Patch embedding block The embedding block transforms the input volume \(\mathcal {X} \in \mathbb {R}^{H \times W \times D}\) into a high-dimensional tensor \(\mathcal {T} \in \mathbb {R}^{\frac{H}{4} \times \frac{W}{4} \times \frac{D}{4} \times C}\), where \({\frac{H}{4} \times \frac{W}{4} \times \frac{D}{4}}\) is the number of patch tokens and C, the embedding dimension, is 128 (discussed in Sect. 3.3). Due to the variant and complex vessel structure, we design successive large-kernel convolutional combinations for voxel-wise sequence encoding instead of patch-wise encoding. Moreover, this setting reduces computational complexity at the same receptive field, accommodating long sequences. Every convolutional layer is followed by one GELU and one LayerNorm layer to fully embed the volume as a 1-D sequence. The kernels and strides are set as in Fig. 3 (right), since the input volumes are nearly square.
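To make the embedding concrete, here is a minimal PyTorch sketch of the voxel-wise embedding path, assuming two stride-2 convolutions for the overall 4x reduction; the exact kernel/stride schedule is the one in Fig. 3, and the channel sizes here are illustrative.

```python
import torch
import torch.nn as nn

class VoxelEmbedding(nn.Module):
    """Voxel-wise embedding: successive convolutions (overall stride 4),
    each followed by GELU and LayerNorm, producing a 1-D token sequence.
    Channel sizes and the two-stage stride-2 schedule are illustrative."""
    def __init__(self, in_ch=1, embed_dim=128):
        super().__init__()
        self.conv1 = nn.Conv3d(in_ch, embed_dim // 2, kernel_size=3, stride=2, padding=1)
        self.norm1 = nn.LayerNorm(embed_dim // 2)
        self.conv2 = nn.Conv3d(embed_dim // 2, embed_dim, kernel_size=3, stride=2, padding=1)
        self.norm2 = nn.LayerNorm(embed_dim)
        self.act = nn.GELU()

    def forward(self, x):                         # x: (B, 1, H, W, D)
        x = self.act(self.conv1(x))
        x = x.permute(0, 2, 3, 4, 1)              # channels-last for LayerNorm
        x = self.norm1(x).permute(0, 4, 1, 2, 3)
        x = self.act(self.conv2(x))
        x = x.flatten(2).transpose(1, 2)          # (B, L, C), L = (H/4)(W/4)(D/4)
        return self.norm2(x)
```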

Down-sampling layer The swin transformer blocks in past 2D tasks used neighboring-patch concatenation (patch merging) [1, 19]. However, we find that a simple convolution with small strides works better. It is likewise followed by a GELU layer and a LayerNorm, keeping the processed feature map normalized to [0, 1] and preserving the sensitivity of the model; this works better than Batch Normalization (BN) with the ReLU activation in our architecture.
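A matching sketch of this down-sampling layer, reusing the imports above; the channel doubling per stage is our assumption.

```python
class ConvDownsample(nn.Module):
    """Small-stride convolution replacing Swin's patch-merging concatenation,
    followed by GELU and LayerNorm as described in the text."""
    def __init__(self, dim):
        super().__init__()
        self.reduction = nn.Conv3d(dim, 2 * dim, kernel_size=3, stride=2, padding=1)
        self.act = nn.GELU()
        self.norm = nn.LayerNorm(2 * dim)

    def forward(self, x):                        # x: (B, C, H, W, D)
        x = self.act(self.reduction(x))          # halve the spatial resolution
        x = self.norm(x.permute(0, 2, 3, 4, 1))  # normalize channels-last
        return x.permute(0, 4, 1, 2, 3)
```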

3D swin transformer block with Inductive Biased MSA Module

After the patch embedding block, the high-dimensional sequence tensor \(\mathcal {T}\) is put into the transformer blocks. Compared to the original Swin transformer, our method conducts self-attention along a hierarchical path and computes self-attention within 3D patch volumes with a bias focusing on block-edge segmentation (i.e., IB-MSA, biased positional multi-head self-attention) instead of 2D shifted windows.

3D transformer block At the tail of the embedding block, the sequence is transformed into a high-dimensional tensor for the swin transformer blocks. The main idea is to fully mix the captured long-range dependencies with hierarchical object concepts at various scales, via the subsequent down-sampling convolutions, and with the global spatial information from the initial embedding block.

To describe the workflow of our design, let the high-dimensional tensor \(\mathcal {T} \in \mathbb {R}^{L \times C}\) be reshaped as \(\hat{\mathcal {T}} \in \mathbb {R}^{N \times P \times C}\) when passing through IB-MSA, where N is the number of tiny local volumes and \(P = S_{H} \times S_{W} \times S_{D}\) denotes the number of patch tokens in each volume; \(\left\{ S_{H}, S_{W}, S_{D}\right\}\) is the size of a tiny local volume. To fit the various shapes of vessel CT scans in our task, this setting must cover all patch tokens of the last transformer block in the encoder. Because sampling quality differs between datasets, brute-force padding the data to satisfy a fixed \(\left\{ S_{H}, S_{W}, S_{D}\right\}\) may not be reasonable; therefore, the cropped patch X is adaptively adjusted to fit the size of the local volumes. We set \(\left\{ S_{H}, S_{W}, S_{D}\right\}\) to \(\left\{ 4, 4, 4\right\}\) on IRCADb.
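The reshape from \(\mathcal {T} \in \mathbb {R}^{L \times C}\) to \(\hat{\mathcal {T}} \in \mathbb {R}^{N \times P \times C}\) is essentially the 3D analogue of Swin's window partition; a sketch, assuming the feature map is divisible by \((S_H, S_W, S_D)\):

```python
def volume_partition(t, s):
    """Partition a (B, H, W, D, C) feature map into tiny local volumes.
    s = (S_H, S_W, S_D); returns (B*N, P, C) with P = S_H * S_W * S_D."""
    B, H, W, D, C = t.shape
    sh, sw, sd = s
    t = t.view(B, H // sh, sh, W // sw, sw, D // sd, sd, C)
    t = t.permute(0, 1, 3, 5, 2, 4, 6, 7).contiguous()
    return t.view(-1, sh * sw * sd, C)
```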

Following the baseline [1], we present two successive transformer blocks. The main difference is that our computational unit is built for 3D volumes rather than 2D windows. Based on the above volume partitioning, the consecutive swin transformer blocks can be formulated as follows:

$$\begin{aligned} \hat{\mathcal {T}}^{l}&=\text {IB-MSA}\left( \mathrm {LN}\left( \mathcal {T}^{l-1}\right) \right) +\mathcal {T}^{l-1} \nonumber \\ \mathcal {T}^{l}&=\text {MLP}\left( \mathrm {LN}\left( \hat{\mathcal {T}}^{l}\right) \right) +\hat{\mathcal {T}}^{l} \nonumber \\ \hat{\mathcal {T}}^{l+1}&=\text {Shifted IB-MSA}\left( \mathrm {LN}\left( \mathcal {T}^{l}\right) \right) +\mathcal {T}^{l} \nonumber \\ \mathcal {T}^{l+1}&=\text {MLP}\left( \mathrm {LN}\left( \hat{\mathcal {T}}^{l+1}\right) \right) +\hat{\mathcal {T}}^{l+1} \end{aligned}$$
(1)

Here, l denotes the layer index, MLP a multi-layer perceptron, and LN layer normalization. IB-MSA is our biased multi-head attention, which also has a 3D shifted version.
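Eq. (1) corresponds to a pair of pre-norm residual blocks; the sketch below treats the regular and shifted IB-MSA modules as given, which is an assumption about their interface.

```python
class SwinBlock3DPair(nn.Module):
    """Two successive blocks implementing Eq. (1): regular IB-MSA, then
    shifted IB-MSA, each followed by a pre-norm residual MLP."""
    def __init__(self, dim, attn, shifted_attn, mlp_ratio=4):
        super().__init__()
        self.attn, self.shifted_attn = attn, shifted_attn
        self.norms = nn.ModuleList([nn.LayerNorm(dim) for _ in range(4)])
        self.mlps = nn.ModuleList([
            nn.Sequential(nn.Linear(dim, mlp_ratio * dim), nn.GELU(),
                          nn.Linear(mlp_ratio * dim, dim)) for _ in range(2)])

    def forward(self, t):                              # t: (B, L, C)
        t = self.attn(self.norms[0](t)) + t            # \hat{T}^l
        t = self.mlps[0](self.norms[1](t)) + t         # T^l
        t = self.shifted_attn(self.norms[2](t)) + t    # \hat{T}^{l+1}
        return self.mlps[1](self.norms[3](t)) + t      # T^{l+1}
```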

Fig. 4 An illustrated example of 3D shifted windows. The input size \(H^{\prime } \times W^{\prime } \times D^{\prime }\) is \(8 \times 8 \times 8\), and the 3D window size \(M \times M \times M\) is \(4 \times 4 \times 4\). As layer l adopts regular window partitioning, the number of windows in layer l is \(2 \times 2 \times 2=8\). For layer \(l+1\), as the windows are shifted by \(\left( \frac{S_{H}}{2}, \frac{S_{W}}{2}, \frac{S_{D}}{2}\right) =(2,2,2)\) tokens, the number of windows becomes \(3 \times 3 \times 3=27\); though the number of windows increases, the efficient batch computation keeps the cost comparable (middle)

Supposing each window contains \(M \times M \times M\) patches, we extend the naive 2D MSA (e.g., that of the Swin transformer) to 3D. The computational complexity of global MSA on a volume of \(h \times w \times d\) patches is:

$$\begin{aligned} \Omega (\textrm{MSA})=4 h w d C^{2}+2(h w d)^{2} C \end{aligned}$$
(2)

where h, w, and d are fixed. However, this global self-attention computation is unaffordable in successive 3D transformer blocks, so we design both scalable windows and tiny local self-attention to save a huge amount of computing resources. First, we schedule tiny local patches \(\left\{ S_{H}, S_{W}, S_{D}\right\}\) to introduce more interactions between the local volumes and the full volume \(\left\{ h, w, d \right\}\):

$$\begin{aligned} \Omega (\text {IB-MSA})=4 h w d C^{2}+2 S_{H} S_{W} S_{D} h w d C \end{aligned}$$
(3)
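Plugging illustrative numbers into Eqs. (2) and (3) shows why the windowed variant is affordable; the feature-map size below is a made-up example, not one from the paper.

```python
# Illustrative cost comparison of Eq. (2) vs Eq. (3), pure arithmetic.
h, w, d, C = 32, 32, 24, 128     # example feature-map size (assumption)
S = 4                            # tiny local volume, S_H = S_W = S_D = 4
omega_msa = 4 * h * w * d * C**2 + 2 * (h * w * d) ** 2 * C
omega_ib = 4 * h * w * d * C**2 + 2 * S**3 * h * w * d * C
print(f"global MSA: {omega_msa:.2e}  IB-MSA: {omega_ib:.2e}")
# The (hwd)^2 term dominates global MSA; IB-MSA grows only linearly in hwd.
```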

Besides, the shifted window layers reduce the computational complexity through the efficient batch computation shown in Fig. 4. In the next layer l+1, the shifted IB-MSA displaces the tiny volumes by half their size, \(\left( \frac{S_{H}}{2}, \frac{S_{W}}{2}, \frac{S_{D}}{2}\right)\) (i.e., a shift of 2 tokens), and masks out the padded values when computing attention. The self-attention computation in the new windows crosses the boundaries of the previous windows in layer l, providing connections among them, as shown in Fig. 4 (right).

IB-MSA and relative position bias matrix Some recent studies [1, 8, 19] have shown that adding a bias to the self-attention computation brings clear advantages. Here, we shift the biased focus toward the edges of the segmentation volume by introducing a 3D relative position bias \(B\in \mathbb {R}^{M^{2} \times M^{2} \times M^{2}}\) for each head as:

$$\begin{aligned} \text{ Attention } (Q, K, V)= \text{ SoftMax } \left( Q K^{T} / \sqrt{d}+B\right) V \end{aligned}$$
(4)

where \(Q, K, V\in \mathbb {R}^{ P \times d}\) are the query, key and value matrices; d is the dimension of the query and key features, and P is the number of patch tokens in a \(3\textrm{D}\) window. Since the relative position along each axis lies in the range \([-M+1, M-1]\), we parameterize a smaller-sized bias matrix \(\hat{B} \in \mathbb {R}^{(2M-1) \times (2M-1) \times (2M-1)}\) and take the values in B from \(\hat{B}\); in the shifted configuration, masked positions additionally receive a large negative value in the logits.
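The lookup from \(\hat{B}\) into B follows the 2D Swin recipe extended to three axes; a sketch with our own variable names, where the zero-initialized table stands in for the learnable parameter:

```python
import torch

def relative_position_index_3d(M):
    """For every token pair in an M x M x M window, compute the index
    into the flattened (2M-1)^3 bias table \\hat{B}."""
    coords = torch.stack(torch.meshgrid(
        torch.arange(M), torch.arange(M), torch.arange(M), indexing="ij"))
    coords = coords.flatten(1)                       # (3, M^3)
    rel = coords[:, :, None] - coords[:, None, :]    # offsets in [-M+1, M-1]
    rel = rel.permute(1, 2, 0) + (M - 1)             # shift to [0, 2M-2]
    return (rel[..., 0] * (2 * M - 1) ** 2           # flatten 3 axes into one id
            + rel[..., 1] * (2 * M - 1) + rel[..., 2])

bias_table = torch.zeros((2 * 4 - 1) ** 3, 1)        # \hat{B} for M=4, one head
B = bias_table[relative_position_index_3d(4)]        # (M^3, M^3, heads) bias B
```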

The standard self-attention module discards positional information entirely and is permutation-equivariant, so it cannot model highly structured image content (e.g., UNETR [11]). Swin transformer and Swin-Unet [1, 19] use a relative position bias embedding. However, the original relative bias may lose some inductive biases, such as locality and translation equivariance, as noted in the swin-transformer ablations. Moreover, spatial invariance is crucial for our design, where transformers are interleaved with convolutions, and for small medical image datasets: such position embeddings carry no information at initialization, and all spatial relations between patches must be learned from scratch [8].

To overcome the above problems, we first initialize the pair-wise attention logits with an absolute position bias in the patch embedding and the first 3D swin-transformer block. In addition, we compute the pair-wise attention logit before the softmax using relative position encoding between pixel \(i=\left( i_{x}, i_{y}, i_{z}\right)\) and pixel \(j=\left( j_{x}, j_{y}, j_{z}\right)\), where \(q_{i}\) is the query vector of pixel i, \(k_{j}\) is the key vector of pixel j, and \(r_{j_{x}-i_{x}}^{W}\), \(r_{j_{y}-i_{y}}^{H}\), and \(r_{j_{z}-i_{z}}^{D}\) are learnable embeddings for the relative width \(j_{x}-i_{x}\), height \(j_{y}-i_{y}\), and depth \(j_{z}-i_{z}\). Therefore, both the relative position in pixel-wise attention computing and the inductive bias are guaranteed in the IB-MSA logit computation, given in Eq. 5 and Fig. 5.

Fig. 5 Detail of the inductive biased attention computation mechanism in our swin transformer blocks

$$\begin{aligned} l_{i, j}=\frac{q_{i}^{\top }}{\sqrt{d}}\left( k_{j}+r_{j_{x}-i_{x}}^{W}+r_{j_{y}-i_{y}}^{H}+r_{j_{z}-i_{z}}^{D}\right) \end{aligned}$$
(5)
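Eq. (5) decomposes the bias into three per-axis learnable embeddings added to the keys before the dot product; a minimal sketch for one attention head, with shapes and index tensors as our assumptions:

```python
class InductiveBiasedLogits(nn.Module):
    """Eq. (5): l_ij = q_i^T (k_j + r^W + r^H + r^D) / sqrt(d), with one
    learnable embedding per relative offset along each axis."""
    def __init__(self, dim, s):
        super().__init__()
        self.scale = dim ** -0.5
        self.r_w = nn.Parameter(torch.zeros(2 * s[0] - 1, dim))
        self.r_h = nn.Parameter(torch.zeros(2 * s[1] - 1, dim))
        self.r_d = nn.Parameter(torch.zeros(2 * s[2] - 1, dim))

    def forward(self, q, k, idx_w, idx_h, idx_d):    # q, k: (B, P, dim)
        # idx_*: (P, P) per-axis relative offsets, shifted to be non-negative
        rel = self.r_w[idx_w] + self.r_h[idx_h] + self.r_d[idx_d]   # (P, P, dim)
        logits = torch.einsum("bpc,bqc->bpq", q, k)                 # q_i . k_j
        logits = logits + torch.einsum("bpc,pqc->bpq", q, rel)      # q_i . r terms
        return logits * self.scale
```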

This setting improves liver vessel edge segmentation (Fig. 6), and we observe a further slight improvement when the bias is complemented with the absolute position initialization. A comparison with other methods is shown in Table 1.

Table 1 Precision/time trade-off

Decoder

In the decoder, the transformer blocks mirror those of the encoder in the opposite direction. The up-sampling blocks use deconvolution operators with small kernels and strides, which, combined with the skip connections, quickly recover high-resolution details from the low-level features. In the final stage, the transformer output is combined with the local extraction block to produce the end-to-end result.
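A sketch of such an up-sampling block, assuming a kernel-2/stride-2 transposed convolution and channel-wise concatenation with the skip feature; channel sizes are illustrative.

```python
class UpsampleBlock(nn.Module):
    """Small-kernel deconvolution plus skip connection, recovering
    high-resolution detail as described in the text."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.up = nn.ConvTranspose3d(in_ch, out_ch, kernel_size=2, stride=2)
        self.fuse = nn.Conv3d(2 * out_ch, out_ch, kernel_size=3, padding=1)
        self.act = nn.GELU()

    def forward(self, x, skip):              # skip: encoder feature, out_ch channels
        x = self.up(x)                       # double the spatial resolution
        x = torch.cat([x, skip], dim=1)      # fuse with the skip connection
        return self.act(self.fuse(x))
```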

Weighted Loss Function

Liver vessels occupy only a small region of the liver, and the imbalance between the foreground (hepatic vessels) and background (liver) classes often causes predictive deviation, biasing the classification toward the background with more voxels. The traditional Dice coefficient is calculated as:

$$\begin{aligned} \text {Dice}(P, G)=\frac{|P \cap G|}{|P \cap G|+0.5(|P-G|+|G-P|)} \end{aligned}$$

where P is the set of predicted labels and G is the set of ground-truth labels. With it alone, it is hard to achieve the desired segmentation of vessel edges and small branches. A similarity measure with a special penalty weight, \(WD(P, G, \beta )\) (weighted Dice), has been proposed to design the loss function [12] as follows:

$$\begin{aligned} WD(P, G, \beta )=\frac{|P \cap G|}{|P \cap G|+0.5 \beta (|P-G|+|G-P|)} \end{aligned}$$
(6)

where \(\beta\) determines the relative weight of the correctly classified foreground voxels versus the misclassified voxels.

Since our task has two class labels, we can take the foreground and background as the first and second classes, respectively. Then Eq. (6) becomes:

$$\begin{aligned} WD(\beta )=\frac{\sum _{i=1}^{N} p_{0 i} g_{0 i}}{\sum _{i=1}^{N} p_{0 i} g_{0 i}+0.5 \beta \left( \sum _{i=1}^{N} p_{0 i} g_{1 i}+\sum _{i=1}^{N} p_{1 i} g_{0 i}\right) } \end{aligned}$$
(7)

where \(p_{0 i}\) and \(p_{1 i}\) are the probabilities that voxel i belongs to the foreground (liver vessel) and the background (liver), respectively, in the softmax layer output. \(g_{0 i}\) and \(g_{1 i}\) are the labels of voxel i in the annotated data, taking values 0 or 1 for liver vessel and liver, respectively.
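A direct implementation sketch of Eq. (7), taking the softmax foreground probability and the binary foreground label as torch tensors; the \(\beta\) value and the epsilon guard are our choices.

```python
def weighted_dice(p0, g0, beta, eps=1e-6):
    """Eq. (7): p0 = softmax foreground (vessel) probability per voxel,
    g0 = binary foreground label; implicitly p1 = 1 - p0, g1 = 1 - g0."""
    p0, g0 = p0.flatten(), g0.flatten().float()
    inter = (p0 * g0).sum()                  # sum_i p_{0i} g_{0i}
    fp = (p0 * (1 - g0)).sum()               # sum_i p_{0i} g_{1i}
    fn = ((1 - p0) * g0).sum()               # sum_i p_{1i} g_{0i}
    return inter / (inter + 0.5 * beta * (fp + fn) + eps)

# Training loss, with beta as a tuning choice:  loss = 1 - weighted_dice(p, g, 2.0)
```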

From Huang et al. [12]'s studies, the gradient of the similarity in Eq. (6) with respect to its two variants determines how strongly the liver (background) and liver vessels (foreground) are weighted.

Unlike Chen et al. [4], our model does not need a pre-trained backbone, even though the swin transformer can hardly converge in the first 20-30 epochs. In the training process, we set the number of training epochs to 750. The default optimizer with momentum 0.9 and weight decay 2e-3 was used for backpropagation. We employ three indexes, precision, Dice, and sensitivity, to evaluate the results.

Experiments

In this section, we compare the proposed model with other state-of-the-art methods on the 3DIRCADb dataset. The CNN-based methods include UNet [6], VNet [22], Huang et al. [12], an optimized U-net variant, and ResUnet [33]. Besides, the improved graph-cuts method proposed by Sangsefidi et al. [27], a practical improvement of the traditional approach, performs well in liver vessel segmentation. In addition, some methods apply data refinement [12] or specific data augmentation strategies such as filters [15]; note that our work does not compare against these.

Table 2 Quantitative comparison of segmentation performance by three evaluation metrics on 3DIRCADb
Fig. 6 Visualization and comparison of the proposed deep learning method and state-of-the-art machine learning-based methods, using the raw volume as input, with post-processing. The three rows indicate different genres of methods. First row: (a) the ground truth, which is most similar to our result. Second row: (b), (c), (d) traditional 3D medical image methods. Third row: (e), (f), (g) modern deep learning methods from the literature and our method

Fig. 7 The first column lists the ground truth in different cases; the second column lists our network's results. (a), (b), (c), (d) represent different cases

Quantitative results To compare with other state-of-the-art methods in an equitable way, we focus only on the original-volume 3DIRCADb dataset. Our results are reported in Table 2, which lists the numerical results for two types of indexes. To quantify global and local feature segmentation, we introduce two indexes based on centerline measurements [17] that frequently appear in airway segmentation tasks [25]: Branches Detected (BD) and Tree-length Detected (TD) measure local and global segmentation, respectively, and reflect the effect of the shifted-window and IB-MSA mechanisms. Our model adopts a larger input to catch global relationships and obtain better segmentation results. Indeed, the CNN-based methods performed well on BD prediction, which matches our expectations. UNETR and our model can capture both global and local features, so they achieve better TD results. However, these two indexes have a higher variance than we initially expected, so we report them only as averages without confidence intervals.
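As a rough illustration of the TD idea, one can skeletonize the ground truth and measure the fraction of centerline voxels covered by the prediction; this is our simplification in the spirit of [17, 25], and BD would additionally require branch-point analysis.

```python
import numpy as np
from skimage.morphology import skeletonize_3d

def tree_length_detected(pred, gt):
    """Fraction of the ground-truth centerline covered by the prediction.
    pred, gt: binary 3D numpy arrays. A simplified stand-in for TD."""
    centerline = skeletonize_3d(gt.astype(np.uint8)) > 0
    detected = np.logical_and(centerline, pred.astype(bool)).sum()
    return detected / max(int(centerline.sum()), 1)
```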

For the other three indexes, which measure voxel-level results, our method significantly exceeds the others in Dice and precision. The weighted loss function balances the segmented classes and prevents a single voxel from obtaining multiple labels, which leads to a higher sensitivity and prevents over-segmentation.

Moreover, to achieve this higher precision, our structure costs much more time and storage than the pure CNN-based methods; Table 1 compares different models on this trade-off.

Visualization Results

Figure 6 shows the visualization of our experiment on one complex sample. After the 3D morphological close operation and post-processing, the surface of the vessels becomes smoother and some noise blocks are removed. To compare the results visually, we use the 3D Slicer toolbox and zoomed-in patches; the full results are shown in Fig. 6. This sample is long and curvy, and the segmentation results of FCN, 3D U-Net, and 3D V-Net on the hepatic veins are unsatisfactory: some regions are over-segmented and some minor vessels are missed. The reason could be that convolutional operators limit the capability of learning long-range dependencies. In addition, in the third row, Huang et al. and ResUnet did fairly well on the whole vessel structure, yet show many errors at the vessel edges, visible in the zoomed-in views. A fracture actually appears in the middle position; it may be caused by wrong labeling, and since it is not a small vessel it cannot be removed by preprocessing, so our design's global features may recognize it as cracked. Moreover, the small-vessel segmentation at the bottom of the blue box is more complete than UNETR's. By utilizing the inductive biased multi-head attention and the transformer, our method performs relatively closer to the ground truth in vessel edges and overall structure.
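The post-processing mentioned above (3D morphological closing plus removal of small disconnected noise blocks) can be sketched with SciPy; the minimum component size is an assumed threshold, not a value from the paper.

```python
import numpy as np
from scipy import ndimage

def postprocess(mask, min_size=100):
    """3D morphological closing, then drop small disconnected noise blocks.
    mask: binary 3D array; min_size is an assumed voxel threshold."""
    closed = ndimage.binary_closing(mask, structure=np.ones((3, 3, 3)))
    labels, n = ndimage.label(closed)                     # 6-connected components
    sizes = ndimage.sum(closed, labels, range(1, n + 1))  # voxels per component
    kept = np.nonzero(sizes >= min_size)[0] + 1           # component ids to keep
    return np.isin(labels, kept)
```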

To validate the generalization of our method, we conduct 4 test cases, mixing hard and simple cases, and show the results in Table 2. The Dice coefficients in these 4 cases are 84.3, 71.6, 75.9, and 67.4, respectively (Fig. 7). In complex cases (c) and (d), the green arrows point to some misclassified voxels, caused by missing labels in the ground truth. The red arrow points to a discontinuous vessel net, caused by a tumor at that position.

Ablation studies

To explore the influence of our designs on model performance, we conducted a series of ablation studies on the 3DIRCADb dataset.

Influences of inductive biased positional embedding and IB-MSA Table 3 compares different position embedding approaches for our network. IBIMHAV-Net with a general relative position bias yields a 2.5% accuracy improvement over absolute position embedding, indicating the effectiveness of the relative position bias. In addition, our proposed biased attention performs better than all the other positional embedding approaches.

Table 3 Inductive position bias

Influences of more skip and transformer blocks (bottleneck)

In our network architecture, the skip connections are placed after the down-sampling blocks and before the up-sampling blocks to unify the feature dimensions, because the transformer follows a different convergence rule from CNNs, which needs further discussion. As the input size increases to 224x224x96 while the patch size remains 2, the input token sequence of the transformer becomes longer, improving the segmentation performance of the model. However, although the segmentation accuracy improves slightly (about 0.3% DSC), the computational load of the whole network also increases significantly. To balance running efficiency, the experiments in this paper use a 128x128x96 input resolution.
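The token-count arithmetic behind this trade-off, as a quick check with patch size 2:

```python
# Token-sequence lengths for the two input resolutions, patch size 2.
for shape in [(128, 128, 96), (224, 224, 96)]:
    tokens = (shape[0] // 2) * (shape[1] // 2) * (shape[2] // 2)
    print(shape, "->", tokens, "tokens")
# (128, 128, 96) -> 196608; (224, 224, 96) -> 602112, roughly 3x more tokens.
```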

Effect of the weighted loss function and post-processing The testing results of the proposed IBIMHAV-Net structure were discussed above. Here we design an ablation study to evaluate the necessity of these two components. From Table 6, we see that the weighted loss function contributes more than post-processing.

Table 6 Ablation on postprocessing and weighted loss function

Conclusions

This paper presents a liver vessel segmentation method for CT images using a transformer-based network. The swin transformer has been expanded to 3D as the backbone and interleaved with convolutions. Specifically, the small-stride convolutions in both the local feature path and the up/down-sampling blocks keep spatial information hierarchically for the two successive swin transformer blocks. A new voxel-wise embedding method is used for our small-sample task with variable structures, and a new type of biased positional embedding is proposed for our transformer. Numerical evaluation and visualization on different benchmarks prove the validity of this deep learning method. Our method has been trained and tested on the 3D-IRCADb-01 dataset. In the future, we will further improve segmentation accuracy by introducing more precise datasets and trying multi-task methods to reduce the negative effects of liver tumors.