*Result*: EPIDSeg-Net: A Multi-Modal Fusion Framework Based on DRR Guidance in Radiotherapy is Used for Precise Segmentation of MV-EPID Lung Targets.
Original Publication: Schenectady, NY : Adenine Press, c2002-
Digit Health. 2025 May 14;11:20552076251341092. (PMID: 40376574)
Adv Clin Exp Med. 2018 May;27(5):725-730. (PMID: 29790681)
Neural Netw. 2020 Jan;121:74-87. (PMID: 31536901)
Annu Rev Biomed Eng. 2017 Jun 21;19:221-248. (PMID: 28301734)
Comput Biol Med. 2024 Sep;180:108980. (PMID: 39137668)
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. (PMID: 32613207)
IEEE Trans Image Process. 2016 May;25(5):2045-58. (PMID: 26863661)
Comput Intell Neurosci. 2022 May 23;2022:5369516. (PMID: 35655520)
Sensors (Basel). 2023 Mar 24;23(7):. (PMID: 37050479)
Sensors (Basel). 2022 Sep 28;22(19):. (PMID: 36236483)
IEEE Trans Radiat Plasma Med Sci. 2019 Mar;3(2):162-169. (PMID: 34722958)
Digit Health. 2025 Jul 20;11:20552076251361603. (PMID: 40693252)
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):5900-5920. (PMID: 38652624)
Med Image Anal. 2024 Oct;97:103280. (PMID: 39096845)
IEEE J Biomed Health Inform. 2021 Jan;25(1):121-130. (PMID: 32305947)
SLAS Technol. 2025 Oct;34:100334. (PMID: 40712914)
Int J Comput Assist Radiol Surg. 2023 Feb;18(2):353-365. (PMID: 36042149)
EJNMMI Phys. 2024 Jul 25;11(1):67. (PMID: 39052194)
PeerJ Comput Sci. 2024 Jan 12;10:e1798. (PMID: 38259898)
Sensors (Basel). 2024 Jun 21;24(13):. (PMID: 39000834)
*Further Information*
*BackgroundBy integrating Digitally Reconstructed Radiograph (DRR) images of pulmonary tumors with Electronic Portal Imaging Device (EPID) images to assist in target segmentation, and subsequently comparing morphological changes in segmented targets across different radiotherapy stages, this approach enables precise quantification of dynamic variations in target volume and shape. This methodological integration provides objective evidence for treatment response evaluation and dynamic optimization of treatment plans, thereby significantly enhancing the precision of radiotherapy delivery.MethodsThe proposed multimodal segmentation framework, named EPIDSeg-Net, comprises an encoder, a multi-scale feature layer, and a decoder. The encoder utilizes a dual-branch architecture: a CNN branch for extracting local texture features and a Swin-Transformer branch for capturing global semantic features. The model first calibrates multimodal input features through a Dual Attention Mechanism (DAM) to adaptively adjust modality-specific weights, thereby enhancing tolerance to missing image information in multi-sequence segmentation. Subsequently, two key modules are implemented within the multi-scale feature layer: a Large-Kernel Grouped Attention Gating (LKG-Gate) module to strengthen local contextual awareness, and a Multi-Path Feature Extraction (MPFE) module to improve feature robustness via a parallel structure. These designs enable the model to effectively focus on lung tumor target regions, optimize segmentation accuracy, and achieve high-performance reconstruction.ResultsThe framework effectively integrates multimodal features, enabling high-precision localization and sharp boundary delineation while preserving anatomical details. Quantitative evaluations demonstrate superior performance: DICE = 93.2 (92.4∼93.9), CE = 0.352, HD95 = 9.42 (6.03∼12.8), IOU = 86.0 (84.1∼87.9), and SENCE = 0.828. Overall, the model excels at preserving gradient information, regional integrity, and fine details; effectively suppresses feature loss; and reduces missed segmentation rates, leading to improvements in both subjective and objective performance metrics.ConclusionThe proposed segmentation method effectively integrates information from EPID and DRR images, enabling more precise localization and segmentation of lesion regions within EPID images while enhancing segmentation accuracy.*