基于深度图像和基本强度剖面特征检索的医学体数据传输函数

盛斌

doi:10.1007/s11390-024-3419-7

基于深度图像和基本强度剖面特征检索的医学体数据传输函数

盛斌,

A Transfer Function Design for Medical Volume Data Using a Knowledge Database Based on Deep Image and Primitive Intensity Profile Features Retrieval

摘要

摘要:
研究背景 本研究旨在深入探索计算机图形学及体绘制技术在医学图像分析领域中的实际运用。鉴于医学影像数据往往具有三维结构和体积特性，常通过切片堆叠的方式形成三维影像。尽管传统的体绘制技术能够实现一定程度的图像可视化，但在细节展现和效果优化方面仍存在明显不足。为此，本研究提出了一种基于内容检索的转移函数生成方法，旨在提升医学图像的可视化质量。直接体积渲染作为本研究的重点议题，其传输函数的设计与实施至关重要。目前，大多数直接体渲染技术仍需用户亲自浏览知识库，进行传输函数的选择与调整，这无疑增加了工作负担和人力成本。针对这一问题，本研究开发了一种高效的自动化检索知识库方法，以期在医学领域三维成像中实现更广泛的应用价值。通过本研究，有望为医学影像分析提供更准确、高效的可视化手段，进而推动相关领域的科技进步。这一研究成果将对医学影像处理与分析产生积极影响，有望为医疗诊断、治疗和研究提供更准确、直观和高效的视觉支持。
目的针对当前直接体积渲染方法存在的不足，如手动检索和调整传输函数的繁琐性，以及现有基于内容检索的传输函数设计方法在处理多感兴趣结构时的局限性，本文提出了一种创新的传输函数设计策略。通过引入一种新颖的基于内容检索的方法，该方法能够自动检索知识库，并融合自动识别感兴趣区域的推理过程，显著减少多维传输函数的设计与优化步骤。这一改进旨在优化医学图像的可视化效果，为医学诊断与分析提供更为直观、精确的视觉支持。
方法本文提出了基于三元输入查询的新型方法，旨在从给定的体积数据中精确提取体影像数据特征。此方法的核心在于构建一个用户定义的感兴趣结构射线，并辅以两个与射线共平面的三维图像。为了从这两张共平面图像中提取高级区域空间语义的图像特征，本文采用了预训练的卷积神经网络。在定位感兴趣结构方面，本文创新性地沿用了结构的强度轮廓，并结合动态时间规整技术，以应对相似轮廓间可能存在的微小对齐差异。此外，本文还开发了一种两阶段基于内容检索的方法，该方法通过巧妙结合卷积神经网络和动态时间规整两种技术的检索结果，实现了互补优化，从而显著提升了检索效率。为了提升用户交互的便捷性和直观性，本文还引入了一种基于笔画的、以图像为中心的交互方式。用户仅需简单绘制射线（在二维横截面视图上）或选择点（在三维直接体积渲染上），即可轻松生成三元输入查询。此外，对于需要强调特定感兴趣结构的情况，本文还提供了基于可见度的参数优化方案，以优化传输函数设计。
结果经过实验验证，本文提出的两阶段方法在检索各类感兴趣结构时，相较于传统的欧几里得距离基线方法，展现出了显著的优势。在召回率方面，两阶段方法平均提升了0.201，其中在肾脏检索方面的改善尤为明显，提升幅度高达0.304。这一提升充分证明了该方法在医学图像检索领域的有效性和实用性。在消融实验中，两阶段方法的表现同样令人瞩目。无论是对全局性更强的感兴趣结构（如动脉、骨骼和肝脏），还是对局部感兴趣结构（如肝脏、肺和脾），两阶段方法均展现出了优于单一组件（传统卷积神经网络和动态时间规整）的性能。在定性分析方面，两阶段方法在三元输入查询中能够准确识别所有感兴趣结构，这进一步验证了其在实际应用中的可靠性和准确性。与之相比，单一组件方法无法独立完成感兴趣结构的有效检索，这再次证明了组合两阶段方法的有效性。我们还通过一系列用例展示了所提出CBR-TF方法在医学体积可视化中的应用效果。用户可以通过简单的线条画在二维视图中指定感兴趣结构，进而在三维直接体积渲染中突出显示这些结构。这一功能为医学图像的解读和分析提供了极大的便利。在计算性能方面，虽然两阶段方法的计算时间（平均15.93秒）远高于ED基线方法（0.84秒），但考虑到其在检索性能上的显著提升，这一计算成本是可以接受的，这提示在未来的工作中可以进一步优化相关算法设计以提高计算效率。
结论本论文提出了一种基于内容检索的转移函数生成方法，以提高医学图像的可视化效果。该方法通过引入自动识别感兴趣区域（SOI）的案例推理（CBR）过程，大幅减少了多维传输函数（TF）设计的手动优化。实验结果表明，该方法在识别不同感兴趣区域（SOI）方面具有较高的准确性和召回率。该方法不仅能够自动生成高质量的转移函数，精确识别目标结构，还能通过融合图像语义与强度轮廓匹配，实现高效的可视化。

Abstract: Direct volume rendering (DVR) is a technique that emphasizes structures of interest (SOIs) within a volume visually, while simultaneously depicting adjacent regional information, e.g., the spatial location of a structure concerning its neighbors. In DVR, transfer function (TF) plays a key role by enabling accurate identification of SOIs interactively as well as ensuring appropriate visibility of them. TF generation typically involves non-intuitive trial-and-error optimization of rendering parameters, which is time-consuming and inefficient. Attempts at mitigating this manual process have led to approaches that make use of a knowledge database consisting of pre-designed TFs by domain experts. In these approaches, a user navigates the knowledge database to find the most suitable pre-designed TF for their input volume to visualize the SOIs. Although these approaches potentially reduce the workload to generate the TFs, they, however, require manual TF navigation of the knowledge database, as well as the likely fine tuning of the selected TF to suit the input. In this work, we propose a TF design approach, CBR-TF, where we introduce a new content-based retrieval (CBR) method to automatically navigate the knowledge database. Instead of pre-designed TFs, our knowledge database contains volumes with SOI labels. Given an input volume, our CBR-TF approach retrieves relevant volumes (with SOI labels) from the knowledge database; the retrieved labels are then used to generate and optimize TFs of the input. This approach largely reduces manual TF navigation and fine tuning. For our CBR-TF approach, we introduce a novel volumetric image feature which includes both a local primitive intensity profile along the SOIs and regional spatial semantics available from the co-planar images to the profile. For the regional spatial semantics, we adopt a convolutional neural network to obtain high-level image feature representations. For the intensity profile, we extend the dynamic time warping technique to address subtle alignment differences between similar profiles (SOIs). Finally, we propose a two-stage CBR scheme to enable the use of these two different feature representations in a complementary manner, thereby improving SOI retrieval performance. We demonstrate the capabilities of our CBR-TF approach with comparison with a conventional approach in visualization, where an intensity profile matching algorithm is used, and also with potential use-cases in medical volume visualization.

HTML全文

参考文献()

施引文献

资源附件()