基于分形噪声与径向距离融合的裂缝图像合成方法

A CRACK IMAGE SYNTHESIS METHOD BASED ON THE FUSION OF FRACTAL NOISE AND RADIAL DISTANCE

  • 摘要: 基于深度学习的视觉检测方法在混凝土结构裂缝分割领域展现出显著优势,但受限于训练数据集的数量与质量。针对现有裂缝数据集获取成本高、人工负担重的问题,本文提出了一种轻量化的裂缝图像合成方法。该方法利用随机分形噪声算法生成裂缝前景图像,并引入一种基于裂缝轴线径向距离的前景-背景图像叠加策略,从而实现高效、高质量的裂缝图像及其对应标注图像的自动生成。为验证方法的有效性,本文分别采用真实数据、合成数据、真实-合成混合数据在相同参数下对神经网络模型进行训练,并在真实裂缝数据测试集上对模型性能进行验证。结果表明,在相同数据量下,合成数据模型的各项性能指标与真实数据模型的差异均在10%以内,在使用真实-合成混合数据后,模型在交并比等指标上的差距缩小至2%左右,并在召回率上以0.33%的微弱优势超越真实数据模型。此外,合成图像单张的平均生成时间仅为136毫秒,优于传统人工标注方法。本文为基于图像的结构裂缝分割领域提供了一种低成本、高效率的训练数据获取方案,适用于实际工程中真实数据匮乏的场景。

     

    Abstract: Deep learning-based methods have demonstrated a remarkable performance in the segmentation of cracks occurring in concrete structures. However, their effectiveness remains constrained by the quantity and quality of training datasets. To address the high acquisition cost and labor-intensive nature of existing crack datasets, this study proposes a lightweight crack image synthesis method, which utilizes a stochastic fractal noise algorithm to generate foreground crack patterns, and introduces a foreground-background fusion strategy based on the radial distance from the crack axis. The method proposed enables the efficient and automated generation of high-quality crack images along with their corresponding annotations. To validate the method proposed, neural network models were trained under identical configurations using real data, synthetic data, and a combination of both, and tested on a real-world crack dataset. The simulation results show that, under the same data volume, the performance gap between models trained on synthetic data and real data remained within 10% across all key metrics. When using a hybrid dataset, the discrepancy in metrics such as Intersection over Union (IoU) decreased to approximately 2%, and the recall rate slightly surpassed the real-data model by 0.33%. Furthermore, the average generation time per synthetic image was only 136 milliseconds, outperforming conventional manual annotation methods. This study offers a cost-effective and highly efficient solution for generating training data for image-based structural crack segmentation tasks, especially in engineering scenarios where real data is limited.

     

/

返回文章
返回