基于改进YOLOv11n模型的输送带异物检测方法

Method for foreign object detection on conveyor belts based on improved YOLOv11n model

  • 摘要: 针对煤矿输送带异物尺寸差异大、背景复杂、小目标和细长目标检测效果不佳的问题,提出一种基于改进YOLOv11n模型的输送带异物检测方法。YOLOv11n模型核心改进包括3个方面:在颈部网络引入顺序化尺度交互融合(SSFF)模块,用顺序化的尺度交互增强不同尺度信息的有效捕获与融合;构建并行化的C3K2PPA模块,引入空间维度的并行补丁感知注意力(PPA)模块,以突出关键区域表征,提升召回率;在同尺度横向融合处设计带向对比融合(BACF)模块,结合带向先验、高通边缘指示,用逐像素门控替代简单拼接,可在不增加通道数的前提下抑制沿带向的周期性背景噪声并强化跨分支差异,从而提升模型在复杂工况下的判别能力与鲁棒性。实验结果表明:改进YOLOv11n模型的精确率与召回率分别达 0.914 与 0.892,mAP@0.5与mAP@0.5:0.95分别为93.1%,62.2%,较YOLOv11n有显著提升,在准确性和鲁棒性方面优于YOLOv5s,YOLOv8n,YOLOv10n等主流轻量模型;模型推理速度达96帧/s,实时性较高,能高效执行煤矿输送带异物检测任务;热力图分析表明,改进YOLOv11n模型有效增强了目标区域聚焦能力,减少了冗余框,提高了小目标检测精度。

     

    Abstract: To address the problem of large differences in foreign object sizes, complex backgrounds, and poor detection performance of small and slender targets in coal mine conveyor belts, a foreign object detection method based on an improved YOLOv11n model was proposed. The core improvements of the YOLOv11n model included three aspects: first, a Scale Sequence Feature Fusion (SSFF) module was introduced into the neck network to enhance the effective capture and fusion of information at different scales through sequential scale interaction; second, a parallel C3K2PPA module was constructed, introducing a Parallelized Patch-Aware Attention (PPA) module in the spatial dimension to highlight key region representations and improve recall; third, a Band-Aware Contrast Fusion (BACF) module was designed at the lateral fusion layer of the same scale. By combining belt-directional priors and high-pass edge indicators, and replacing simple concatenation with pixel-wise gating, the module suppressed periodic background noise along the belt direction and enhanced cross-branch differences without increasing the number of channels, thereby improving the model’s discriminative capability and robustness under complex working conditions. The experimental results showed that the precision and recall of the improved YOLOv11n model reached 0.914 and 0.892, respectively, with mAP@0.5 and mAP@0.5:0.95 values of 93.1% and 62.2%, showing significant improvement over the original YOLOv11n and outperforming mainstream lightweight models such as YOLOv5s, YOLOv8n, and YOLOv10n in accuracy and robustness. The inference speed of the model reached 96 frames per second, indicating high real-time performance and efficient execution in coal mine conveyor belt foreign object detection tasks. Heatmap analysis showed that the improved YOLOv11n model effectively enhanced the target-area focusing capability, reduced redundant bounding boxes, and improved the detection accuracy of small targets.

     

/

返回文章
返回