Forest fires are a serious ecological concern, and smoke is an early warning indicator. Early smoke images barely capture a tiny portion of the total smoke. Because of the irregular nature of smoke’s dispersion and the dynamic nature of the surrounding environment, smoke identification is complicated by minor pixel-based traits. This study presents a new framework that decreases the sensitivity of various YOLO detection models. Additionally, we compare the detection performance and speed of different YOLO models such as YOLOv3, YOLOv5, and YOLOv7 with prior ones such as Fast R-CNN and Faster R-CNN. Moreover, we follow the use of a collected dataset that describes three distinct detection areas, namely close, medium, and far distance, to identify the detection model’s ability to recognize smoke targets correctly. Our model outperforms the gold-standard detection method on a multi-oriented dataset for detecting forest smoke by an mAP accuracy of 96.8% at an IoU of 0.5 using YOLOv5x. Additionally, the findings of the study show an extensive improvement in detection accuracy using several data-augmentation techniques. Moreover, YOLOv7 outperforms YOLOv3 with an mAP accuracy of 95%, compared to 94.8% using an SGD optimizer. Extensive research shows that the suggested method achieves significantly better results than the most advanced object-detection algorithms when used on smoke datasets from wildfires, while maintaining a satisfactory performance level in challenging environmental conditions.