Improved Fast YOLO One-Stage Object Detection Algorithm for Detecting Objects in Images
Main Article Content
Abstract
The use of a Convolutional neural network (CNN) has gained wide recommendation in the research community, especially in the area of vision systems (Object detection). The recent CNN recorded Various advancements in object detection in images with tremendous accuracy but still faced challenges of high time complexity. A one-stage object detection algorithm called YOLO (You Only Look Once), used for object classification and localization, performs great, especially in detecting objects in real time. In this study, we proposed an improved, fast YOLO CNN-based algorithm for detecting objects in images. We introduced hard negative mining for resampling and voting to eliminate some negative samples for balancing between negative and positive samples. A small convolution operation was used in exchange for the original convolution, which adjusted the parameters and effectively decreased detection time in images. The proposed model outperformed Fast YOLO with a precision of 88.32% and recall of 89.92%, conducted on smart city datasets.
Downloads
Article Details
Section

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
How to Cite
References
Ansari, S. (2020). Deep Learning in Object Detection. In Building Computer Vision Applications Using Artificial Neural Networks (pp. 219-307): Springer. DOI: https://doi.org/10.1007/978-1-4842-9866-4_6
Fang, W., Wang, L., & Ren, P. (2019). Tinier-YOLO: A real-time object detection method for constrained environments. IEEE Access, 8, 1935-1944. DOI: http://doi.org/10.1109/ACCESS.2019.2961959
Li, J., Su, Z., Geng, J., & Yin, Y. (2018). Real-time detection of steel strip surface defects based on improved Yolo detection network. IFAC-PapersOnLine, 51(21), 76-81. DOI: https://doi.org/10.1016/j.ifacol.2018.09.412
Li, Z., & Zhou, F. (2017). FSSD: feature fusion single shot multibox detector. arXiv preprint arXiv:1712.00960. DOI: https://doi.org/10.48550/arXiv.1712.00960
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., & Berg, A. C. (2016). SSD: Single shot multibox detector. Paper presented at the European conference on computer vision. DOI: https://doi.org/10.1007/978-3-319-46448-0_2
Lou, X., & Cui, B. (2007). Boundedness and exponential stability for nonautonomous RCNNs with distributed delays. Computers & Mathematics with Applications, 54(4), 589-598. DOI: https://doi.org/10.1016/j.camwa.2007.02.007
Lu, S., Wang, B., Wang, H., Chen, L., Linjian, M., & Zhang, X. (2019). A real-time object detection algorithm for video. Computers & Electrical Engineering, 77, 398-408. DOI: https://doi.org/10.1016/j.compeleceng.2019.05.009
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection—paper presented at the Proceedings of the IEEE conference on computer vision and pattern recognition. DOI: https://doi.org/10.1109/CVPR.2016.91
Redmon, J., & Farhadi, A. (2017). YOLO9000: better, faster, stronger. Paper presented at the Proceedings of the IEEE conference on computer vision and pattern recognition. DOI: https://doi.org/10.1109/CVPR.2017.690
Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767. DOI: https://doi.org/10.48550/arXiv.1804.02767
Sudowe, P., & Leibe, B. (2011). Efficient use of geometric constraints for sliding-window object detection in video. Paper presented at the International Conference on Computer Vision Systems. DOI: https://dl.acm.org/doi/10.5555/2045399.2045402
Sun, X., Wu, P., & Hoi, S. C. (2018). Face detection using deep learning: An improved faster RCNN approach. Neurocomputing, 299, 42-50. DOI: https://doi.org/10.1016/j.neucom.2018.03.030
Wang, Y., Niu, H., Wang, X., & Chen, L. (2019). Multi-class Object Detection Algorithm Based on Convolutional Neural Network. Paper presented at the 2019 IEEE International Conference of Intelligent Applied Systems on Engineering (ICIASE). DOI: https://doi.org/10.1109/ICIASE45644.2019.9074015
Xie, L., Ahmad, T., Jin, L., Liu, Y., & Zhang, S. (2018). A new CNN-based method for multi-directional car license plate detection. IEEE Transactions on Intelligent Transportation Systems, 19(2), 507-517. DOI: https://doi.org/10.1109/TITS.2017.2784093
Yin, Y., Li, H., & Fu, W. (2020). Faster-YOLO: An accurate and faster object detection method. Digital Signal Processing, 102, 102756. DOI: https://doi.org/10.1016/j.dsp.2020.102756
Zha, S., Luisier, F., Andrews, W., Srivastava, N., & Salakhutdinov, R. (2015). Exploiting image-trained CNN architectures for unconstrained video classification. arXiv preprint arXiv:1503.04144. DOI: https://doi.org/10.48550/arXiv.1503.04144
Lambat, S., & Sonawane, Dr. S. S. (2020). Analysis of Different Neural Network Techniques Used for Image Caption Generation. In International Journal of Innovative Technology and Exploring Engineering (Vol. 9, Issue 9, pp. 299–304). DOI: https://doi.org/10.35940/ijitee.i7165.079920
Kholwal, R., & Maurya, S. (2021). A Comparative Approach on Classification of Images with Convolutional Neural Networks. In International Journal of Engineering and Advanced Technology (Vol. 10, Issue 4, pp. 201–205). DOI: https://doi.org/10.35940/ijeat.d2483.0410421
Panda, C. (2019). Object Detection and Tracking using Faster R-CNN. In International Journal of Recent Technology and Engineering (IJRTE) (Vol. 8, Issue 3, pp. 4894–4900). DOI: https://doi.org/10.35940/ijrte.c5580.098319
Dr. Nithyanandh S. (2025). Object Detection & Analysis with Deep CNN and Yolov8 in Soft Computing Frameworks. In International Journal of Soft Computing and Engineering (Vol. 14, Issue 6, pp. 19–27). DOI: https://doi.org/10.35940/ijsce.e3653.14060125
Jadia, A., & Chawla, M. P. S. (2020). Image Classification and Detection of Insulators using Bag of Visual Words and Speeded up Robust Features. In International Journal of Innovative Science and Modern Engineering (Vol. 6, Issue 10, pp. 7–13). DOI: https://doi.org/10.35940/ijisme.j1260.0961020