Enhancing Unmanned Aerial Vehicle Object Detection via Tensor Decompositions and Positive–Negative Momentum Optimizers
The current development of machine learning has advanced many fields in applied sciences and industry, including remote sensing. In this area, deep neural networks are used to solve routine object detection problems, satisfying the required rules and conditions. However, the growing number and difficulty of such problems cause the developers to construct machine learning models with higher computational complexities, such as an increased number of hidden layers, epochs, learning rate, and rate decay. In this paper, we propose the Yolov8 architecture with decomposed layers via canonical polyadic and Tucker methods for accelerating the solving of the object detection problem in satellite images. Our positive–negative momentum approaches enabled a reduction in the loss in precision and recall assessments for the proposed neural network. The convolutional layer factorization reduces the shapes and accelerates the computations at kernel nodes in the proposed deep learning models. The advanced optimization algorithms achieve the global minimum of loss functions, which makes the precision and recall metrics superior to the ones for their known counterparts. We examined the proposed Yolov8 with decomposed layers, comparing it with the conventional Yolov8 on the DIOR and VisDrone 2020 datasets containing the UAV images. We verified the performance of the proposed and known neural networks on different optimizers. It is shown that the proposed neural network accelerates the solving object detection problem by 44–52%. The proposed Yolov8 with Tucker and canonical polyadic decompositions has greater precision and recall metrics than the usual Yolov8 with known analogs by 0.84–0.94 and 0.228–1.107 percentage points, respectively.