Methods for real-time object searching and recognizing in video images on ios mobile platform

2019;
: pp. 24-34
1
Lviv Polytechnic National University, Computer Engineering Department
2
Lviv Polytechnic National University, Computer Engineering Department

The features of the most common methods and systems for searching and recognizing objects in video are explored. The research shows the feasibility of building search and recognition tools for the iOS platform in real time. The method of functional adaptation of the algorithm of search and recognition of objects to features of video is offered, which consists in processing of video image by smoothing and minimization filters, which reduces the time of search and recognition of objects. The block diagram and algorithm of system functioning were designed. Developed a program to solve the problem of finding and quickly recognizing objects in real time in Swift language on the iOS mobile platform. A convolutional neural network with YOLOv3 architecture was used along with framework for working with neural networks for mobile CoreML applications. A method of improving the performance of such a neural network is proposed, which is based on the quantization of the neural network weights and minimizes the model size and search time of its objects. The frequencies of image processing using the proposed means and models of neural networks of the type YOLOv3-tiny, YOLOv3-416 and our own model YOLOv3-KD are investigated. The possibility of functioning of the proposed funds in real time is provided.

[1] Wikipedia.  Binary  classification  [Elektronnyj  resurs]  /  Chicago  2019.  Rezhym  dostupu: https://en.wikipedia.org/wiki/Binary_classification.

[2] Wikipedia. Support-vector machine [Elektronnyj resurs] / Chicago 2019. Rezhym dostupu: https://en.wikipedia.org/wiki/Support-vector_machine.

[3] Wikipedia. Artificial neural network [Elektronnyj resurs] / Chicago 2019. Rezhym dostupu: https://en.wikipedia.org/wiki/Artificial_neural_network.

[4] Wikipedia. Convolutional neural network [Elektronnyj resurs] / Chicago 2019. Rezhym dostupu: https://en.wikipedia.org/wiki/Convolutional_neural_network.

[5-7]. Rohith Gandhi. R-CNN, Fast R-CNN, Faster R-CNN, YOLO. Object Detection Algorithms [Elektronnyj resurs] / San-Francisco 2018 – Rezhym dostupu: https://towardsdatascience.com/r-cnn-fast-r-cnn-faster-r-cnn-yoloobject-....

[8]  Wikipedia.  Keras  [Elektronnyj  resurs]  /  Chicago  2019.  Rezhym  dostupu: https://en.wikipedia.org/wiki/Keras.

[9-10]. Hao Gao. Understand Single Shot MultiBox Detector (SSD) and Implement It in Pytorch [Elektronnyj resurs] / San-Francisco 2018. Rezhym dostupu: https://medium.com/@smallfishbigsea/understand-ssd-andimplement-your-own....

[11] Juan Garcia, Reza Bakhshandeh. Methods and systems for object recognition. [Elektronnyjresurs] / MenloPark2016-Rezhym dostupu: https://patents.google.com/patent/US948940.

[12] Wikipedia.  Google  Translate  [Elektronnyj  resurs]  /  Chicago  2019.  Rezhym  dostupu: https://en.wikipedia.org/wiki/Google_Translate