Parul University Digital Repository

An Efficient Deep Convolutional Neural Network Approach for Object Detection and Recognition from a Video Sequence using Multi-Scale Anchor Box


dc.contributor.author Garg, Dweepna
dc.date.accessioned 2021-01-27T11:10:25Z
dc.date.available 2021-01-27T11:10:25Z
dc.date.issued 2020
dc.identifier.uri http://ir.paruluniversity.ac.in:8080/xmlui/handle/123456789/9820
dc.description For the full thesis, kindly contact the respective library. en_US
dc.description.abstract Deep learning marks a new era of machine learning in which computers are trained to find structure in massive amounts of data. Learning is described at multiple levels of representation, which makes it possible to make sense of data consisting of text, sound, and images. Many computer vision problems, such as object detection, image classification, and semantic segmentation, have been addressed using convolutional neural networks. Object detection in videos involves confirming the presence of an object in the image or video and then locating it accurately for recognition. Detecting and recognizing a still object in an image has shown comparatively better performance with detection frameworks such as R-CNN, Fast R-CNN, and Faster R-CNN. The challenge is to detect and recognize a moving object captured by a static camera efficiently and accurately. For video, modeling techniques suffer from high computation and memory costs, which may reduce both the accuracy and the efficiency with which objects are identified. The motive behind this work is to accurately detect and recognize moving and still objects from a video sequence in real time using deep learning. Existing object detection algorithms based on deep convolutional neural networks work well for large objects. However, those models fail to detect objects of varying size that have low resolution. This is because, after the repeated convolution operations of existing models, the features no longer fully represent the essential characteristics of the objects. The proposed work improves detection accuracy by extracting object features at different sizes and scales using a multi-scale anchor box. With the help of a CNN, deep knowledge is extracted from the dataset by giving the model a rigorous set of training samples.
Our model achieves 84.49 mAP on the test set of the Pascal VOC-2007 dataset, which is higher than that of the state-of-the-art models. With accuracy taken as one of the evaluation measures, objects are detected and recognized at 11 FPS, which is comparatively better than other real-time object detection models. The model is also trained and tested for face detection using the FDDB dataset and is able to detect partially covered faces, which serves as one real-time application of the proposed work. en_US
dc.language.iso en en_US
dc.publisher Parul University en_US
dc.subject Deep Convolutional en_US
dc.subject Anchor Box en_US
dc.subject Neural Network Approach en_US
dc.title An Efficient Deep Convolutional Neural Network Approach for Object Detection and Recognition from a Video Sequence using Multi-Scale Anchor Box en_US
dc.title.alternative 150300402002 en_US
dc.type Thesis en_US
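
The abstract in this record names a multi-scale anchor box as the mechanism for covering objects of different sizes. The record does not give the thesis's anchor configuration, so what follows is only a minimal Python sketch of the general technique, not the author's implementation. The function names (generate_anchors, iou) and the grid size, scales, and aspect ratios are all illustrative assumptions.

```python
from itertools import product
from math import sqrt


def generate_anchors(grid_size, image_size, scales, ratios):
    """Return (cx, cy, w, h) anchors in pixels for every cell of a square grid.

    All parameters are hypothetical; the record does not state the values
    used in the thesis.
    """
    stride = image_size / grid_size
    anchors = []
    for row, col in product(range(grid_size), repeat=2):
        # Anchors are centred on the grid cell.
        cx = (col + 0.5) * stride
        cy = (row + 0.5) * stride
        for scale, ratio in product(scales, ratios):
            # ratio is width / height; the box area stays scale ** 2.
            w = scale * sqrt(ratio)
            h = scale / sqrt(ratio)
            anchors.append((cx, cy, w, h))
    return anchors


def iou(box_a, box_b):
    """Intersection over union of two (cx, cy, w, h) boxes."""
    ax1, ay1 = box_a[0] - box_a[2] / 2, box_a[1] - box_a[3] / 2
    ax2, ay2 = box_a[0] + box_a[2] / 2, box_a[1] + box_a[3] / 2
    bx1, by1 = box_b[0] - box_b[2] / 2, box_b[1] - box_b[3] / 2
    bx2, by2 = box_b[0] + box_b[2] / 2, box_b[1] + box_b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


# Assumed configuration: a 4x4 grid over a 128-pixel image,
# three scales and three aspect ratios per cell.
anchors = generate_anchors(4, 128, scales=(16, 32, 64), ratios=(0.5, 1.0, 2.0))
print(len(anchors))  # 4 * 4 cells * 9 anchors per cell
```

In a detector of this kind, each ground-truth box is typically matched to the anchors with the highest IoU, so the smaller scales give small or low-resolution objects a candidate box of comparable size while the larger scales cover large objects.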

