Recent Papers | Research Publications

A SURVEY OF DEEP LEARNING ARCHITECTURES FOR OBJECT DETECTION IN COMPUTER VISION

Dr Neha yadav Neha yadav

Download Paper

Paper Contents

Abstract

Object Detection is a prominent area in computer vision, where deep learning has dramatically advanced in many areas-from autonomous driving and healthcare to surveillance. Discuss the development of deep learning models for object detection: two-stage detectors like Faster R-CNN, one-stage detectors as YOLO and SSD, and emerging transformer-based models like DETR. We discuss strengths and weaknesses of each type of model with respect to accuracy, speed, and efficiency of resources used, specifically looking at the challenges such models pose in real applications like occlusion, detection of small objects, and domain adaptation. Finally, we describe how large datasets like MS COCO and PASCAL VOC became important to the development of benchmarks. Future promising research directions would be multi-modal learning, lightweight models for resource-constrained devices, and ethics considerations for privacy-sensitive applications. This review tries to outline the state-of-the-art object detection methodology available nowadays, indicates the challenges of the present situation, and points out how further development might occur. Keywords: Computer Vision, Deep Learning, R-CNN, YOLO, SSD, DETR, MS COCO, PASCAL VOC, Multi-modal learning.

Copyright

Paper ID: IJPREMS50200030844

ISSN: 2321-9653

Publisher: ijprems

Abstract
Copyright