Intruder Detection from Video Surveillance Using Deep Learning

Video surveillance, or closed-circuit television (CCTV), is a well-known technology globally. Many homeowners use this technology for security purposes. However, the existing system cannot distinguish whether the individual captured in the footage is the homeowner or a stranger. Moreover, the author...

Full description

Bibliographic Details
Published in:2024 IEEE 15th Control and System Graduate Research Colloquium, ICSGRC 2024 - Conference Proceeding
Main Author: Abu Mangshor N.N.; Sabri N.; Aminuddin R.; Rashid N.A.M.; Mohd Johari N.F.; Zaini Jemani M.A.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206629131&doi=10.1109%2fICSGRC62081.2024.10691188&partnerID=40&md5=99c2ed133268b599e4d95cbec7a7ad79
Description
Summary:Video surveillance, or closed-circuit television (CCTV), is a well-known technology globally. Many homeowners use this technology for security purposes. However, the existing system cannot distinguish whether the individual captured in the footage is the homeowner or a stranger. Moreover, the authority still needs to analyze each footage, which can be time-consuming. Therefore, this present study introduces a system that can distinguish the homeowner from any stranger, record a specific chunk of footage when an intruder is detected, and notify the homeowner about the incident. To ensure the success of this project, two Deep Learning (DL) models were trained: the EfficientDet and MobileNets models. The first model, EfficientDet is an object detection model used for the detection of a person. The second model, MobileNets is the image classification model for performing figure recognition of the homeowner. These deep learning models are loaded into a Raspberry Pi 4 (Pi) to act as video surveillance and perform detection together with classification. If an intruder is detected, a notification will be sent to the homeowner together with a short video recording of the incident which is viewable via a web application. Based on the testing performed, the system passed all use cases of the functionality testing. This study confirmed the usability as well as the accuracy of the proposed technique. On accuracy testing, the object detection model achieved an average precision (AP) of 76.00%. As for the image classification model, the accuracy achieved is 85.71%. The ability of this system can be improved with the usage of better hardware for inference such as Google's Coral, an edge TPU, as this would increase the performance of the model in terms speed. © 2024 IEEE.
ISSN:
DOI:10.1109/ICSGRC62081.2024.10691188