Research on automatic annotation and classification of video content based on machine learning

The Convolutional Mini-batch Gradient (CMG) architecture represents a pioneering synthesis of Convolutional Neural Networks (CNNs) and the Mini-batch Gradient Descent (MGD) training technique. This amalgamation leverages the inherent spatial feature extraction prowess of CNNs and the computational e...

Full description

Bibliographic Details
Published in:Proceedings of 2024 IEEE 6th International Conference on Civil Aviation Safety and Information Technology, ICCASIT 2024
Main Author: Muying L.; Yuming Q.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85217410959&doi=10.1109%2fICCASIT62299.2024.10827856&partnerID=40&md5=be62a31c95fa251fbbf57305fb1692ab
id 2-s2.0-85217410959
spelling 2-s2.0-85217410959
Muying L.; Yuming Q.
Research on automatic annotation and classification of video content based on machine learning
2024
Proceedings of 2024 IEEE 6th International Conference on Civil Aviation Safety and Information Technology, ICCASIT 2024


10.1109/ICCASIT62299.2024.10827856
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85217410959&doi=10.1109%2fICCASIT62299.2024.10827856&partnerID=40&md5=be62a31c95fa251fbbf57305fb1692ab
The Convolutional Mini-batch Gradient (CMG) architecture represents a pioneering synthesis of Convolutional Neural Networks (CNNs) and the Mini-batch Gradient Descent (MGD) training technique. This amalgamation leverages the inherent spatial feature extraction prowess of CNNs and the computational efficiency of MGD, ushering in a novel approach to the automated annotation and categorization of video content. At the heart of CMG lies the utilization of CNNs to conduct a thorough excavation of visual features embedded within individual video frames. Concurrently, the Mini-batch Gradient Descent strategy accelerates the iteration of the model, facilitating faster learning cycles and enhancing the model's adaptability. This dual-pronged approach yields a significant boost in the precision and recall rates associated with video analysis, thereby enriching the comprehensiveness and accuracy of the resultant annotations. Through a series of empirical evaluations and comparisons, the CMG framework has not only demonstrated a considerable augmentation in the resilience and computational efficacy of the model when operating in intricate and variable environments but has also substantiated its reliability and practicality in the realm of automated video content labeling and classification. The model's ability to maintain high levels of performance despite environmental complexities is a testament to its robust design. Furthermore, CMG's proven track record in enhancing computational efficiency and delivering precise outcomes positions it as a viable and promising solution for advancing the frontiers of video content understanding and analysis in the era of big data and machine learning. © 2024 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Muying L.; Yuming Q.
spellingShingle Muying L.; Yuming Q.
Research on automatic annotation and classification of video content based on machine learning
author_facet Muying L.; Yuming Q.
author_sort Muying L.; Yuming Q.
title Research on automatic annotation and classification of video content based on machine learning
title_short Research on automatic annotation and classification of video content based on machine learning
title_full Research on automatic annotation and classification of video content based on machine learning
title_fullStr Research on automatic annotation and classification of video content based on machine learning
title_full_unstemmed Research on automatic annotation and classification of video content based on machine learning
title_sort Research on automatic annotation and classification of video content based on machine learning
publishDate 2024
container_title Proceedings of 2024 IEEE 6th International Conference on Civil Aviation Safety and Information Technology, ICCASIT 2024
container_volume
container_issue
doi_str_mv 10.1109/ICCASIT62299.2024.10827856
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85217410959&doi=10.1109%2fICCASIT62299.2024.10827856&partnerID=40&md5=be62a31c95fa251fbbf57305fb1692ab
description The Convolutional Mini-batch Gradient (CMG) architecture represents a pioneering synthesis of Convolutional Neural Networks (CNNs) and the Mini-batch Gradient Descent (MGD) training technique. This amalgamation leverages the inherent spatial feature extraction prowess of CNNs and the computational efficiency of MGD, ushering in a novel approach to the automated annotation and categorization of video content. At the heart of CMG lies the utilization of CNNs to conduct a thorough excavation of visual features embedded within individual video frames. Concurrently, the Mini-batch Gradient Descent strategy accelerates the iteration of the model, facilitating faster learning cycles and enhancing the model's adaptability. This dual-pronged approach yields a significant boost in the precision and recall rates associated with video analysis, thereby enriching the comprehensiveness and accuracy of the resultant annotations. Through a series of empirical evaluations and comparisons, the CMG framework has not only demonstrated a considerable augmentation in the resilience and computational efficacy of the model when operating in intricate and variable environments but has also substantiated its reliability and practicality in the realm of automated video content labeling and classification. The model's ability to maintain high levels of performance despite environmental complexities is a testament to its robust design. Furthermore, CMG's proven track record in enhancing computational efficiency and delivering precise outcomes positions it as a viable and promising solution for advancing the frontiers of video content understanding and analysis in the era of big data and machine learning. © 2024 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1825722578699812864