Education, Science, Technology, Innovation and Life
Open Access
Sign In

Research on Image Recognition Based on Different Depths of VGGNet

Download as PDF

DOI: 10.23977/jipta.2024.070110 | Downloads: 30 | Views: 1034

Author(s)

Handong Song 1, Huimin Lu 2

Affiliation(s)

1 School of Mechanical Engineering, University of Jinan, Jinan, 250024, China
2 School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, 430074, China

Corresponding Author

Handong Song

ABSTRACT

With the advancement of image recognition technology, the significance of convolutional neural networks has continually increased. The VGGNet model, developed by the Visual Geometry Group at the University of Oxford, has proven successful, demonstrating that the depth of the network is crucial for model performance. This study aims to explore the impact of various depths of VGGNet models on image recognition tasks. Three classic VGG network models were selected: VGG-13, VGG-16, and VGG-19, along with two widely used image datasets, MNIST and CIFAR-10, for a comprehensive experimental analysis. The experimental results on the CIFAR-10 dataset indicated that as network depth increased, there was a significant enhancement in model accuracy, with VGG-19 performing the best. This outcome confirms the superiority of deep networks in processing complex image data. Conversely, in the simpler MNIST dataset, the deeper VGG-19 did not exhibit better performance compared to VGG-13, suggesting that excessively deep networks might not be necessary for simple datasets and could lead to overfitting and gradient vanishing.

KEYWORDS

VGGNet, Convolutional neural network, Image recognition, Deep learning

CITE THIS PAPER

Handong Song, Huimin Lu, Research on Image Recognition Based on Different Depths of VGGNet. Journal of Image Processing Theory and Applications (2024) Vol. 7: 84-90. DOI: http://dx.doi.org/10.23977/jipta.2024.070110.

REFERENCES

[1] Li D, Deng L, Cai Z .Research on image classification method based on convolutional neural network[J].Neural Computing and Applications, 2020, 33(14):1-11.
[2] Fajri D , Mahmudy W , Yulianti T .Detection of Disease and Pest of Kenaf Plant Based on Image Recognition with VGGNet19 [J]. Knowl. Eng. Data Sci. 2021.DOI:10.17977/um018v4i12021p55-68.
[3] Umang P, Shruti B, Avik H .Enhancing cross-domain transferability of black-box adversarial attacks on speaker recognition systems using linearized backpropagation[J].Pattern Analysis and Applications, 2024, 27(2)
[4] Song X, Fan L. Human Posture Recognition and Estimation Method Based on 3D Multiview Basketball Sports Dataset [J].Complexity, 2021. DOI:10.1155/2021/6697697.
[5] Srivastava N, Hinton E G, Krizhevsky A, et al. Dropout: a simple way to prevent neural networks from overfitting. [J]. Journal of Machine Learning Research, 2014, 15(1):1929-1958.
[6] Liu Aili, Chen Zhixiang. ReLU depth network structure and activation function approximation [J]. Journal of shaoxing liberal art &science college, 2024, 44 (02): 58-68.
[7] Liao Qing-Jiang, Liu Ting, Zhang Xing-Yue, et al. For facial expression recognition based on VGGNet study [J]. Journal of software engineering, 2023, 26 (11): 59-62. 
[8] Simonyan K, Zisserman A .Very Deep Convolutional Networks for Large-Scale Image Recognition. [J]. CoRR, 2014, abs/1409.1556.

Downloads: 2476
Visits: 175641

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.