Open Access
Issue
ITM Web Conf.
Volume 78, 2025
International Conference on Computer Science and Electronic Information Technology (CSEIT 2025)
Article Number 04014
Number of page(s) 8
Section Foundations and Frontiers in Multimodal AI, Large Models, and Generative Technologies
DOI https://doi.org/10.1051/itmconf/20257804014
Published online 08 September 2025
  1. F. Su, Q. Lü and R. Luo, “Research Review on Image Classification Based on Deep Learning,” Telecommunications Science, vol. 35, no. 11, pp. 58–74, 2019. [Google Scholar]
  2. H. Wu, B. Xiao, N. Codella, M. Liu, X. Dai, L. Yuan and L. Zhang, “CvT: Introducing Convolutions to Vision Transformers,” in Proc. IEEE/CVF Int. Conf. Computer Vision (ICCV), Montreal, QC, Canada, 2021, pp. 22–31, doi: 10.1109/ICCV48922.2021.00009. [Google Scholar]
  3. Y. Lecun, Y. Bengio and G. Hinton, “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–444, 2015, doi: 10.1038/nature14539. [CrossRef] [PubMed] [Google Scholar]
  4. Y. Qing, W. Liu, L. Feng, et al., “Improved transformer net for hyperspectral image classification,” Remote Sensing, vol. 13, no. 11, p. 2216, Jun. 2021. [Google Scholar]
  5. A. Dosovitskiy, L. Beyer, A. Kolesnikov, et al., “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” arXiv preprint, arXiv:2010.11929, 2020. [Google Scholar]
  6. N.V.B.K. Sai and R. Ishwariya, “Deep Learning Based Binary Classification of Invasive Ductal Carcinoma: A Comparative Study on CNN and VIT Models,” in Proc. IEEE Int. Women in Engineering Conf. Electrical and Computer Engineering (WIECON-ECE), Chennai, India, 2024, pp. 398–403, doi: 10.1109/WIECON-ECE64149.2024.10915172. [Google Scholar]
  7. H. Liu, X. Sun, Y. Li, et al., “Overview of Deep Learning Models for Image Classification Based on Convolutional Neural Networks,” Computer Engineering and Applications, pp. 1–29, 2025. [Online]. Available: http://kns.cnki.net/kcms/detail/11.2127.TP.20250213.1223.013.html. [Google Scholar]
  8. K. Zuo, H. Liang, D. Wang, et al., “Co-Saliency Detection Based on Multi-Scale Feature Extraction and Feature Fusion,” in Proc. 4th Int. Conf. Control and Robotics (ICCR), Guangzhou, China, 2022, pp. 364–368, doi: 10.1109/ICCR55715.2022.10053903. [Google Scholar]
  9. R. Li, “Research on Fine-Grained Image Classification Recognition Fusing Attention and Multi-Scale ViT,” M.S. thesis, Jiangxi Science and Technology Normal University, China, 2024, doi: 10.27751/d.cnki.gjxkj.2024.000225. [Google Scholar]
  10. J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei, “ImageNet: A large-scale hierarchical image database,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA, 2009, pp. 248–255, doi: 10.1109/CVPR.2009.5206848. [Google Scholar]

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.