CAReNet: A Promising AI Architecture for Low Data Regime Mixing Convolutions and Attention

Cools, Aurélie; Belarbi, Mohammed Amin; MAHMOUDI, Sidi

doi:10.1007/978-3-031-78698-3_3

Request a copy

Contribution to collective works (Parts of books)

CAReNet: A Promising AI Architecture for Low Data Regime Mixing Convolutions and Attention

Cools, Aurélie; Belarbi, Mohammed Amin; MAHMOUDI, Sidi

2025 • In Lecture Notes in Networks and Systems

Peer reviewed

Permalink
https://hdl.handle.net/20.500.12907/51091

DOI
10.1007/978-3-031-78698-3_3

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

978-3-031-78698-3-40-50.pdf

Publisher postprint (797.06 kB)

Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Disciplines :

Computer science

Author, co-author :

Cools, Aurélie ; Université de Mons - UMONS > Faculté Polytechnique > Service Informatique, Logiciel et Intelligence artificielle

Belarbi, Mohammed Amin ; Université de Mons - UMONS > Faculté Polytechnique > Service Informatique, Logiciel et Intelligence artificielle

MAHMOUDI, Sidi ; Université de Mons - UMONS > Faculté Polytechnique > Service Informatique, Logiciel et Intelligence artificielle

Language :

English

Title :

CAReNet: A Promising AI Architecture for Low Data Regime Mixing Convolutions and Attention

Publication date :

01 January 2025

Main work title :

Lecture Notes in Networks and Systems

Publisher :

Springer Nature Switzerland

ISBN/EAN :

978-3-03-178698-3
978-3-03-178697-6

Peer reviewed :

Peer reviewed

Additional URL :

https://link.springer.com/content/pdf/10.1007/978-3-031-78698-3_3

Research unit :

F114 - Informatique, Logiciel et Intelligence artificielle

Research institute :

Numediart
Infortech

Available on ORBi UMONS :

since 08 January 2025

Statistics

Number of views

56 (4 by UMONS)

Number of downloads

0 (0 by UMONS)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

OpenCitations

OpenAlex citations

Bibliography

Barhoumi, Y., Rasool, G.: ScopeFormer: N-CNN-ViT hybrid model for intracranial hemorrhage classification. arXiv preprint arXiv:2107.04575 (2021)
Brown, T., et al.: Language models are few-shot learners. Adv. Neural. Inf. Process. Syst. 33, 1877–1901 (2020)
Chowdhary, K., Chowdhary, K.R.: Natural language processing. Fundam. Artif. Intell. 603–649 (2020)
Coates, A., Ng, A., Lee, H.: An analysis of single-layer networks in unsupervised feature learning. In: Gordon, G., Dunson, D., Dudík, M. (eds.) Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research, vol. 15, pp. 215–223. PMLR, Fort Lauderdale (2011). https://proceedings.mlr.press/v15/coates11a.html
Dagli, R.: AstroFormer: more data might not be all you need for classification. arXiv preprint arXiv:2304.05350 (2023)
Dai, Z., Liu, H., Le, Q.V., Tan, M.: CoatNet: marrying convolution and attention for all data sizes. Adv. Neural. Inf. Process. Syst. 34, 3965–3977 (2021)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Deng, L.: The MNIST database of handwritten digit images for machine learning research. IEEE Signal Process. Mag. 29(6), 141–142 (2012)
Dosovitskiy, A., et al.: An image is worth 16\(\times \)16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Johnson, R., Zhang, T.: Deep pyramid convolutional neural networks for text categorization. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 562–570 (2017)
Khan, S., Naseer, M., Hayat, M., Zamir, S.W., Khan, F.S., Shah, M.: Transformers in vision: a survey. ACM Comput. Surv. (CSUR) 54(10s), 1–41 (2022)
Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009)
Le, Y., Yang, X.: Tiny ImageNet visual recognition challenge. CS 231N 7(7), 3 (2015)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
LeCun, Y., Cortes, C.: MNIST handwritten digit database (2010). http://yann.lecun.com/exdb/mnist/
Liu, X., Peng, H., Zheng, N., Yang, Y., Hu, H., Yuan, Y.: EfficientViT: memory efficient vision transformer with cascaded group attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14420–14430 (2023)
Parvaiz, A., Khalid, M.A., Zafar, R., Ameer, H., Ali, M., Fraz, M.M.: Vision transformers in medical computer vision-a contemplative retrospection. Eng. Appl. Artif. Intell. 122, 106126 (2023)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Smith, L.N., Topin, N.: Deep convolutional neural network design patterns. arXiv preprint arXiv:1611.00847 (2016)
Tu, Z., et al.: MaxViT: multi-axis vision transformer. In: European Conference on Computer Vision, pp. 459–479. Springer (2022)
Vaswani, A., et al.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017)
Wang, W., et al.: Pyramid vision transformer: a versatile backbone for dense prediction without convolutions. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 568–578 (2021)
Wu, J., Zhang, Q., Xu, G.: Tiny ImageNet challenge. Technical report (2017)
Xiao, H., Rasul, K., Vollgraf, R.: Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747 (2017)