Paper published in a journal (Scientific congresses and symposiums)
Single node deep learning frameworks: Comparative study and CPU/GPU performance analysis
Lerat, Jean-Sébastien; Mahmoudi, Sidi; Mahmoudi, Saïd
2023In Concurrency and Computation: Practice and Experience, 35 (14)
Peer Reviewed verified by ORBi
 

Files


Full Text
Distributed_Deep_Learning__From_Single_Node_to_Multi_Node_Architecture.pdf
Author postprint (643.69 kB)
Download

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Keywords :
artificial intelligence; CPU; deep learning; distributed computing; frameworks; GPU; parallel computing; Comparatives studies; Deep learning; Design and implementations; Efficient sets; Framework; High-level programming; Learning frameworks; Parallel com- puting; Performances analysis; Software; Theoretical Computer Science; Computer Science Applications; Computer Networks and Communications; Computational Theory and Mathematics
Abstract :
[en] Deep learning presents an efficient set of methods that allow learning from massive volumes of data using complex deep neural networks. To facilitate the design and implementation of algorithms, deep learning frameworks provide a high-level programming interface. Based on these frameworks, new models, and applications are able to make better and better predictions. One type of deep learning application is the Internet of Things that can gather a continuous flow of data, which causes an explosion of the amount of data. Therefore, to handle this data management issue, computation technologies can offer new perspectives to analyze more data with more complex models. In this context, a cluster of computers can operate to quickly deliver a model or to enable the design of a complex neural network spread among computers. An alternative is to distribute a deep learning task with HPC cloud computing resources and to scale cluster in order to quickly and efficiently train a neural network. As a first step to design an infrastructure aware framework which is able to scale the computing nodes, this work aims to review and analyze the state-of-the-art frameworks by collecting device utilization data during the training task. We gather information about the CPU, RAM and the GPU utilization on deep learning algorithms with and without multi-threading. The behavior of each framework is discussed and analyzed in order to shed light on the strengths and weaknesses of the different deep learning frameworks.
Disciplines :
Computer science
Author, co-author :
Lerat, Jean-Sébastien ;  Faculty of Engineering, University of Mons, Mons, Belgium ; Department of Sciences and Technologies, Haute École en Hainaut, Mons, Belgium
Mahmoudi, Sidi  ;  Université de Mons - UMONS > Faculté Polytechniqu > Service Informatique, Logiciel et Intelligence artificielle
Mahmoudi, Saïd ;  Faculty of Engineering, University of Mons, Mons, Belgium
Language :
English
Title :
Single node deep learning frameworks: Comparative study and CPU/GPU performance analysis
Publication date :
25 June 2023
Event name :
Concurrency and Computation Practice and Expertise
Event date :
Juin 2023
Audience :
International
Journal title :
Concurrency and Computation: Practice and Experience
ISSN :
1532-0626
eISSN :
1532-0634
Publisher :
John Wiley and Sons Ltd
Volume :
35
Issue :
14
Peer reviewed :
Peer Reviewed verified by ORBi
Research unit :
F114 - Informatique, Logiciel et Intelligence artificielle
Research institute :
Infortech
Funding text :
This work was partially funded by the Wallonia-Brussels Federation (JCM/TP/BS/mo/c999)This work was partially funded by the Wallonia‐Brussels Federation (JCM/TP/BS/mo/c999)
Available on ORBi UMONS :
since 12 January 2024

Statistics


Number of views
20 (6 by UMONS)
Number of downloads
59 (2 by UMONS)

Scopus citations®
 
5
Scopus citations®
without self-citations
1
OpenCitations
 
2
OpenAlex citations
 
6

Bibliography


Similar publications



Contact ORBi UMONS