Paper published in a book (Scientific congresses and symposiums)
Towards Human Performance on Sketch-Based Image Retrieval
Seddati, Omar; Dupont, Stéphane; Mahmoudi, Saïd et al.
2022In Proceedings of 19th International Conference on Content-based Multimedia Indexing, CBMI 2022
Peer reviewed
 

Files


Full Text
cbmi2022-18 (1).pdf
Author postprint (664.56 kB)
Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Keywords :
CNN; Sketch-based image retrieval; Triplet networks; Batch sizes; Embeddings; Human performance; Image database; Large-scales; Model sharing; Normalisation; Sketch-based image retrievals; State of the art; Triplet network; Human-Computer Interaction; Computer Networks and Communications; Computer Vision and Pattern Recognition; Software
Abstract :
[en] Sketch-based image retrieval (SBIR) solutions are attracting increased interest in the field of computer vision. These solutions provide an intuitive and powerful tool to retrieve images in large-scale image databases. In this paper, we conduct a comprehensive study of classic triplet CNN training pipelines within the SBIR context. We study the impact of embeddings normalization, model sharing, margin selection, batch size, hard mining selection and the evolution of the number of hard triplets during training to propose several avenues for improvement. We also propose dropout column, an adaptation of dropout for triplet network and similar pipelines. In addition, we also introduce a novel approach to build state-of-the-art SBIR solutions that can be used with low power systems. The whole study is conducted using The Sketchy Database, a large-scale SBIR database. We carry out a series of experiments and show that adopting a few simple modifications enhances significantly existing SBIR pipelines (faster training & higher accuracy). Our study enables us to propose an enhanced pipeline that outperforms previous state-of-the-art on the Sketchy Database by a significant margin (a recall of 53.92% compared to 46.2% at k = 1) and reaches almost human performance (54.27%) on a large-scale benchmark.
Disciplines :
Computer science
Author, co-author :
Seddati, Omar  ;  Université de Mons - UMONS > Faculté Polytechnique > Service Information, Signal et Intelligence artificielle
Dupont, Stéphane  ;  Université de Mons - UMONS > Faculté des Sciences > Service d'Intelligence Artificielle
Mahmoudi, Saïd  ;  Université de Mons - UMONS > Faculté Polytechnique > Informatique, Logiciel et Intelligence artificielle
Dutoit, Thierry ;  Université de Mons - UMONS
Language :
English
Title :
Towards Human Performance on Sketch-Based Image Retrieval
Publication date :
14 September 2022
Event name :
International Conference on Content-based Multimedia Indexing
Event place :
Graz, Aut
Event date :
14-09-2022 => 16-09-2022
By request :
Yes
Audience :
International
Main work title :
Proceedings of 19th International Conference on Content-based Multimedia Indexing, CBMI 2022
Publisher :
Association for Computing Machinery
ISBN/EAN :
978-1-4503-9720-9
Peer reviewed :
Peer reviewed
Research unit :
S841 - Artificial Intelligence
Research institute :
Numediart
Available on ORBi UMONS :
since 10 November 2022

Statistics


Number of views
13 (5 by UMONS)
Number of downloads
1 (1 by UMONS)

Scopus citations®
 
1
Scopus citations®
without self-citations
1
OpenCitations
 
0

Bibliography


Similar publications



Contact ORBi UMONS