Unpublished conference/Abstract (Scientific congresses and symposiums)
Recognition of woodpecker calls using a convolutional deep neural network
Florentin, Juliette; Verlinden, Olivier
20184th ABAV Acoustics Research Day
 

Files


Full Text
FLORENTIN_4th_ABAV_Research_Day_abstract.pdf
Publisher postprint (18.67 kB)
Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Abstract :
[en] Recent research on recognizing bird calls has changed its focus from audio features to images. Indeed, many bird calls, including the most iconic woodpecker calls, can be visually identified on spectrograms. The intention is then to feed the images to a convolutional Deep Neural Network (DNN) and let it detect the important visual patterns and recognize the bird species. We thus designed a network with two convolutional layers for the image analysis and two dense layers to converge towards a diagnosis. The base images are standardized to 1 sec in duration and target the dominant frequency range for woodpeckers, 1--3.5 kHz. This is shorter than the typical woodpecker call, hence the images only capture high-energy moments within the calls. This approach focuses on recognizing syllables first, leaving call structure as a next step. The dataset is derived from Xeno-Canto and private recordings and comprises 12154 images, half of them woodpecker calls (9 classes). The perspicacity of the network is improved using dropout and data augmentation. We also expect the DNN to benefit from being co-trained on broader tasks such as distinguishing birdcalls from other sounds. However, early results speak against the straightforwardness of DNNs. The net assigns the highest probability to the correct class in only 36.4% of the cases. Considering the top three suggestions, the accuracy reaches 77.8%, standing above 90% for four calls (Jynx torquilla, D. medius, D.martius flight call, Dryobates minor) while Picus canus plummets at 14.3%.
Disciplines :
Computer science
Zoology
Physics
Author, co-author :
Florentin, Juliette ;  Université de Mons > Faculté Polytechnique > Mécanique rationnelle, Dynamique et Vibrations
Verlinden, Olivier  ;  Université de Mons > Faculté Polytechnique > Service de Mécanique rationnelle, Dynamique et Vibrations
Language :
English
Title :
Recognition of woodpecker calls using a convolutional deep neural network
Publication date :
21 March 2018
Event name :
4th ABAV Acoustics Research Day
Event place :
Bruxelles, Belgium
Event date :
2018
Research unit :
F703 - Mécanique rationnelle, Dynamique et Vibrations
Research institute :
R100 - Institut des Biosciences
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 14 January 2019

Statistics


Number of views
6 (0 by UMONS)
Number of downloads
0 (0 by UMONS)

Bibliography


Similar publications



Contact ORBi UMONS