An Optimised Grid Search Based Framework for Robust Large-Scale Natural Soundscape Classification

Napier, Thomas, Ahn, Euijoon, Allen-Ankins, Slade, and Lee, Ickjai (2023) An Optimised Grid Search Based Framework for Robust Large-Scale Natural Soundscape Classification. In: Lecture Notes in Computer Science (14471) pp. 468-479. From: AI 2023: 36th Australasian Joint Conference on Artificial Intelligence, 28 November - 1 December 2023, Brisbane, QLD, Australia.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: https://doi.org/10.1007/978-981-99-8388-...
 
1


Abstract

Large-scale natural soundscapes are remarkably complex and offer invaluable insights into the biodiversity and health of ecosystems. Recent advances have shown promising results in automatically classifying the sounds captured using passive acoustic monitoring. However, the accuracy performance and lack of transferability across diverse environments remains a challenge. To rectify this, we propose a robust and flexible ecoacoustics sound classification grid search-based framework using optimised machine learning algorithms for the analysis of large-scale natural soundscapes. It consists of four steps: pre-processing including the application of spectral subtraction denoising to two distinct datasets extracted from the Australian Acoustic Observatory, feature extraction using Mel Frequency Cepstral Coefficients, feature reduction, and classification using a grid search approach for hyperparameter tuning across classifiers including Support Vector Machine, k-Nearest Neighbour, and Artificial Neural Networks. With 10-fold cross validation, our experimental results revealed that the best models obtained a classification accuracy of 96% and above in both datasets across the four major categories of sound (biophony, geophony, anthrophony, and silence). Furthermore, cross-dataset validation experiments using a pooled dataset highlight that our framework is rigorous and adaptable, despite the high variance in possible sounds at each site.

Item ID: 81306
Item Type: Conference Item (Research - E1)
ISBN: 78-981-99-8388-9
Copyright Information: © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
Date Deposited: 06 Dec 2023 00:17
FoR Codes: 46 INFORMATION AND COMPUTING SCIENCES > 4611 Machine learning > 461199 Machine learning not elsewhere classified @ 60%
41 ENVIRONMENTAL SCIENCES > 4102 Ecological applications > 410299 Ecological applications not elsewhere classified @ 40%
SEO Codes: 28 EXPANDING KNOWLEDGE > 2801 Expanding knowledge > 280115 Expanding knowledge in the information and computing sciences @ 60%
28 EXPANDING KNOWLEDGE > 2801 Expanding knowledge > 280102 Expanding knowledge in the biological sciences @ 40%
Downloads: Total: 1
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page