LEAVES: An open-source web-based tool for the scalable annotation and visualisation of large-scale ecoacoustic datasets using cluster analysis

Napier, Thomas, Ahn, Euijoon, Allen-Ankins, Slade, Schwarzkopf, Lin, and Lee, Ickjai (2025) LEAVES: An open-source web-based tool for the scalable annotation and visualisation of large-scale ecoacoustic datasets using cluster analysis. Ecological Informatics, 87. 103026.

[img]
Preview
PDF (Published Version) - Published Version
Available under License Creative Commons Attribution.

Download (3MB) | Preview
View at Publisher Website: https://doi.org/10.1016/j.ecoinf.2025.10...
 
3


Abstract

Ecoacoustics has emerged as a pivotal discipline in the conservation and monitoring of ecosystems, offering insights into species’ behaviour and ecosystem health through soundscape analysis. Central to this is the need for accurate annotations of environmental audio recordings, which underpin the computational models used in ecological monitoring. However, due to the increasingly large scale of datasets, annotation using existing tools and techniques cannot be performed at feasible speeds or with the necessary accuracy required for real-world application. The LEAVES (Large-scale Ecoacoustics Annotation and Visualisation with Efficient Segmentation) platform addresses this gap by leveraging unsupervised clustering techniques optimised for the high-throughput annotation of large-scale ecoacoustics datasets. Our evaluation across six real-world datasets shows that LEAVES improves annotation efficiency by up to 7.12 times compared to manual annotation while maintaining 79%–90% label similarity to validated data. We expect that our proposed tool will greatly accelerate the annotation process when generating high-quality labelled datasets, supporting larger-scale studies with broader community engagement in ecoacoustics research.

Item ID: 84922
Item Type: Article (Research - C1)
ISSN: 1878-0512
Copyright Information: © 2025 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Date Deposited: 18 Mar 2025 22:53
FoR Codes: 46 INFORMATION AND COMPUTING SCIENCES > 4605 Data management and data science > 460502 Data mining and knowledge discovery @ 50%
31 BIOLOGICAL SCIENCES > 3103 Ecology > 310399 Ecology not elsewhere classified @ 50%
SEO Codes: 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220401 Application software packages @ 100%
Downloads: Total: 3
Last 12 Months: 3
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page