What question did this study set out to answer?

The research aims to improve whale pulse detection accuracy using convolutional neural networks by analyzing architectural performance.

March 2, 2026Open Access

Optimising Convolutional Neural Network Architectures for Fin Whale Pulse Detection in Spectrograms

Key Points

The research aims to improve whale pulse detection accuracy using convolutional neural networks by analyzing architectural performance.
Analyzed existing convolutional neural network architectures for whale pulse detection.
Examined internal layer behavior to identify informative features in spectrograms.
Developed a simplified model with just two convolutional layers and a lightweight classifier.
Improved classification accuracy from 87% to 98% with the new architecture.
Reduced variability in performance across training repetitions compared to the original model.

Abstract

Deep neural networks are widely used for image classification in different fields, although selecting an appropriate architecture often remains a trial-and-error process. The purpose of this work is to investigate a convolutional neural network architecture used to detect whale pulses in spectrograms in order to better understand the causes of its underperformance. By examining the behaviour of its internal layers, we show that the early convolutional blocks capture the most informative acoustic features, while deeper layers provide limited additional benefit and, under the considered training conditions, may even degrade classification accuracy. Based on these observations, we derive a simplified architecture consisting of only the first two convolutional layers followed by a lightweight classifier. This network achieves near-optimal performance, improving accuracy from 87% to 98%, and exhibits substantially lower variability between repetitions compared to the original model.

Optimising Convolutional Neural Network Architectures for Fin Whale Pulse Detection in Spectrograms

Key Points

Abstract

Cite This Study