Unsupervised clustering of coral reef bioacoustics
An unsupervised process is described for clustering automatic detections in an acoustically active coral reef soundscape. First, acoustic metrics were extracted from spectrograms and timeseries of each detection based on observed properties of signal types and classified using unsupervised clustering methods. Then, deep embedded clustering (DEC) was applied to fixed-length power spectrograms of each detection to learn features and clusters. The clustering methods were compared on simulated bioacoustic signals for fish calls and whale song units with randomly varied signal parameters and additive white noise. Overlap and density of the handpicked features led to reduced accuracy for unsupervised clustering methods. DEC clustering identified clusters with fish calls, whale song, and events with simultaneous fish calls and whale song, but accuracy was reduced when the class sizes were imbalanced. Both clustering approaches were applied to acoustic events detected on directional autonomous seafloor acoustic recorder (DASAR) sensors on a Hawaiian coral reef in February-March 2020. Unsupervised clustering of handpicked features did not distinguish fish calls from whale song. DEC had high recall and correctly classified a majority of whale song. Manual labels indicated a class imbalance between fish calls and whale song at a 3-to-1 ratio, likely leading to reduced DEC clustering accuracy.
READ FULL TEXT