The sperm whale 'phonetic alphabet' revealed by AI

https://lemmy.ca/post/24849958

The sperm whale 'phonetic alphabet' revealed by AI - Lemmy.ca

Sperm whales communicate with each other using rhythmic sequences of clicks, called codas. It was previously thought that sperm whales had just 21 coda types. However, after studying almost 9,000 recordings, the Ceti researchers identified 156 distinct codas. They also noticed the basic building blocks of these codas which they describe as a “sperm whale phonetic alphabet” – much like phonemes, the units of sound in human language which combine to form words. Pratyusha Sharma, a PhD student at MIT and lead author of the study, describes the “fine-grain changes” in vocalisations the AI identified. Each coda consists of between three and 40 rapid-fire clicks. The sperm whales were found to vary the overall speed, or the “tempo”, of the codas, as well as to speed up and slow down during the delivery of a coda, in other words, making it “rubato”. Sometimes they added an extra click at the end of a coda, akin, says Sharma, to “ornamentation” in music. These subtle variations, she says, suggest sperm whale vocalisations could carry a much richer amount of information than previously thought.

I know it’s a general consumption news story, but I wish they’d have at least included a name or basic description of what the “AI” used was. “AI” is about as blandly unspecific as you can get even if it’s only being applied to algorithms that fall into that broad category, let alone how commonly it’s misused nowadays.
That’s the thing, dude, when you write software…it doesn’t have a name. They probably started with a framework like tensor flow and then stacked audio analysis ml modules on top figuratively, but this was probably mostly written in house it’s not a commercial product.
I’m in “artificial intelligence” and write software. I’m not looking for a product name. If they developed an entirely new technique that’d also be cool to mention, but they probably built on existing techniques and were at least working in some broad form of AI.
The github shows them doing some k nearest neighbors land kernel estimation as part of their understanding of the coda (I think? I don’t know anything about whales)
sw-combinatoriality/code/2-tempo.ipynb at main · pratyushasharma/sw-combinatoriality

Dataset and Codebase for paper on "Contextual and Combinatorial Structure in Sperm Whale Vocalisations" - pratyushasharma/sw-combinatoriality

GitHub
The KMeans is actually an unused import, but yeah, I see the kernel estimation. They also use a gaussian mixture in one of the lower cells. So a little AI.
Whales can have a little AI as a treat :3