Ever wondered how to render a smooth, audio-reactive waveform using Canvas? We're peeling back the layers of our latest project in building Ramble. Perfect for #AudioProcessing enthusiasts and #WebDev pros alike! https://www.doist.dev/building-ramble-3-visualizing-the-waveform/
Building Ramble #3: Visualizing the Waveform

How we render a smooth, real-time audio waveform with Canvas in the browser

Meta công bố SAM Audio - mô hình AI phân tách âm thanh, tách biệt âm chủ và nền từ file âm thanh hoặc video. Ứng dụng cho âm thanh tổng quát, nhạc, giọng nói. #Meta #AICôngNghiệp #PhânTíchÂmThanh #AudioProcessing #AIModel

https://www.reddit.com/r/LocalLLaMA/comments/1pqfmsr/meta_releases_sam_audio_for_audio_separation/

Bitwave is a new open-source audio format built with Rust & Python. It embeds spatial data & BPM for immersive, adaptive experiences in VR and gaming. https://hackernoon.com/its-time-to-reinvent-the-audio-file-introducing-bitwave #audioprocessing
It’s Time to Reinvent the Audio File: Introducing Bitwave | HackerNoon

Bitwave is a new open-source audio format built with Rust & Python. It embeds spatial data & BPM for immersive, adaptive experiences in VR and gaming.

A #webdev project is getting to a good place, starting to think about the next thing. Thinking about building some hardware for realtime audio processing with a #RaspberryPi #ComputeModule and a custom audio interface with an #RP2040

I want to do some mixing and EQ of multiple audio sources in a small package I can embed into other things. Diving into the #USB spec is a daunting task but seems like the most compatible way to send samples fast.

#ElectricalEngineering #AudioProcessing

Halloween costume audio processing challenge: fabric distortion + ambient noise + acoustic occlusion. Classic edge case that reveals limitations in current speech enhancement algorithms. Physical barriers still trump computational solutions.
#AudioProcessing #EdgeCases #AILimitations
Neural audio codecs: how to get audio into LLMs

Why modeling audio is harder than text, and how to make it feasible with neural audio codecs.

Not bad for boosting 5.1 movie volume, which is never mixed to stereo properly by any media player or television I've used (always have to turn up the volume to even remotely hear the speaking parts, then frantically dig around for the remote once the swell of music or explosions start):

sox -S .\audio.wav audio-amp.wav remix -m 1,3,4,5 2,3,4,6 gain -n

#sox #sox2 #audioprocessing

Anyone got any handy Sox or other CLI commands to fix podcasts, video audio with very variable volume? This works pretty well, but I can't add my usual 'gain -n' at the end because it seems to introduce a few odd spikes through the file.

sox -S .\audio.flac .\audio-com.flac compand 0.3,1 6:-70,-60,-20 -18 -90 0.2

#sox #sox2 #dynamicrangecompression #compressor #audioprocessing

Decided to have another go. #PlugData is considerably less intuitive at this point than the #BitwigStudio Grid environment, but appears to be very light weight. Here we have a patch that is coming together where we have a continuous output that blends between Low Pass, Band Pass and High Pass. #VisualProgramming #SignalProcessing #AudioProcessing
Voxtral | Mistral AI

Introducing frontier open source speech understanding models.