Meta has announced SAM Audio, an AI model for audio separation that isolates the main sound from the background in an audio or video file. It applies to general audio, music, and speech. #Meta #AICôngNghiệp #PhânTíchÂmThanh #AudioProcessing #AIModel
https://www.reddit.com/r/LocalLLaMA/comments/1pqfmsr/meta_releases_sam_audio_for_audio_separation/
A #webdev project is getting to a good place, so I'm starting to think about the next thing. I'm thinking about building some hardware for realtime audio processing with a #RaspberryPi #ComputeModule and a custom audio interface built around an #RP2040
I want to do some mixing and EQ of multiple audio sources in a small package I can embed into other things. Diving into the #USB spec is a daunting task, but it seems like the most compatible way to send samples fast.
Neural audio codecs: how to get audio into LLMs
https://kyutai.org/next/codec-explainer
#HackerNews #NeuralAudioCodecs #LLMs #AudioTechnology #MachineLearning #AudioProcessing
Not bad for boosting 5.1 movie volume, which is never mixed down to stereo properly by any media player or television I've used (I always have to turn up the volume to even remotely hear the speaking parts, then frantically dig around for the remote once the swell of music or explosions starts):
sox -S .\audio.wav audio-amp.wav remix -m 1,3,4,5 2,3,4,6 gain -n
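For anyone curious what that command does to the samples, here is a rough Python sketch of the arithmetic. It rests on two assumptions that are not stated in the post: that the file uses the usual WAV 5.1 channel order (FL, FR, FC, LFE, BL, BR), and that `remix -m` sums the listed channels at unit gain, as I understand the SoX manual. The function names are made up for illustration, and SoX's internal processing (integer samples, possible clipping between effects) may differ.

```python
# Sketch of `remix -m 1,3,4,5 2,3,4,6` followed by `gain -n`.
# Assumed channel order (not confirmed by the post): FL, FR, FC, LFE, BL, BR.

def downmix_51_to_stereo(frames):
    """frames: list of 6-tuples of floats, one tuple per sample frame."""
    left_sources = (0, 2, 3, 4)   # SoX channels 1,3,4,5 (1-based): FL, FC, LFE, BL
    right_sources = (1, 2, 3, 5)  # SoX channels 2,3,4,6 (1-based): FR, FC, LFE, BR
    stereo = []
    for frame in frames:
        # Manual mode: plain sum, no automatic 1/n scaling (assumption).
        left = sum(frame[i] for i in left_sources)
        right = sum(frame[i] for i in right_sources)
        stereo.append((left, right))
    return stereo


def normalize_peak(stereo, target=1.0):
    """Scale so the loudest sample hits `target`, roughly what `gain -n` aims for."""
    peak = max((max(abs(l), abs(r)) for l, r in stereo), default=0.0)
    if peak == 0.0:
        return list(stereo)  # silence: nothing to scale
    scale = target / peak
    return [(l * scale, r * scale) for l, r in stereo]


if __name__ == "__main__":
    frames = [(0.5, 0.25, 0.5, 0.0, 0.0, 0.0)]
    print(normalize_peak(downmix_51_to_stereo(frames)))
```

The center channel (usually dialogue) and the LFE land in both outputs at full level, which is presumably why speech comes through louder than in a typical player downmix.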
Anyone got any handy SoX or other CLI commands to fix podcasts or video audio with highly variable volume? This works pretty well, but I can't add my usual 'gain -n' at the end because it seems to introduce a few odd spikes through the file.
sox -S .\audio.flac .\audio-com.flac compand 0.3,1 6:-70,-60,-20 -18 -90 0.2
#sox #sox2 #dynamicrangecompression #compressor #audioprocessing
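To make the `compand` numbers above a bit less opaque, here is a small Python sketch of the static transfer curve they describe. My reading, based on the SoX manual's rule that an omitted out-dB1 defaults to in-dB1, is that `-70,-60,-20` means the points (-70, -70) and (-60, -20), with (0, 0) implied. The sketch deliberately ignores the 6 dB soft knee, the 0.3 s attack and 1 s decay, the -90 dB initial volume, and the 0.2 s delay, so it shows the steady-state level mapping only, not what SoX does sample by sample. `transfer_db` and `output_db` are illustrative names, not SoX APIs.

```python
from bisect import bisect_right

# Transfer-function points (input dB, output dB) as read from `6:-70,-60,-20`.
# This interpretation is an assumption based on the SoX manual.
POINTS = [(-70.0, -70.0), (-60.0, -20.0), (0.0, 0.0)]
POST_GAIN_DB = -18.0  # the `-18` argument after the point list


def transfer_db(level_db, points=POINTS):
    """Piecewise-linear level mapping in dB, without soft knee or time constants."""
    if level_db <= points[0][0]:
        return level_db  # below the first point: left uncompanded
    if level_db >= points[-1][0]:
        return points[-1][1]
    xs = [p[0] for p in points]
    i = bisect_right(xs, level_db)  # index of the segment's upper point
    (x0, y0), (x1, y1) = points[i - 1], points[i]
    t = (level_db - x0) / (x1 - x0)
    return y0 + t * (y1 - y0)


def output_db(level_db):
    """Level after the transfer curve plus the fixed post-gain."""
    return transfer_db(level_db) + POST_GAIN_DB


if __name__ == "__main__":
    for level in (-80, -70, -65, -60, -30, 0):
        print(level, "->", output_db(level))
```

On this reading, input between -60 dB and 0 dB is squeezed into a 20 dB range before the -18 dB gain pulls everything back down, which is the 3:1 compression that evens out quiet and loud passages.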