Not interested in reading? Just want to play with TransmuSynth? Check out the demo below instead:

https://transmusynth.fly.dev/

#Python #Cryptography #ImageProcessing #AudioProcessing #MIDI #Music #TransmuSynth

I wrote up a blog post that describes how I combined my image2sound and promp2pixel tools to create the web-based TransmuSynth!

https://johnbeers.xyz/behold-the-transmusynth.html

#Python #Cryptography #ImageProcessing #AudioProcessing #MIDI #Music #TransmuSynth

johnbeers.xyz - Behold, the TransmuSynth!

Python Templates for Neural Image Classification and Spectral Audio Processing – Part 2
https://www.youtube.com/watch?v=TNY2UGQ5kAc
#AudioProcessing #coding #programming #Python
Python Templates for Neural Image Classification and Spectral Audio Processing - Part 2

YouTube
Python Templates for Neural Image Classification and Spectral Audio Processing – Part 2
https://www.youtube.com/watch?v=TNY2UGQ5kAc
#AudioProcessing #coding #programming #Python
Python Templates for Neural Image Classification and Spectral Audio Processing - Part 2

YouTube

Evil Otto by Audio Damage 🎛️
OTT-style multiband comp: 3 bands, up/down comp, sidechain, A/B, visuals

💻 Win/Mac/Linux/iOS (CLAP/VST3/AAX/AU)
🎁 FREE
🔗 https://www.audiodamage.com/pages/evil-otto

#freeplugin #multibandcomp #audioprocessing #audiodamage #musicproduction #mixingmastering #vstplugin

Python Templates for Neural Image Classification and Spectral Audio Processing – Part 2
https://www.youtube.com/watch?v=TNY2UGQ5kAc
#AudioProcessing #coding #programming #Python
Python Templates for Neural Image Classification and Spectral Audio Processing - Part 2

YouTube
Just ran Demucs completely locally on my system (RX 6700 XT / 16 GB RAM).

Demucs is an open source AI model for music source separation, developed by Meta. It can split a full song into individual stems like vocals, drums, bass, and other instruments, making it useful for remixing, transcription, and audio analysis.

Test track: Fear of the Dark by Iron Maiden
(https://www.youtube.com/watch?v=bePCRKGUwAY)

Setup:

- Demucs installed via pip
- Model: htdemucs (default)
- Input converted to WAV using ffmpeg
- GPU acceleration via ROCm

Setting it up is tricky because Demucs is tightly pinned to older PyTorch versions, so you have to install dependencies manually and use "--no-deps" to avoid breaking your (ROCm-)PyTorch setup.

Result:
Very clean vocal separation in most parts. Some artifacts appear during very loud or distorted sections (e.g. emotional peaks or shouting).

Next steps / possibilities:

- Normalize and filter audio before separation
- Extract vocals for transcription or remixing
- Create karaoke / instrumental versions
- Combine with Whisper for lyrics
- Batch processing for datasets
- Model: htdemucs_ft (higher quality)

Video workflow:

- Recorded with OBS
- Edited in Kdenlive
- Transcoded with VAAPI (H.264)

No cloud, real hardware.
Everything runs on Linux, so anyone can set this up.
Works on CPU as well, but much slower.

#Demucs #AI #MachineLearning #AudioSeparation #MusicAI #OpenSource #Linux #ROCm #AMD #DeepLearning #AudioProcessing #Vocals #Karaoke #StemSeparation #SelfHosted #NoCloud #FOSS #Tech #LocalAI #MetaAI
Python Templates for Neural Image Classification and Spectral Audio Processing – Part 2
https://www.youtube.com/watch?v=TNY2UGQ5kAc
#AudioProcessing #coding #programming #Python
Python Templates for Neural Image Classification and Spectral Audio Processing - Part 2

YouTube
Python Templates for Neural Image Classification and Spectral Audio Processing – Part 2
https://www.youtube.com/watch?v=TNY2UGQ5kAc
#AudioProcessing #coding #programming #Python
Python Templates for Neural Image Classification and Spectral Audio Processing - Part 2

YouTube
Python Templates for Neural Image Classification and Spectral Audio Processing – Part 2
https://www.youtube.com/watch?v=TNY2UGQ5kAc
#AudioProcessing #coding #programming #Python
Python Templates for Neural Image Classification and Spectral Audio Processing - Part 2

YouTube