Mastodawn

楽曲のChordを推定する
https://qiita.com/y_abe_bc/items/73778c9202ab4f8a7474?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items

#qiita #Python #librosa #Demucs

楽曲のChordを推定する - Qiita

mp3などの楽曲ファイルを入力すると、ボーカルと伴奏を分離し、ボーカルの文字起こしとキー推定、さらに伴奏側のコード進行と小節ごとのキー進行までまとめて出力する CLI を実装しました。実行結果まず、どんな出力が得られるかを先に載せておきます。この CLI はテキスト...

Qiita

PsychoticSheep Mar 23

Just ran Demucs completely locally on my system (RX 6700 XT / 16 GB RAM).

Demucs is an open source AI model for music source separation, developed by Meta. It can split a full song into individual stems like vocals, drums, bass, and other instruments, making it useful for remixing, transcription, and audio analysis.

Test track: Fear of the Dark by Iron Maiden
(https://www.youtube.com/watch?v=bePCRKGUwAY)

Setup:

- Demucs installed via pip
- Model: htdemucs (default)
- Input converted to WAV using ffmpeg
- GPU acceleration via ROCm

Setting it up is tricky because Demucs is tightly pinned to older PyTorch versions, so you have to install dependencies manually and use "--no-deps" to avoid breaking your (ROCm-)PyTorch setup.

Result:
Very clean vocal separation in most parts. Some artifacts appear during very loud or distorted sections (e.g. emotional peaks or shouting).

Next steps / possibilities:

- Normalize and filter audio before separation
- Extract vocals for transcription or remixing
- Create karaoke / instrumental versions
- Combine with Whisper for lyrics
- Batch processing for datasets
- Model: htdemucs_ft (higher quality)

Video workflow:

- Recorded with OBS
- Edited in Kdenlive
- Transcoded with VAAPI (H.264)

No cloud, real hardware.
Everything runs on Linux, so anyone can set this up.
Works on CPU as well, but much slower.

#Demucs #AI #MachineLearning #AudioSeparation #MusicAI #OpenSource #Linux #ROCm #AMD #DeepLearning #AudioProcessing #Vocals #Karaoke #StemSeparation #SelfHosted #NoCloud #FOSS #Tech #LocalAI #MetaAI

Dan Gero Jul 29, 2025

It'd be nice if there was a #Demucs model that could separate laugh tracks from sitcom episodes. I know an #AI laugh track remover exists already, but to be honest, I wasn't impressed at all by the demo. It sounds like it just turned the episode all the way down when a laugh track came in. Unfortunately, I think the reason it can't happen easily yet is because there aren't many public domain croud sounds out there that you can just train AI on if any, or at least, not to my knowledge. #ML

Show thread

fzap Jan 28, 2025

@Lioh

oder „Karaoke für Arme“ 😜

https://www.jaxgeller.com/using-ai-to-turn-youtube-videos-into-karaoke/

mit #yt-dlp #demucs #whisper #ffmpeg

Using AI to turn Youtube videos into Karaoke

Jackson Geller's personal website

Show thread

Andre Louis May 14, 2024

#Demucs vs #Logic 11, short comparison:

Andre Louis Feb 27, 2024

Well damn. By using '-d mps' on my M1 Max, I got #Demucs to run at about 21 seconds per-second.

The Cube Jan 13, 2024

A tip for those wanting to try #OpenVino with #Audacity. If using #Demucs, after the process is complete, unselect all tracks in the project except the very first one, which is the original file before processing. Then, press Alt + T, then hit V to remove the selected track. Otherwise, the file will clip like hell!