This is *amazing*.
You can take an MP3 and encode it into MIDI. The results are horrifyingly mesmerising. Can you hear the vocal?
(Stolen from https://www.tumblr.com/red3blog/135098280942/formeldeharv-i-put-all-i-want-for-christmas-is )
Red No. 3

I'm driving myself up the wall because I swear I can hear the vocal line but I don't know how that could be if it was truly converted to MIDI. Unless you can replicate speech sounds entirely with mod…

Tumblr

…and here's the MIDI version "enhanced" with Adobe's AI Podcast tool.

There is a deeply strange male voice in there at times.

(Thanks to @lazerwalker for the inspiration - https://xoxo.zone/@lazerwalker/111269073594821019)

Emilia (@lazerwalker@xoxo.zone)

cursed fact: Adobe Podcast's "Enhance AI", a tool for noise removal and voice boosting, firmly believes that any audio you give it *must* have human speech. If you upload, say, vocal-free chiptunes playing on a Game Boy, it will *find* the speech. https://lzrwlkr.me/3S424J7

XOXO Zone
@Edent @lazerwalker I fed the audio into Whisper and it seems there's a YouTuber trapped in there
@Edent @lazerwalker @mccllstr i have heard it hears “like and subscribe” when under-stimulated with actual words
@Edent @lazerwalker This has the makings of an audio trivia contest question. Take pop tunes, feed them through, and try to get people to identify them.
@Edent @lazerwalker Thanks for the nightmare fuel.🫠
@Edent 'horrifyingly mesmerising' is exactly right! So weird how you can hear the voice drifting along somewhere underneath the nightmarish piano
@Edent so much better! The nightmare of commercialized Christmas haunting the player piano of conscience.
@Edent that’s how this always sounds to me
@Edent Horrifying, yet somehow...also not horrifying?
@Edent
Well, that was... something.

@Edent So this is doing pitch-to-MIDI analysis? I kinda thought this would be a raw data to MIDI thing which would sound very, VERY weird.

On the other side, you can use tools to embed images in MP3s, as famously used in the soundtrack to the game FEZ. https://venturebeat.com/games/fez-hidden-images/

The Fez soundtrack’s hidden images and how they got there

The soundtrack to developer Polytron's Xbox Live Arcade-exclusive puzzle game Fez has only been out since Friday, but a group of industrious fans has already discovered a number of secret images hidden in individual music tracks.

VentureBeat
@Edent I love how frigging punk it gets when the song actually begins.
@Edent and yes, it's like you're listening to a ghost screaming through a piano
Skrillex midi except played back in Windows 3.11

I was messing around with midis in win3.11 and this outdated meme is the result. The bass drops are my favorite part. The synth used in this is apparently "SB16 Extended Music Synthesis" according to Windows. Basically default soundblaster (emulated by dosbox) settings I suppose. I don't actually know shit about midis. edit: oh my god this was featured in a jpara video LMAO https://youtu.be/Pa-pZlh5sKw?t=371 "ew" -JasonParadise, 2021

Mal Morphix | Invidious

@Edent Nice. Reminds me of Chopsticks, the talking piano.

https://www.youtube.com/watch?v=uBEL3YVzMwk

Robot Piano Catches Fire Playing Rush E (World’s Hardest Song)

YouTube

@Edent OMG, I’m adding that to my Christmas mix *now*

Interesting that I can’t hear the vocal at *all*, even though I’m very familiar with the song and am trying to hear Mariah in that mess. Normally the brain hears what it expects to hear (viz. the “Yanny” & “Laurel” effect, or “Green needle” & “Brainstorm”).

@MichaelPorter @Edent
“… you could ever know…” came through for me.
@Edent Rings true to me. Vowels are relatively simple things, a bit like chords. Three or four "formant" frequencies are usually enough. If we could hear plosives or sibilants I'd be more suspicious.
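
A rough sketch of the "vowels are a bit like chords" idea: take a few formant frequencies and snap each one to the nearest MIDI note, using the standard tuning where A4 = 440 Hz is note 69. The formant values below are approximate textbook figures for an "ah" vowel and are an assumption for illustration, not measurements from the track in this thread. The function names are hypothetical, and nothing here claims to be how the MP3-to-MIDI tool actually works.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def freq_to_midi(freq_hz: float) -> int:
    """Nearest MIDI note number for a frequency (A4 = 440 Hz = note 69)."""
    return round(69 + 12 * math.log2(freq_hz / 440.0))


def midi_to_name(note: int) -> str:
    """Note name with octave, using the convention that note 60 is C4."""
    return f"{NOTE_NAMES[note % 12]}{note // 12 - 1}"


def formants_to_notes(formants_hz):
    """Snap each formant frequency to its nearest MIDI note."""
    return [freq_to_midi(f) for f in formants_hz]


# Assumed, approximate formants (F1, F2, F3) for an "ah" vowel.
# Illustrative values only; real voices vary a lot.
AH_FORMANTS_HZ = [730.0, 1090.0, 2440.0]

if __name__ == "__main__":
    for freq, note in zip(AH_FORMANTS_HZ, formants_to_notes(AH_FORMANTS_HZ)):
        print(f"{freq:7.1f} Hz -> MIDI {note} ({midi_to_name(note)})")
```

Playing the three resulting notes together gives a "chord" whose spectrum has energy roughly where the vowel's formants sit, which is one plausible reason a vowel can seem to survive the trip through a piano.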
@Edent I can only hear a haunted piano calling for a lazy kid named Sparky.
@Edent amazing. And gross. 🤣🤢
@Edent @Stoori Convenient, I was planning on posting this myself again!

@Edent This is insane, horrible and mesmerizing all at the same time.

Really curious though, does anyone know how I can actually hear the vocals? Is it really possible to synthesize speech with only piano sounds?

@houseofleft @Edent Look up Bregman's "sine wave speech" work from the 90s. I'm guessing the pitch of the piano notes combined with the low frequency of the inter-note interval is approximating formants of vowel sounds. Cool illusion.
@Edent This is a great demonstration of something I learned from a friend's work on synthesizing and analyzing speech: the deeply weird ways the sounds we are so used to are made of sometimes quite complex combinations of different frequencies and timing.
@Edent there's a point at which I can't tell if the MIDI version is actually close to the original, or if it's my brain auto-correcting.