Mastodawn

Steve Troughton-Smith Apr 18, 2024

Here’s your AI astonishment/nightmare fuel for today:

"TL;DR: single portrait photo + speech audio = hyper-realistic talking face video with precise lip-audio sync, lifelike facial behavior, and naturalistic head movements, generated in real time.”

https://www.microsoft.com/en-us/research/project/vasa-1/

VASA-1 - Microsoft Research

Opens in a new tab

Microsoft Research

Show thread

Scott Jann

@stroughtonsmith I made pretty much the same thing on classic Mac OS using MacInTalk to make a cartoon face animate like 30 years ago https://www.zenwheel.com/software/mouth.html

VASA-1 - Microsoft Research

zenwheel»mouth