Gen-1: The Next Step Forward for Generative AI
Use words and images to generate new videos out of existing ones.
Shape-aware Text-driven Layered Video Editing
"[A] pre-trained text-conditioned diffusion model as guidance for refining shape distortion and completing unseen regions. The experimental results demonstrate that our method can achieve shape-aware consistent video editing and compare favorably with the state-of-the-art."
[Google] MusicLM: Generating Music From Text
"We introduce MusicLM, a model generating high-fidelity music from text descriptions such as 'a calming violin melody backed by a distorted guitar riff'. MusicLM casts the process of conditional music generation as a hierarchical sequence-to-sequence modeling task, and it generates music at 24 kHz that remains consistent over several minutes."