244 Followers
268 Following
87 Posts
Product Design x Creative Technology

Shape-aware Text-driven Layered Video Editing

"[A] pre-trained text-conditioned diffusion model as guidance for refining shape distortion and completing unseen regions. The experimental results demonstrate that our method can achieve shape-aware consistent video editing and compare favorably with the state-of-the-art."

https://text-video-edit.github.io

Shape-aware Text-driven Layered Video Editing Demo

Stable Diffusion & OpenCV for Face Detection and Automatic Compositing
https://www.youtube.com/watch?v=ISr_gTkO42M
Tutorial: Stable Diffusion & OpenCV for Face Detection and Automatic Compositing

YouTube
Audio-visual Expression Using Image Generation AI — Development Case Study at MUTEK.JP

I was in charge of visuals for Nao Tokui — Emergent Rhythm (AI Generative Live Set) at MUTEK.JP on Thursday, 12/8. Emergent Rhythm” is an improvisational performance built entirely from thre sounds…

Qosmo Lab
NVIDIA Broadcast 1.4 Adds Eye Contact and Vignette Effects With Virtual Background Enhancements

Plus new options to mirror your camera and take a selfie.

NVIDIA
Uncrop: Outpainting With AI
https://neural.love/uncrop
Uncrop Image – Online Outpainting – AI Aspect Ratio Changer – Try It for Free | neural.love

Do you want to synthesize the rest of your image or to uncrop a photo? Our online AI aspect ratio changer is here to help.

BoxInstSeg is a toolbox that aims to provide state-of-the-art box-supervised instance segmentation algorithms.

https://github.com/LiWentomng/BoxInstSeg
https://deepai.org/publication/box2mask-box-supervised-instance-segmentation-via-level-set-evolution

GitHub - LiWentomng/BoxInstSeg: A toolbox for box-supervised instance segmentation.

A toolbox for box-supervised instance segmentation. - GitHub - LiWentomng/BoxInstSeg: A toolbox for box-supervised instance segmentation.

GitHub

[Google] Muse: Text-To-Image Generation via Masked Generative Transformers

"[A] text-to-image Transformer model that achieves state-of-the-art image generation performance while being significantly more efficient than diffusion or autoregressive models. Muse is trained on a masked modeling task in discrete token space: given the text embedding extracted from a pre-trained large language model (LLM), Muse is trained to predict randomly masked image tokens."

https://muse-model.github.io

Muse: Text-To-Image Generation via Masked Generative Transformers

Game streaming with NovelAI + GPT-3 — AI VTuber experiment by Yuske Fukuyama

https://www.nicovideo.jp/watch/sm41575169
https://github.com/fkymy

AIにゲーム実況させてみた #1

AIにゲーム実況させてみた #1 [技術・工作] マリオカート8DXを実況するAIVTuberを作ってみました。これから、少しずつ進化をさせていきたいです...

ニコニコ動画

Deep Reconstruction of 3D Smoke Densities from Artist Sketches

"We present a method to compute a 3D smoke density field directly from 2D artist sketches, bridging the gap between early-stage prototyping of smoke keyframes and pre-visualization."

https://www.youtube.com/watch?v=PF7QqNZ28hk
https://cgl.ethz.ch/publications/papers/paperKim22a.php

Deep Reconstruction of 3D Smoke Densities from Artist Sketches (EUROGRAPHICS 2022)

YouTube
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
https://tuneavideo.github.io
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

A new method for text-to-video generation using one text-video pair.

Tune-A-Video