Fast and high quality image and audio conditional video generation [model] [code]
for best results - make it as elaborate as possible