Artificial Intelligence (AI) technologies and tools have been advancing at a fast pace. Many new tools continue to evolve and they are increasingly targeted at improving productivity of end consumers.
After my book - Diary of a Successful Loser - was published last month, I had been reaching out to reviewers and bloggers. Along the way, I decided to explore the use of of Text-to-Video AI software to generate a promotional video for the book.
Here is one version of a resulting video : What do you think?
- OpenAI's DALL-e AI - Open AI announced that it removed the waitlist for its DALL-E AI image generator service. More than 1.5M users are now actively creating over 2M images a day with DALL·E—from artists and creative directors to authors and architects—with over 100K users sharing their creations and feedback in our Discord community.
- Make-A-Video - Meta's Make-A-Video is an AI-powered video generator that can create novel video content from text or image prompts, similar to existing image synthesis tools like DALL-E and Stable Diffusion. Make-A-Video research builds on the recent progress made in text-to-image generation technology built to enable text-to-video generation. The system uses images with descriptions to learn what the world looks like and how it is often described. Link to Meta's research paper
- Stable Diffusion - A newly released open source image synthesis model called Stable Diffusion allows anyone with a PC and a decent GPU to conjure up almost any visual reality they can imagine. It can imitate virtually any visual style, and if you feed it a descriptive phrase, the results appear on your screen like magic.
- Imagen Video - Google’s newest AI generator that creates HD video from text prompts. Google's engineers claim it is a text-conditional video generation system based on a cascade of video diffusion models. Given a text prompt, Imagen Video generates high definition videos using a base video generation model and a sequence of interleaved spatial and temporal video super-resolution models. According to Google's research paper, Imagen Video includes several notable stylistic abilities, such as generating videos based on the work of famous painters (the paintings of Vincent van Gogh, for example), generating 3D rotating objects while preserving object structure, and rendering text in a variety of animation styles. Google is hopeful that general-purpose video synthesis models can "significantly decrease the difficulty of high-quality content generation."