Google’s OpenAI Killer: Lumiere multimodal AI unveiled, can create 5-second videos from text, images

submitted by
Style Pass
2024-02-13 16:30:10

Google has launched its latest AI model. Called Lumiere, the multimodal video generation tool can produce realistic 5-second videos using just text or still images as prompts.

Google has introduced its latest artificial intelligence model, Lumiere, a multimodal video generation tool capable of producing realistic 5-second-long videos.

Lumiere supports both text-to-video and image-to-video generation, using a Space-Time U-Net (STUNet) architecture to enhance the realism of motion in AI-generated videos.

According to a preprint paper accompanying the release, Lumiere’s innovation lies in generating the entire video in a single process rather than combining still frames.

This approach allows for the simultaneous creation of both spatial (objects in the video) and temporal (movement within the video) aspects, resulting in a more natural perception of motion.
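The idea of handling space and time together can be illustrated with a toy example. The sketch below is not Lumiere's code and does not reproduce STUNet; it only assumes, for illustration, that a space-time network shrinks a video along the time axis as well as along height and width, whereas a frame-by-frame approach shrinks each frame on its own. The helper names `make_video`, `spacetime_pool` and `shape` are hypothetical.

```python
def make_video(frames, height, width):
    """Build a toy video as nested lists indexed [t][y][x]."""
    return [[[float(t + y + x) for x in range(width)]
             for y in range(height)]
            for t in range(frames)]


def spacetime_pool(video, kt=2, ks=2):
    """Average over non-overlapping kt x ks x ks blocks (time, height, width).

    With kt=1 this reduces to ordinary per-frame spatial pooling.
    """
    T, H, W = len(video), len(video[0]), len(video[0][0])
    out = []
    for t0 in range(0, T - kt + 1, kt):
        frame = []
        for y0 in range(0, H - ks + 1, ks):
            row = []
            for x0 in range(0, W - ks + 1, ks):
                block = [video[t][y][x]
                         for t in range(t0, t0 + kt)
                         for y in range(y0, y0 + ks)
                         for x in range(x0, x0 + ks)]
                row.append(sum(block) / len(block))
            frame.append(row)
        out.append(frame)
    return out


def shape(video):
    """Return (frames, height, width) of a nested-list video."""
    return (len(video), len(video[0]), len(video[0][0]))


video = make_video(frames=8, height=4, width=4)
per_frame = spacetime_pool(video, kt=1, ks=2)  # spatial only: every frame kept
joint = spacetime_pool(video, kt=2, ks=2)      # space and time pooled together

print(shape(video), "->", shape(per_frame), "(per frame)")
print(shape(video), "->", shape(joint), "(space-time)")
```

In the per-frame case the clip keeps all 8 frames and only the resolution drops. In the joint case each output value summarises a small block of neighbouring pixels across neighbouring frames, so motion information is mixed into the same representation as appearance.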
