👁️ Google Veo

Google Veo is an AI model from Google that generates videos from text descriptions or images. It also adds realistic audio, including speech, sound effects, and ambient background sound.

❗️The maximum length of a request is 2000 characters.

🎛 Navigation
💥 Step-by-step guide to using Veo. Text to video
💥 Step-by-step guide to using Veo. Image to video
💥 Step-by-step guide to Veo. Frames Type
💥 Step-by-step guide to Veo. Ingredients Type
💥 Step-by-step guide to Veo. Extension

🏠 Main menu > 🎬 Video of the future > 👁️ Google Veo

💥 Step-by-step guide to using Veo. Text to video

Model settings

Model

We offer three models for video generation: Veo 3.1, Veo 3.1 Fast, and Veo 3.1 Fast Relax. Each combines realistic visuals and generated audio in a single system.

Veo 3.1 is the model for maximum quality and cinematic results. It delivers fine detail, smooth motion, and precise audio synchronization, which makes it well suited to professional content.

Veo 3.1 Fast is a faster, more lightweight version of the model, optimized for speed and efficiency. It is well suited to rapid idea testing and to creating dynamic videos with minimal time and resource costs.

Veo 3.1 Fast Relax is a more cost-efficient version of Veo 3.1 Fast. Output quality remains high, but the mode is designed for steady background processing rather than speed. Choose it when speed is not a priority and stability and efficient resource usage matter most.

Duration

At the moment, all models generate videos of a fixed length: 8 seconds.

Aspect Ratio

Two aspect ratios are available: 9:16 (vertical video) and 16:9 (horizontal video).

If Frames is selected in the “Type” section, both aspect ratios are available: 9:16 and 16:9.

If Ingredients is selected in the “Type” section, only 16:9 is available.
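
The aspect-ratio rules above can be summarized as a small lookup table. The Python sketch below is purely illustrative: the names ALLOWED_ASPECT_RATIOS and is_valid_choice are hypothetical and are not part of any Veo or bot API.

```python
# Hypothetical lookup table summarizing the rules above; not an official API.
ALLOWED_ASPECT_RATIOS = {
    "Frames": {"9:16", "16:9"},   # vertical and horizontal
    "Ingredients": {"16:9"},      # horizontal only
}


def is_valid_choice(generation_type: str, aspect_ratio: str) -> bool:
    """Return True if the aspect ratio is available for the given Type."""
    return aspect_ratio in ALLOWED_ASPECT_RATIOS.get(generation_type, set())


print(is_valid_choice("Frames", "9:16"))       # True
print(is_valid_choice("Ingredients", "9:16"))  # False
```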

Type

When you generate from a text prompt, the “Type” setting only determines which aspect ratios are available.

Structure of a text prompt for standard generation

Once the model settings are configured, write a well-structured text prompt to get accurate results. A good prompt covers three elements:

Main subject and action
This is the foundation of any prompt. Specify who or what is in the frame and what the subject is doing. Keep it clear and avoid unnecessary details or pronouns to prevent confusion.
Environment and time of day
Define the location and the lighting for the time of day. This shapes the atmosphere and color palette of the video.
Camera movement
In Veo, the camera is an important expressive tool. It can move as in real cinematography: zoom in, follow the subject, orbit around objects, and more.

Example:
A massive elephant drinks water from a river in the savanna at sunset. Golden sunlight reflects on the water, and light dust rises around. The camera smoothly moves around the elephant.
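
If you assemble prompts programmatically before sending them, the three-part structure above can be sketched as a small helper. This is a minimal Python illustration, not an official tool: build_prompt and MAX_PROMPT_CHARS are hypothetical names, and the only fact taken from this guide is the 2000-character request limit.

```python
MAX_PROMPT_CHARS = 2000  # request limit stated at the top of this guide


def build_prompt(subject_action: str, environment: str, camera: str) -> str:
    """Join the three prompt parts into one text and enforce the length limit."""
    parts = [subject_action, environment, camera]
    # Skip empty parts, trim whitespace, and end each part with one period.
    sentences = [p.strip().rstrip(".") + "." for p in parts if p.strip()]
    prompt = " ".join(sentences)
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(
            f"Prompt is {len(prompt)} characters; the limit is {MAX_PROMPT_CHARS}."
        )
    return prompt


prompt = build_prompt(
    "A massive elephant drinks water from a river in the savanna at sunset",
    "Golden sunlight reflects on the water, and light dust rises around",
    "The camera smoothly moves around the elephant",
)
print(prompt)
```

The helper reproduces the example prompt above and raises an error instead of silently truncating an overlong prompt.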

Results

Veo 3.1

Veo 3.1 Fast

Veo 3.1 Fast Relax

Structure of a text prompt for generation with voiceover

Once the model settings are configured, write a text prompt that also describes the speech, so that the video is generated with a voiceover.

Main subject and action
This is the foundation of any prompt. Specify who or what is in the frame and what the subject is doing. Keep it clear and avoid unnecessary details or pronouns to prevent confusion.
Environment and time of day
Define the location and the lighting for the time of day. This shapes the atmosphere and color palette of the video.
Indicate that the characters speak
State explicitly in the prompt that a character speaks, so the model knows that speech should be included.
Add the exact phrase the character should say
If you want precise speech, include short quotes. Veo generates a voiceover and lip movements synchronized with the text. Format every quote like a script line, for example:

The cat blinks and says: “No dinner again? What kind of human are you…”

Specify the language of speech
To make the characters speak in a particular language, add a note to the prompt stating which language to use.

Example:
A teacher stands by the blackboard in a bright classroom, with a student holding a notebook nearby. The camera slightly zooms in. The woman says: “Try again, you’ll definitely succeed!” Realistic style, speech in Russian.
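
The voiceover rules above (a script-style quote plus a language note) can be sketched in the same way. The Python helper below is an illustration under stated assumptions: build_voiceover_prompt is a hypothetical name, and the exact wording of the language note is one possible phrasing, not a required format.

```python
def build_voiceover_prompt(scene: str, speaker: str, line: str, language: str) -> str:
    """Append a script-style quote and a language note to a scene description."""
    # Script-line format from this guide: <speaker> says: “<exact phrase>”
    quote = f"{speaker} says: “{line}”"
    return f"{scene.strip()} {quote} Speech in {language}."


prompt = build_voiceover_prompt(
    scene="A teacher stands by the blackboard in a bright classroom. "
          "The camera slightly zooms in.",
    speaker="The teacher",
    line="Try again, you’ll definitely succeed!",
    language="Russian",
)
print(prompt)
```

Naming the speaker by role ("The teacher") rather than by a pronoun keeps it clear which character the line belongs to.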

Results

Veo 3.1

Veo 3.1 Fast

Veo 3.1 Fast Relax