Create a start-and-end frame video (Vidu)
This model series also supports the OpenAI video generation format (see Video generation: Sora compatible formats).
This is the official Vidu video generation API; the public path follows the official Vidu specification.
Current public model: viduq3-turbo. It supports text-to-video, image-to-video, start-and-end-frame video generation, and reference-based video generation. The same model can also be accessed through the Sora compatible /v1/videos format. Common modes are t2v, i2v, i2v_first_last, and reference_images.
This page provides the public request skeleton, task query structure, and recommended validation template for the official Vidu video generation format; the public path retains the official Vidu endpoint format.
Supported models: viduq3-turbo
Unified request fields:
model (string, required): Vidu public model name; currently viduq3-turbo.
prompt (string, optional): Video generation prompt. Required for text-to-video; passed as needed for image-to-video, start-and-end-frame, and reference-based video generation.
images (array[string], optional): Image input. 1 image for image-to-video, 2 images for start-and-end-frame video generation, and 1 to 7 images for reference-based video generation.
videos (array[string], optional): Optional video subject input for reference-based video generation; use it according to the capabilities of the official Vidu model.
subjects (array[object], optional): Subject-library format input for reference-based video generation; may include fields such as subject name, image, video, or voice timbre.
duration (integer, optional): Video duration, in seconds. viduq3-turbo commonly uses 5 seconds; set it according to the official range.
resolution (string, optional): Output resolution; common values are 540p, 720p, and 1080p.
aspect_ratio (string, optional): Output aspect ratio, commonly used for text-to-video and reference-based video generation, such as 16:9, 9:16, and 1:1.
seed (integer, optional): Random seed.
movement_amplitude (string, optional): Motion intensity; common values are auto, small, medium, and large.
audio (boolean, optional): Whether to enable direct audio-video output.
off_peak (boolean, optional): Whether to use off-peak (staggered) generation.
watermark (boolean, optional): Whether to add a watermark.
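As a minimal sketch of how these fields fit together, the Python helper below assembles a request body for start-and-end-frame generation. The function name `build_first_last_frame_body`, its defaults, and the ordering of the two images (start frame first) are illustrative assumptions rather than part of the Vidu specification. The checks use only the common values listed here, and the official API may accept others.

```python
import json

# Common values listed in this document; the official API may accept more.
COMMON_RESOLUTIONS = {"540p", "720p", "1080p"}
COMMON_AMPLITUDES = {"auto", "small", "medium", "large"}


def build_first_last_frame_body(first_image, last_image, prompt=None,
                                duration=5, resolution="720p",
                                movement_amplitude="auto", seed=None):
    """Assemble a start-and-end-frame request body (hypothetical helper)."""
    if resolution not in COMMON_RESOLUTIONS:
        raise ValueError(f"unexpected resolution: {resolution}")
    if movement_amplitude not in COMMON_AMPLITUDES:
        raise ValueError(f"unexpected movement_amplitude: {movement_amplitude}")
    body = {
        "model": "viduq3-turbo",
        # Two images for this mode; start-frame-first ordering is an assumption.
        "images": [first_image, last_image],
        "duration": duration,
        "resolution": resolution,
        "movement_amplitude": movement_amplitude,
    }
    # Optional fields are only included when the caller supplies them.
    if prompt is not None:
        body["prompt"] = prompt
    if seed is not None:
        body["seed"] = seed
    return body


body = build_first_last_frame_body(
    "https://example.com/first.jpg",
    "https://example.com/last.jpg",
    prompt="A cinematic product ad with smooth camera motion",
)
print(json.dumps(body, indent=2))
```

Optional fields are omitted unless set, so the server-side defaults apply to anything the caller leaves out.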
Common model/mode differences:
Vidu text-to-video: Supported model viduq3-turbo; official-format text-to-video.
Vidu image-to-video: Supported model viduq3-turbo; official-format single-image image-to-video.
Vidu start-and-end-frame video: Supported model viduq3-turbo; official-format two-image start-and-end-frame video generation.
Vidu reference-based video: Supported model viduq3-turbo; official-format multi-reference image-to-video.
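The per-mode differences can be checked on the client before a request is sent. The sketch below is a hypothetical validator, not an official one: `check_inputs` and `IMAGE_COUNT_BY_MODE` are names made up for illustration. The image counts come from the field descriptions in this document, and the zero-image range for t2v is an assumption, since the document does not state it.

```python
# Allowed image counts per mode, taken from the field descriptions in this document.
# The (0, 0) range for t2v is an assumption; the document does not state it.
IMAGE_COUNT_BY_MODE = {
    "t2v": (0, 0),
    "i2v": (1, 1),
    "i2v_first_last": (2, 2),
    "reference_images": (1, 7),
}


def check_inputs(mode, images, prompt=None):
    """Return a list of problems; an empty list means the inputs look valid."""
    if mode not in IMAGE_COUNT_BY_MODE:
        return [f"unknown mode: {mode}"]
    problems = []
    low, high = IMAGE_COUNT_BY_MODE[mode]
    if not low <= len(images) <= high:
        problems.append(
            f"{mode} expects between {low} and {high} images, got {len(images)}"
        )
    # The prompt is required only for text-to-video.
    if mode == "t2v" and not prompt:
        problems.append("prompt is required for text-to-video")
    return problems


print(check_inputs("i2v_first_last", ["https://example.com/first.jpg"]))
```

Returning a list of problems rather than raising on the first one lets a caller report every issue in a single pass.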
Authorizations
Use Bearer Token authentication.
Format: Authorization: Bearer sk-xxxxxx
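A small sketch of building the header in Python follows. The helper name `auth_headers` is hypothetical, and the `Content-Type: application/json` entry is an assumption based on the request body being JSON; only the `Authorization: Bearer ...` format comes from this page.

```python
def auth_headers(api_key):
    """Build request headers for Bearer Token authentication (hypothetical helper)."""
    if not api_key or api_key != api_key.strip():
        raise ValueError("API key must be non-empty and free of surrounding whitespace")
    return {
        "Authorization": f"Bearer {api_key}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }


headers = auth_headers("sk-xxxxxx")
print(headers["Authorization"])
```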
Body
Vidu public model name; currently viduq3-turbo.
Available options: viduq3-turbo. Example: "viduq3-turbo"
Image input. Provide 1 image for image-to-video, 2 images for start-and-end-frame generation, and 1 to 7 images for reference-based video generation.
Array length: 2 elements. Example: ["https://example.com/input.jpg"]
Video generation prompt. Required for text-to-video; optional for image-to-video, start-and-end-frame, and reference-based video generation, depending on business needs.
Example: "A cinematic product ad with smooth camera motion"
Optional video subject input for reference-based video generation. Use it according to the capabilities of the official Vidu model.
Subject-library format input for reference-based video generation. It can include fields such as subject name, images, videos, or voice timbre.
Video duration, in seconds. viduq3-turbo commonly uses 5 seconds; set it within the official range.
Example: 5
Output resolution; common values are 540p, 720p, and 1080p.
Available options: 540p, 720p, 1080p. Example: "720p"
Output aspect ratio, commonly used for text-to-video and reference-based video generation, such as 16:9, 9:16, and 1:1.
Available options: 16:9, 9:16, 4:3, 3:4, 1:1. Example: "16:9"
Random seed.
Motion amplitude; common values are auto, small, medium, and large.
Available options: auto, small, medium, large. Example: "auto"
Whether to enable direct audio-video output.
Available options: all, speech_only, sound_effect_only
Whether to use off-peak (staggered) generation.
Whether to add a watermark.
Available options: 1, 2, 3, 4
Response
Task created successfully
The task ID returned when the task is created; use it to query the task.
Task status. Common values: created, queueing, processing, success, failed.
Available options: created, queueing, processing, success, failed
Points consumed by this task.
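Because a task moves through several statuses before finishing, a client typically polls until it reaches success or failed. The sketch below assumes those two are the terminal statuses. `wait_for_task` is a hypothetical helper that takes a caller-supplied `get_status` function, since the query endpoint and response field names are not given on this page. The example drives it with a fake status sequence instead of a real request.

```python
import time

# Assumption: success and failed are terminal; the others mean "keep waiting".
TERMINAL_STATES = {"success", "failed"}
KNOWN_STATES = {"created", "queueing", "processing"} | TERMINAL_STATES


def wait_for_task(get_status, interval=5.0, max_polls=120, sleep=time.sleep):
    """Poll get_status() until a terminal status is reached, then return it."""
    for _ in range(max_polls):
        status = get_status()
        if status not in KNOWN_STATES:
            # Statuses outside the documented list are treated as errors here.
            raise ValueError(f"unexpected task status: {status}")
        if status in TERMINAL_STATES:
            return status
        sleep(interval)
    raise TimeoutError("task did not finish within the polling budget")


# Stand-in for a real task query: yields one documented status per poll.
fake_statuses = iter(["created", "queueing", "processing", "success"])
result = wait_for_task(lambda: next(fake_statuses), sleep=lambda seconds: None)
print(result)
```

Passing `get_status` and `sleep` as parameters keeps the loop independent of any HTTP client and makes it easy to test without waiting.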
