curl --request POST \
  --url https://api.aiid.edu.kg/ent/v2/start-end2video \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "viduq3-turbo",
  "prompt": "A cinematic product ad with smooth camera motion",
  "duration": 5,
  "resolution": "720p",
  "aspect_ratio": "16:9"
}
'

{
  "task_id": "<string>",
  "model": "<string>",
  "prompt": "<string>",
  "images": [
    "<string>"
  ],
  "duration": 123,
  "resolution": "<string>",
  "credits": 123,
  "created_at": "<string>",
  "creations": "<string>"
}

Video generation Vidu

Create a start-and-end frame video(Vidu)

This series supports OpenAI video generation formats (see the link Video generation Sora compatible formats).

Vidu official video generation API; the public path follows the Vidu official specification.

Current public model: viduq3-turbo. Supports text-to-video, image-to-video, start-and-end-frame video generation, and reference-based video generation. The same model also supports access via the /v1/videos Sora compatible format. Common mode are t2v, i2v, i2v_first_last, and reference_images.

Provides the public request skeleton, task query structure, and recommended validation template for the Vidu official video generation format, while the public path retains the Vidu official endpoint format.

Supported models: viduq3-turbo

Unified request fields:

model (string, required): Vidu public model name, currently public viduq3-turbo.
prompt (string, optional): Video generation prompt; required for text-to-video, and passed as needed for image-to-video, start-and-end-frame, and reference-based video generation.
images (array[string], optional): Image input. 1 image for image-to-video, 2 images for start-and-end-frame video generation, and 1~7 images for reference-based video generation.
videos (array[string], optional): Optional video subject input for reference-based video generation, used according to the capabilities of the Vidu official model.
subjects (array[object], optional): Subject-library format input for reference-based video generation; may include fields such as subject name, image, video, or voice timbre.
duration (integer, optional): Video duration, in seconds. viduq3-turbo commonly uses 5 seconds; set it according to the official range.
resolution (string, optional): Output resolution, common values are 540p, 720p, and 1080p.
aspect_ratio (string, optional): Output aspect ratio, commonly used for text-to-video/reference-based video generation, such as 16:9, 9:16, and 1:1.
seed (integer, optional): Random seed.
movement_amplitude (string, optional): Motion intensity, common values are auto, small, medium, and large.
audio (boolean, optional): Whether to enable direct audio-video output.
off_peak (boolean, optional): Whether to use staggered generation.
watermark (boolean, optional): Whether to add a watermark.

Common model/mode differences:

Vidu 文生视频: Supported model viduq3-turbo; official-format text-to-video.
Vidu 图生视频: Supported model viduq3-turbo; official-format single-image image-to-video.
Vidu 首尾帧生视频: Supported model viduq3-turbo; official-format two-image start-and-end-frame video generation.
Vidu 参考生视频: Supported model viduq3-turbo; official-format multi-reference image-to-video.

POST

ent

start-end2video

curl --request POST \
  --url https://api.aiid.edu.kg/ent/v2/start-end2video \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "viduq3-turbo",
  "prompt": "A cinematic product ad with smooth camera motion",
  "duration": 5,
  "resolution": "720p",
  "aspect_ratio": "16:9"
}
'

{
  "task_id": "<string>",
  "model": "<string>",
  "prompt": "<string>",
  "images": [
    "<string>"
  ],
  "duration": 123,
  "resolution": "<string>",
  "credits": 123,
  "created_at": "<string>",
  "creations": "<string>"
}

Authorizations

Authorization

string

header

required

Use Bearer Token authentication. Format: Authorization: Bearer sk-xxxxxx

Body

application/json

model

enum<string>

required

Vidu external model name, currently public viduq3-turbo.

Available options:

viduq3-turbo

Example:

"viduq3-turbo"

images

string<uri>[]

required

Image input. For image-to-video, provide 1 image; for start/end frames, provide 2 images; for reference video generation, provide 1–7 images.

Required array length: 2 elements

Example:

["https://example.com/input.jpg"]

prompt

string

Video generation prompt; required for text-to-video, and optional for image-to-video, first-and-last-frame-to-video, and reference-to-video, depending on business needs.

Example:

"A cinematic product ad with smooth camera motion"

videos

string<uri>[]

For reference video generation, the video subject input is optional. Use it according to the capabilities of the Vidu official model.

subjects

object[]

Enter in the same format as the video subject library reference generation input. It can include fields such as subject name, images, videos, or voice timbre.

Hide child attributes

subjects.name

string

subjects.images

string<uri>[]

subjects.videos

string<uri>[]

subjects.voice_id

string

subjects.server_id

string

auto_subjects

boolean

duration

integer

Video duration, in seconds. viduq3-turbo Commonly 5 seconds; set it within the official range.

Example:

5

resolution

enum<string>

Output resolution, common values 540p, 720p, and 1080p.

Available options:

540p,

720p,

1080p

Example:

"720p"

aspect_ratio

enum<string>

Output aspect ratio, commonly used for text-to-video/reference-to-video generation, such as 16:9, 9:16, and 1:1.

Available options:

16:9,

9:16,

4:3,

3:4,

1:1

Example:

"16:9"

seed

integer

Random seed.

movement_amplitude

enum<string>

Motion range, common values auto, small, medium, large.

Available options:

auto,

small,

medium,

large

Example:

"auto"

audio

boolean

Whether to enable direct audio/video output.

audio_type

enum<string>

Available options:

all,

speech_only,

sound_effect_only

voice_id

string

is_rec

boolean

bgm

boolean

payload

string

off_peak

boolean

Whether to use staggered generation.

watermark

boolean

Whether to add a watermark.

wm_position

enum<integer>

Available options:

1,

2,

3,

4

wm_url

string<uri>

callback_url

string<uri>

Response

200 - application/json

Task created successfully

task_id

string

required

The task ID returned when creating a task, used to query the task.

state

enum<string>

required

Task status. Common values: created, queueing, processing, success, failed.

Available options:

created,

queueing,

processing,

success,

failed

model

string

prompt

string

images

string[]

duration

integer

resolution

string

credits

integer

Points consumed for this task.

created_at

string

creations

string

Create reference-generated video(Vidu)Query tasks(Vidu)