Create video

curl --request POST \
  --url https://api.aiid.edu.kg/v1/videos \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form model=sora-2 \
  --form 'prompt=<string>' \
  --form image='@example-file' \
  --form duration=123 \
  --form width=512 \
  --form height=512 \
  --form fps=30 \
  --form seed=20231234 \
  --form n=1 \
  --form response_format=url \
  --form user=user-1234 \
  --form 'metadata={}' \
  --form 'mode=<string>' \
  --form 'images=<string>' \
  --form 'content={}' \
  --form 'size=<string>' \
  --form 'seconds=<string>' \
  --form 'input_reference=<string>' \
  --form 'parameters={}'

{
  "id": "video_abc123",
  "object": "video",
  "model": "sora-2",
  "status": "queued",
  "progress": 0,
  "created_at": 1764347090922,
  "seconds": "8"
}

Video generation: Sora-compatible format

Create video

A video generation API compatible with the OpenAI Sora format. Supports text-to-video generation as well as generation from image or video references.

POST /v1/videos: Create video task
GET /v1/videos/{video_id}: Query task status
GET /v1/videos/{video_id}/content: Retrieve the video binary content

Currently not available externally:

GET /v1/videos
POST /v1/videos/{video_id}/remix

Standard authentication headers:

Authorization: Bearer sk-xxxxxx
Content-Type: application/json

Use POST /v1/videos to create a task and obtain its id
Poll GET /v1/videos/{id} until status is completed or failed
After completion, prefer the video_url returned in the response
For unified download, call GET /v1/videos/{id}/content
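The create, poll, download workflow above can be sketched in Python as follows. This is a minimal sketch, not an official client: `wait_for_video`, `download_target`, the polling interval, and the poll limit are hypothetical names and values chosen for illustration. The status lookup is passed in as a function, so the loop does not depend on any particular HTTP library.

```python
import time
from typing import Any, Callable, Dict

# Terminal states named in the workflow above.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_video(
    video_id: str,
    fetch_status: Callable[[str], Dict[str, Any]],
    interval_s: float = 5.0,   # assumption: the docs do not state a polling interval
    max_polls: int = 120,      # assumption: an arbitrary upper bound
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch_status (a GET /v1/videos/{id} wrapper) until the task finishes."""
    for _ in range(max_polls):
        task = fetch_status(video_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        sleep(interval_s)
    raise TimeoutError(f"video {video_id} did not finish after {max_polls} polls")


def download_target(task: Dict[str, Any], base_url: str = "https://api.aiid.edu.kg") -> str:
    """Prefer video_url from the response; otherwise use the unified content endpoint."""
    if task.get("status") != "completed":
        raise ValueError("task is not completed")
    return task.get("video_url") or f"{base_url}/v1/videos/{task['id']}/content"
```

In real use, `fetch_status` would issue the authenticated GET request and return the decoded JSON body.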

Core parameters:

model: string: Required, public model name
prompt: string: Recommended, video generation description
seconds: number|string: Optional, target duration
duration: number|string: Optional, duration alias
size: string: Optional, output resolution
mode: string: Optional, common values: t2v, i2v, i2v_first_last, reference_material
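As an illustration of the core parameters, the sketch below assembles a request body and treats duration as an alias of seconds. The helper name `core_payload` is hypothetical. Always emitting `seconds`, and rejecting conflicting values, are assumptions made for this example; the API may resolve the two fields differently.

```python
from typing import Any, Dict, Optional, Union

Duration = Union[int, str]  # the docs allow number or string


def core_payload(
    model: str,
    prompt: Optional[str] = None,
    seconds: Optional[Duration] = None,
    duration: Optional[Duration] = None,
    size: Optional[str] = None,
    mode: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a request body from the core parameters, omitting unset fields."""
    if not model:
        raise ValueError("model is required")
    # Assumption: if both aliases are given they must agree.
    if seconds is not None and duration is not None and str(seconds) != str(duration):
        raise ValueError("seconds and duration disagree")
    target = seconds if seconds is not None else duration
    payload: Dict[str, Any] = {"model": model}
    for key, value in (("prompt", prompt), ("seconds", target), ("size", size), ("mode", mode)):
        if value is not None:
            payload[key] = value
    return payload
```

For example, `core_payload("sora-2", prompt="a paper boat on a pond", duration=8)` yields a body with `model`, `prompt`, and `seconds` only.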

Common compatibility parameters:

aspect_ratio
ratio
quality
resolution
fps
image
image_url
image_urls
images
reference_images
input_reference
end_image_url
last_image_url
video_urls
audio_urls
function_mode
content
callback_url
external_task_id

Kling common extended parameters:

model_name
negative_prompt
cfg_scale
sound
camera_control
image_list
video_id
task_id
watermark_info

4.1 Veo series

Note: stable, official, and similar labels indicate different groups, not external model suffixes; when calling, keep model as the base model name, and let the account or token group determine which group is actually used.

Veo 3.x main models

veo3
veo3-fast
veo3-fast-frames
veo3-frames
veo3-pro
veo3-pro-frames
veo3.1
veo3.1-fast
veo3.1-pro
veo3.1-components
veo3.1-4k
veo3.1-pro-4k

Recommendations:

For text-to-video only, prefer the veo3* / veo3.1* base models
For image-to-video or reference-to-video scenarios, it is recommended to explicitly pass mode + image_url/reference_images
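The second recommendation can be enforced on the client side. The sketch below is hypothetical (`veo_request` is not part of the API): it refuses to build a body that carries image_url or reference_images without an explicit mode. The mode value in the usage note is taken from the general mode list in this document; whether a given Veo model accepts it is an assumption.

```python
from typing import Any, Dict, Optional, Sequence


def veo_request(
    model: str,
    prompt: str,
    mode: Optional[str] = None,
    image_url: Optional[str] = None,
    reference_images: Optional[Sequence[str]] = None,
) -> Dict[str, Any]:
    """Build a Veo request body; require mode whenever reference inputs are present."""
    if (image_url or reference_images) and not mode:
        raise ValueError("pass mode explicitly when sending image_url or reference_images")
    payload: Dict[str, Any] = {"model": model, "prompt": prompt}
    if mode:
        payload["mode"] = mode
    if image_url:
        payload["image_url"] = image_url
    if reference_images:
        payload["reference_images"] = list(reference_images)
    return payload
```

A text-only call such as `veo_request("veo3.1", "a kite over dunes")` sends just `model` and `prompt`, while an image-to-video call would add `mode="i2v"` and `image_url`.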

4.2 Sora series

Base models:

sora-2
sora-2-pro

Recommendations:

sora-2* is suitable for general-purpose use
To use groups such as stable / official, keep model as the base model name and do not append any suffix to the model name.

4.3 Seedance series

Public models:

doubao-seedance-1-0-lite-t2v-250428
doubao-seedance-1-0-lite-i2v-250428
doubao-seedance-1-0-pro-250528
doubao-seedance-1-0-pro-fast-251015
doubao-seedance-1-5-pro-251215
doubao-seedance-2-0-260128
doubao-seedance-2-0-fast-260128

Recommendations:

For text-to-video, prefer *-t2v-* or pro / fast
For image-to-video, prefer *-i2v-*
For scenarios with complex reference materials, prefer the content parameter

4.4 Grok video series

Public models:

grok-imagine-1.0-video
grok-imagine-video-1.5-preview
grok-video-3

Common parameters:

prompt
ratio / aspect_ratio
resolution / size
seconds / duration
image / image_url / input_reference
reference_images

Example:

{
  "model": "grok-imagine-1.0-video",
  "prompt": "Cinematic push-in shot along a neon-lit street on a rainy night, rich lighting, natural motion",
  "reference_images": [
    "https://example.com/ref-1.jpg"
  ],
  "seconds": 10,
  "aspect_ratio": "16:9",
  "resolution": "720P"
}

4.5 Kling video main model

Public base model:

kling-video

Required:

model
model_name

Supported model_name:

kling-v1
kling-v1-5
kling-v1-6
kling-v2-master
kling-v2-1
kling-v2-1-master
kling-v2-5-turbo
kling-v2-6
kling-v3

Common modes:

t2v
i2v
multi_i2v
extend

Minimum input parameters:

Text-to-video: model + model_name + prompt + mode=t2v
Image-to-video: model + model_name + prompt + mode=i2v + image
Multi-image reference: model + model_name + prompt + mode=multi_i2v + image_list
Video extension: model + model_name + mode=extend + video_id

Example:

{
  "model": "kling-video",
  "model_name": "kling-v2-6",
  "mode": "t2v",
  "prompt": "Seaside sunset shot, cinematic, long hair blowing in the wind",
  "duration": 5,
  "aspect_ratio": "16:9"
}
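The Kling rules above (model is always kling-video, model_name selects the version, and each mode has its own minimum fields) can be captured in a small builder. This is a sketch under stated assumptions: `kling_request` is a hypothetical helper, and the checks only mirror the lists in this section, not server-side validation.

```python
from typing import Any, Dict

KLING_MODEL_NAMES = {
    "kling-v1", "kling-v1-5", "kling-v1-6", "kling-v2-master", "kling-v2-1",
    "kling-v2-1-master", "kling-v2-5-turbo", "kling-v2-6", "kling-v3",
}

# Fields required per mode, in addition to model, model_name, and mode.
KLING_MODE_FIELDS = {
    "t2v": ("prompt",),
    "i2v": ("prompt", "image"),
    "multi_i2v": ("prompt", "image_list"),
    "extend": ("video_id",),
}


def kling_request(model_name: str, mode: str, **fields: Any) -> Dict[str, Any]:
    """Build a Kling request body and check the minimum input for the chosen mode."""
    if model_name not in KLING_MODEL_NAMES:
        raise ValueError(f"unsupported model_name: {model_name}")
    if mode not in KLING_MODE_FIELDS:
        raise ValueError(f"unsupported mode: {mode}")
    missing = [name for name in KLING_MODE_FIELDS[mode] if not fields.get(name)]
    if missing:
        raise ValueError(f"mode={mode} is missing: {', '.join(missing)}")
    return {"model": "kling-video", "model_name": model_name, "mode": mode, **fields}
```

Calling `kling_request("kling-v2-6", "t2v", prompt="...", duration=5, aspect_ratio="16:9")` reproduces the shape of the example above.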

Minimum input by mode:

mode=t2v: model + prompt
mode=i2v: model + prompt + image_url
mode=i2v_first_last: model + prompt + image_url + end_image_url
mode=reference_images: model + prompt + reference_images
mode=reference_material: model + prompt + at least one of image_urls / video_urls / audio_urls
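A client can check these minimum inputs before sending a request. The sketch below encodes the mode table as data, including the "at least one of" rule for reference_material. The function name `check_minimum_input` and the wording of the messages are hypothetical, and the server may apply additional rules.

```python
from typing import Any, Dict, List

REQUIRED_BY_MODE = {
    "t2v": ("model", "prompt"),
    "i2v": ("model", "prompt", "image_url"),
    "i2v_first_last": ("model", "prompt", "image_url", "end_image_url"),
    "reference_images": ("model", "prompt", "reference_images"),
    "reference_material": ("model", "prompt"),
}

# Modes that additionally need at least one field from a group.
ANY_OF_BY_MODE = {
    "reference_material": ("image_urls", "video_urls", "audio_urls"),
}


def check_minimum_input(payload: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the minimum input is present."""
    mode = payload.get("mode")
    if mode not in REQUIRED_BY_MODE:
        return [f"unknown mode: {mode!r}"]
    problems = [f"missing {name}" for name in REQUIRED_BY_MODE[mode] if not payload.get(name)]
    any_of = ANY_OF_BY_MODE.get(mode, ())
    if any_of and not any(payload.get(name) for name in any_of):
        problems.append("need at least one of: " + ", ".join(any_of))
    return problems
```

Returning a list rather than raising lets a caller report every missing field at once.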

Gemini Omni usage instructions

gemini-omni is the publicly exposed video model name of new-api, and can be called directly via POST /v1/videos.
mode=t2v: text-to-video, with the minimum input parameter being model + prompt.
mode=r2v: reference image/reference material generation; images can be placed in image, image_url, images, image_urls, reference_images, input_reference, or content.
mode=edit: video editing; videos can be placed in video, video_url, videos, or content, and reference images can also be provided at the same time.
The duration field can use seconds or duration, and will be automatically mapped to the 4 / 6 / 8 / 10 second options.
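The document does not say how a requested duration is mapped to the 4 / 6 / 8 / 10 second options. The sketch below assumes nearest-value mapping, with ties going to the shorter option, purely so a caller can estimate what to expect. `nearest_duration` is a hypothetical client-side helper, and the server may round differently.

```python
from typing import Union

GEMINI_OMNI_DURATIONS = (4, 6, 8, 10)  # options listed for gemini-omni


def nearest_duration(requested: Union[int, float, str]) -> int:
    """Pick the closest allowed duration; ties resolve to the shorter option (assumption)."""
    value = float(requested)  # seconds/duration may arrive as a string such as "8"
    return min(GEMINI_OMNI_DURATIONS, key=lambda option: (abs(option - value), option))
```

Under this assumption a request for 7.5 seconds would map to 8, and anything above 10 would map to 10.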


Authorizations

Authorization: string (header): Required. Uses Bearer token authentication. Format: Authorization: Bearer sk-xxxxxx
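To show where the Bearer token goes, the sketch below builds (but does not send) a create request with Python's standard `urllib.request`. The helper name `build_create_request` is hypothetical, the base URL is taken from the curl sample in this document, and a JSON body is used here on the assumption that the endpoint accepts application/json as well as multipart form data.

```python
import json
import urllib.request
from typing import Any, Dict


def build_create_request(
    api_key: str,
    payload: Dict[str, Any],
    base_url: str = "https://api.aiid.edu.kg",
) -> urllib.request.Request:
    """Prepare POST /v1/videos with the Authorization header and a JSON body."""
    return urllib.request.Request(
        url=f"{base_url}/v1/videos",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; keeping construction separate makes the headers easy to inspect first.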

Body

model: enum<string>: Required, external model name. gemini-omni can be called via /v1/videos. Example: "sora-2"

Available options:

sora-2
sora-2-pro
gemini-omni
happyhorse-1.0
happyhorse-1.0-i2v
happyhorse-1.0-t2v
happyhorse-1.0-r2v
happyhorse-1.0-video-edit
doubao-seedance-1-0-lite-i2v-250428
doubao-seedance-1-0-lite-t2v-250428
doubao-seedance-1-0-pro-250528
doubao-seedance-1-0-pro-fast-251015
doubao-seedance-1-5-pro-251215
doubao-seedance-2-0-260128
doubao-seedance-2-0-fast-260128
veo3
veo3-fast
veo3-fast-frames
veo3-frames
veo3-pro
veo3-pro-frames
veo3.1
veo3.1-4k
veo3.1-components
veo3.1-fast
veo3.1-pro
veo3.1-pro-4k
kling-video
grok-imagine-1.0-video
grok-imagine-video-1.5-preview
grok-video-3
viduq3-turbo
hailuo-video

prompt: string: Required, unified prompt entry point. Required for most models.
image: file: Optional, single image entry point; mapped to images in certain compatibility modes.
duration: integer: Optional, unified duration parameter. Some models also accept seconds.
width: integer: Optional, video width. Example: 512
height: integer: Optional, video height. Example: 512
fps: integer: Optional, video frame rate. Example: 30
seed: integer: Optional, random seed. Example: 20231234
n: integer: Optional, number of videos to generate. Example: 1
response_format: string: Optional, response format. Example: "url"
user: string: Optional, user identifier. Example: "user-1234"
metadata: object: Optional, dynamic extension field container. Numerous model-specific fields are deserialized from here.
mode: string: Optional, video generation mode. gemini-omni supports t2v, r2v, and edit.
images: string[]: Optional, unified multi-image entry point. Used by Vidu, Seedance, Veo, etc.
content: object[]: Optional
size: string: Optional, unified size input, which maps to resolution/aspect_ratio and related fields.
seconds: string: Optional, Sora compatibility entry point; at runtime, it falls back to duration.
input_reference: string: Optional, Sora/Veo compatible reference image input; can be a multipart file or a compatible object.
parameters: object: Optional

Response

Video task created successfully

id: string: Required
object: string: Required
model: string: Required
status: enum<string>: Required. Should use VideoStatus constants: VideoStatusQueued, VideoStatusInProgress, VideoStatusCompleted, VideoStatusFailed. Available options: queued, in_progress, completed, failed, video_url, url, completed_at
progress: integer: Required
created_at: integer: Required
task_id: string: Optional, legacy compatibility; to be deprecated
completed_at: integer: Optional
expires_at: integer: Optional
seconds: string: Optional
size: string: Optional
remixed_from_video_id: string: Optional
error: object: Optional
metadata: object: Optional
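The response fields can be read into a small typed structure. The sketch below is one possible shape, not a published schema: `VideoTask` and `parse_video_task` are hypothetical names, only a subset of the optional fields is modeled, and unknown keys are ignored.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict, Optional

REQUIRED_FIELDS = ("id", "object", "model", "status", "progress", "created_at")


@dataclass
class VideoTask:
    id: str
    object: str
    model: str
    status: str
    progress: int
    created_at: int
    seconds: Optional[str] = None
    size: Optional[str] = None
    completed_at: Optional[int] = None
    expires_at: Optional[int] = None
    error: Optional[Dict[str, Any]] = None


def parse_video_task(body: Dict[str, Any]) -> VideoTask:
    """Validate the required fields and copy the modeled ones into a VideoTask."""
    missing = [name for name in REQUIRED_FIELDS if name not in body]
    if missing:
        raise ValueError(f"response is missing required fields: {', '.join(missing)}")
    known = {f.name: body[f.name] for f in fields(VideoTask) if f.name in body}
    return VideoTask(**known)
```

Parsing the sample response from this page gives a task with `status == "queued"` and `seconds == "8"`.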
