Create video

curl --request POST \
  --url https://api.aiid.edu.kg/v1/videos \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form model=sora-2 \
  --form 'prompt=<string>' \
  --form image='@example-file' \
  --form duration=123 \
  --form width=512 \
  --form height=512 \
  --form fps=30 \
  --form seed=20231234 \
  --form n=1 \
  --form response_format=url \
  --form user=user-1234 \
  --form 'metadata={}' \
  --form 'mode=<string>' \
  --form 'images=<string>' \
  --form 'content={}' \
  --form 'size=<string>' \
  --form 'seconds=<string>' \
  --form 'input_reference=<string>' \
  --form 'parameters={}'

{
  "id": "video_abc123",
  "object": "video",
  "model": "sora-2",
  "status": "queued",
  "progress": 0,
  "created_at": 1764347090922,
  "seconds": "8"
}

Sora-compatible video generation format

Create video

An OpenAI Sora-compatible endpoint for video generation. It supports text-to-video generation as well as generation from reference images or videos.

POST /v1/videos: Create a video task
GET /v1/videos/{video_id}: Query task status
GET /v1/videos/{video_id}/content: Retrieve the video's binary content

Not currently exposed externally:

GET /v1/videos
POST /v1/videos/{video_id}/remix

Standard request headers:

Authorization: Bearer sk-xxxxxx
Content-Type: application/json

POST /v1/videos to create a task and obtain its id
Poll GET /v1/videos/{id} until the status is completed or failed
Once the task has completed, prefer the video_url in the returned result
If you need a uniform download path, call GET /v1/videos/{id}/content
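
The workflow above can be sketched as a small polling helper. This is a minimal sketch, not the service's official client: `wait_for_video`, `result_location`, the polling interval, and the attempt limit are hypothetical, and it assumes `video_url` appears at the top level of the task object. The HTTP call is injected as `fetch_status` so the loop stays independent of any particular client library.

```python
import time
from typing import Any, Callable, Dict

# Statuses at which the workflow above says to stop polling.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_video(
    video_id: str,
    fetch_status: Callable[[str], Dict[str, Any]],
    interval_s: float = 5.0,      # hypothetical default, not documented
    max_attempts: int = 120,      # hypothetical default, not documented
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll until the task is completed or failed and return the last task object."""
    for _ in range(max_attempts):
        task = fetch_status(video_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        sleep(interval_s)
    raise TimeoutError(f"video {video_id} did not finish after {max_attempts} polls")


def result_location(task: Dict[str, Any], base_url: str) -> str:
    """Prefer video_url; otherwise fall back to the /content endpoint."""
    if task.get("status") != "completed":
        raise RuntimeError(f"task ended with status {task.get('status')!r}")
    # ASSUMPTION: video_url is a top-level key of the task object.
    return task.get("video_url") or f"{base_url}/v1/videos/{task['id']}/content"
```

In real use, `fetch_status` would wrap GET /v1/videos/{id} with the Authorization header and return the decoded JSON body.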

Core parameters:

model: string: Required, the public model name
prompt: string: Recommended, the description used to generate the video
seconds: number|string: Optional, target duration
duration: number|string: Optional, alias for the duration
size: string: Optional, output size
mode: string: Optional, common values: t2v, i2v, i2v_first_last, reference_material
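
As an illustration of how the core parameters fit into a JSON request, the sketch below builds (but does not send) a POST /v1/videos request with Python's standard library. The helper name `build_create_request` is hypothetical; the host comes from the curl example, and the sketch assumes the endpoint accepts a JSON body as the Content-Type header above suggests.

```python
import json
import urllib.request
from typing import Any, Dict, Optional

BASE_URL = "https://api.aiid.edu.kg"  # host taken from the curl example


def build_create_request(
    token: str,
    model: str,
    prompt: str,
    seconds: Optional[int] = None,
    size: Optional[str] = None,
    mode: Optional[str] = None,
) -> urllib.request.Request:
    """Build a JSON POST /v1/videos request without sending it."""
    payload: Dict[str, Any] = {"model": model, "prompt": prompt}
    # Optional fields are omitted entirely when the caller does not set them.
    for key, value in (("seconds", seconds), ("size", size), ("mode", mode)):
        if value is not None:
            payload[key] = value
    return urllib.request.Request(
        f"{BASE_URL}/v1/videos",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit the task; the `id` in the JSON response is what the status endpoint expects.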

Commonly used compatibility parameters:

aspect_ratio
ratio
quality
resolution
fps
image
image_url
image_urls
images
reference_images
input_reference
end_image_url
last_image_url
video_urls
audio_urls
function_mode
content
callback_url
external_task_id

Commonly used Kling extension parameters:

model_name
negative_prompt
cfg_scale
sound
camera_control
image_list
video_id
task_id
watermark_info

4.1 Veo series

Note: stable, official, and similar labels denote different groups, not externally visible model suffixes. When calling, keep model set to the base model name; the account group or token determines which group the request is actually routed through.

Veo 3.x main models

veo3
veo3-fast
veo3-fast-frames
veo3-frames
veo3-pro
veo3-pro-frames
veo3.1
veo3.1-fast
veo3.1-pro
veo3.1-components
veo3.1-4k
veo3.1-pro-4k

Recommendations:

For pure text-to-video, prefer the main veo3* / veo3.1* models
For image-to-video or reference-based generation, pass mode explicitly together with image_url or reference_images
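
A minimal sketch of the second recommendation: a helper that always states mode explicitly in a Veo request body. The function name `veo_body` is hypothetical, the mode values are borrowed from the per-mode parameter list later on this page, and rejecting both inputs at once is an assumption of this sketch rather than a documented rule.

```python
from typing import Any, Dict, List, Optional


def veo_body(
    model: str,
    prompt: str,
    image_url: Optional[str] = None,
    reference_images: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Build a Veo request body with mode set explicitly."""
    # ASSUMPTION: the two image inputs are treated as mutually exclusive here.
    if image_url and reference_images:
        raise ValueError("pass either image_url or reference_images, not both")
    body: Dict[str, Any] = {"model": model, "prompt": prompt}
    if image_url:
        body.update(mode="i2v", image_url=image_url)
    elif reference_images:
        body.update(mode="reference_images", reference_images=list(reference_images))
    else:
        body["mode"] = "t2v"
    return body
```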

4.2 Sora series

Base models:

sora-2
sora-2-pro

Recommendations:

sora-2* is suited to general-purpose calls
To use a group such as stable or official, keep model set to the base model name and do not append a suffix to it.

4.3 Seedance series

Public models:

doubao-seedance-1-0-lite-t2v-250428
doubao-seedance-1-0-lite-i2v-250428
doubao-seedance-1-0-pro-250528
doubao-seedance-1-0-pro-fast-251015
doubao-seedance-1-5-pro-251215
doubao-seedance-2-0-260128
doubao-seedance-2-0-fast-260128

Recommendations:

For text-to-video, prefer *-t2v-* or the pro / fast models
For image-to-video, prefer *-i2v-*
For scenarios with complex reference material, prefer content

4.4 Grok video series

Public models:

grok-imagine-1.0-video
grok-imagine-video-1.5-preview
grok-video-3

Common parameters:

prompt
ratio / aspect_ratio
resolution / size
seconds / duration
image / image_url / input_reference
reference_images

Example:

{
  "model": "grok-imagine-1.0-video",
  "prompt": "Cinematic push-in shot on a neon-lit street on a rainy night, rich light and shadow, natural motion",
  "reference_images": [
    "https://example.com/ref-1.jpg"
  ],
  "seconds": 10,
  "aspect_ratio": "16:9",
  "resolution": "720P"
}

4.5 Kling video main model

Public main model:

kling-video

Required:

model
model_name

Supported model_name values:

kling-v1
kling-v1-5
kling-v1-6
kling-v2-master
kling-v2-1
kling-v2-1-master
kling-v2-5-turbo
kling-v2-6
kling-v3

Common modes:

t2v
i2v
multi_i2v
extend

Minimum input parameters:

Text-to-video: model + model_name + prompt + mode=t2v
Image-to-video: model + model_name + prompt + mode=i2v + image
Multi-image reference: model + model_name + prompt + mode=multi_i2v + image_list
Video extension: model + model_name + mode=extend + video_id

Example:

{
  "model": "kling-video",
  "model_name": "kling-v2-6",
  "mode": "t2v",
  "prompt": "Seaside sunset shot, cinematic, long hair blowing in the wind",
  "duration": 5,
  "aspect_ratio": "16:9"
}

mode=t2v
Minimum input parameters: model + prompt
mode=i2v
Minimum input parameters: model + prompt + image_url
mode=i2v_first_last
Minimum input parameters: model + prompt + image_url + end_image_url
mode=reference_images
Minimum input parameters: model + prompt + reference_images
mode=reference_material
Minimum input parameters: model + prompt + (at least one of image_urls/video_urls/audio_urls)
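
The per-mode minimums above can be checked on the client before a request is sent. The sketch below encodes them as a table in which each entry is a group of alternatives, so that the "at least one of" rule for reference_material fits the same shape. The names `REQUIRED_BY_MODE` and `unmet_requirements` are hypothetical, and the server may apply additional checks not covered here.

```python
from typing import Any, Dict, List, Tuple

# Each tuple is a group of alternatives; at least one key per group must be set.
REQUIRED_BY_MODE: Dict[str, List[Tuple[str, ...]]] = {
    "t2v": [("model",), ("prompt",)],
    "i2v": [("model",), ("prompt",), ("image_url",)],
    "i2v_first_last": [("model",), ("prompt",), ("image_url",), ("end_image_url",)],
    "reference_images": [("model",), ("prompt",), ("reference_images",)],
    "reference_material": [
        ("model",),
        ("prompt",),
        ("image_urls", "video_urls", "audio_urls"),
    ],
}


def unmet_requirements(body: Dict[str, Any]) -> List[str]:
    """Return the requirement groups that the request body does not satisfy."""
    mode = body.get("mode")
    if mode not in REQUIRED_BY_MODE:
        raise ValueError(f"unknown mode: {mode!r}")
    return [
        " | ".join(group)
        for group in REQUIRED_BY_MODE[mode]
        if not any(body.get(key) for key in group)
    ]
```

An empty list means the body meets the documented minimum for its mode.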

Gemini Omni calling guide

gemini-omni is the public video model name in new-api and can be called directly through POST /v1/videos.
mode=t2v: text-to-video; the minimum input parameters are model + prompt.
mode=r2v: generation from reference images or material; images can be supplied in image, image_url, images, image_urls, reference_images, input_reference, or content.
mode=edit: video editing; the video can be supplied in video, video_url, videos, or content, and reference images can be passed as well.
The duration field can be either seconds or duration; the system automatically matches it to one of the 4 / 6 / 8 / 10 second tiers.
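
The page does not say how a requested duration is matched to a tier. The sketch below shows one plausible client-side approximation, nearest tier with ties going to the shorter one, purely so callers can anticipate the result. The rounding rule and the name `snap_duration` are assumptions, not documented server behaviour.

```python
from typing import Union

ALLOWED_SECONDS = (4, 6, 8, 10)  # tiers listed for gemini-omni


def snap_duration(value: Union[int, float, str]) -> int:
    """Map a seconds/duration value to the nearest allowed tier.

    ASSUMPTION: nearest-tier matching, ties resolved toward the shorter tier.
    """
    requested = float(value)  # accepts numbers and numeric strings such as "8"
    return min(ALLOWED_SECONDS, key=lambda tier: (abs(tier - requested), tier))
```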

Authorization

Authorization: string: header, required. Uses Bearer Token authentication. Format: Authorization: Bearer sk-xxxxxx

Body

model: enum<string>: Required, the public model name. gemini-omni can be called through /v1/videos.

Available options:

sora-2
sora-2-pro
gemini-omni
happyhorse-1.0
happyhorse-1.0-i2v
happyhorse-1.0-t2v
happyhorse-1.0-r2v
happyhorse-1.0-video-edit
doubao-seedance-1-0-lite-i2v-250428
doubao-seedance-1-0-lite-t2v-250428
doubao-seedance-1-0-pro-250528
doubao-seedance-1-0-pro-fast-251015
doubao-seedance-1-5-pro-251215
doubao-seedance-2-0-260128
doubao-seedance-2-0-fast-260128
veo3
veo3-fast
veo3-fast-frames
veo3-frames
veo3-pro
veo3-pro-frames
veo3.1
veo3.1-4k
veo3.1-components
veo3.1-fast
veo3.1-pro
veo3.1-pro-4k
kling-video
grok-imagine-1.0-video
grok-imagine-video-1.5-preview
grok-video-3
viduq3-turbo
hailuo-video

Example: "sora-2"

prompt: string: Required. Unified prompt entry point; required by most models.
image: file: Entry point for a single image; some compatibility modes map it to images.
duration: integer: Unified duration input; some models also accept seconds.

width: integer: Video width. Example: 512
height: integer: Video height. Example: 512
fps: integer: Video frame rate. Example: 30
seed: integer: Random seed. Example: 20231234
n: integer: Number of videos to generate. Example: 1
response_format: string: Response format. Example: "url"
user: string: User identifier. Example: "user-1234"

metadata: object: Container for dynamic extension fields. Many model-specific fields are deserialized from it.
mode: string: Optional, the video generation mode. gemini-omni supports t2v, r2v, and edit.
images: string[]: Unified entry point for multiple images. Used by Vidu, Seedance, Veo, and others.
content: object[]
size: string: Unified size entry point; mapped to resolution, aspect_ratio, and similar fields.
seconds: string: Sora-compatible entry point; resolved to duration at runtime.
input_reference: string: Sora/Veo-compatible reference image entry point; can be a multipart file or a compatible object.
parameters: object

Response

Video task created successfully

id: string: required
object: string: required
model: string: required
status: enum<string>: required. Should use VideoStatus constants: VideoStatusQueued, VideoStatusInProgress, VideoStatusCompleted, VideoStatusFailed
Available options: queued, in_progress, completed, failed, video_url, url, completed_at

progress: integer: required
created_at: integer: required
task_id: string: Kept for compatibility with the legacy interface (to be deprecated)
completed_at: integer
expires_at: integer
seconds: string
size: string
remixed_from_video_id: string
error: object
metadata: object
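
To show how the response fields might be consumed, here is a minimal sketch that loads the task object into a typed structure. The class `VideoTask` and its helpers are hypothetical. The 13-digit created_at in the sample response looks like a millisecond timestamp, but the unit is not documented, so the conversion below is an assumption.

```python
from dataclasses import dataclass, fields
from datetime import datetime, timezone
from typing import Any, Dict, Optional


@dataclass
class VideoTask:
    # Required response fields
    id: str
    object: str
    model: str
    status: str
    progress: int
    created_at: int
    # A subset of the optional response fields
    seconds: Optional[str] = None
    size: Optional[str] = None
    completed_at: Optional[int] = None
    error: Optional[Dict[str, Any]] = None

    @classmethod
    def from_json(cls, data: Dict[str, Any]) -> "VideoTask":
        """Keep only the keys this sketch models; ignore everything else."""
        known = {f.name for f in fields(cls)}
        return cls(**{k: v for k, v in data.items() if k in known})

    @property
    def is_terminal(self) -> bool:
        return self.status in ("completed", "failed")

    def created_datetime(self) -> datetime:
        # ASSUMPTION: values above 1e11 are milliseconds, otherwise seconds.
        ts = self.created_at / 1000 if self.created_at > 10**11 else self.created_at
        return datetime.fromtimestamp(ts, tz=timezone.utc)
```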
