Model and Capability Matrix

Model Grouping

Nano Banana

Primarily used for image generation and editing; integration through the Gemini native format is also supported.

| Model | Primary Integration Method |
|---|---|
| nano-banana | /v1/images/generations, /v1/images/edits, /v1beta/... |
| nano-banana-2 | /v1/images/generations, /v1/images/edits, /v1beta/... |
| nano-banana-2-2k | /v1/images/generations, /v1/images/edits, /v1beta/... |
| nano-banana-2-4k | /v1/images/generations, /v1/images/edits, /v1beta/... |
| nano-banana-pro | /v1/images/generations, /v1/images/edits, /v1beta/... |
| nano-banana-pro-2k | /v1/images/generations, /v1/images/edits, /v1beta/... |
| nano-banana-pro-4k | /v1/images/generations, /v1/images/edits, /v1beta/... |

Sora

Prefer the OpenAI-compatible /v1/videos format, which suits unified video integration.

| Model | Primary Integration Method |
|---|---|
| sora-2 | /v1/videos |
| sora-2-pro | /v1/videos |
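
As a rough illustration of the unified video route, the sketch below builds (but does not send) a POST request to /v1/videos for sora-2. The base URL, the API key, the JSON encoding, and the `prompt` and `seconds` fields are assumptions borrowed from the OpenAI video format and are not confirmed by this page; check the gateway's own reference before relying on them.

```python
import json
import urllib.request

# Hypothetical gateway address and key; replace with real values.
BASE_URL = "https://api.example.com"
API_KEY = "sk-placeholder"


def build_video_request(model: str, prompt: str, seconds: str = "4") -> urllib.request.Request:
    """Build a POST request for the unified /v1/videos route without sending it."""
    # Field names other than "model" are assumed, not taken from this page.
    body = {"model": model, "prompt": prompt, "seconds": seconds}
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/videos",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_video_request("sora-2", "A paper boat drifting down a rainy street")
print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would actually submit the job; the sketch stops short of that so it can be inspected safely.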

Gemini Omni

gemini-omni is exposed under a public model name by new-api and can be invoked via /v1/videos or /api/v3/contents/generations/tasks. Set mode to t2v, r2v, or edit to select text-to-video, reference-based generation, or video editing. The requested duration is automatically mapped to one of the 4, 6, 8, or 10-second tiers.

| Model | Primary Integration Method |
|---|---|
| gemini-omni | /v1/videos, /api/v3/contents/generations/tasks |
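
The mode values and duration tiers described above can be checked on the client before a request is sent. The following is a minimal sketch: the three modes and four tiers come from this page, but the nearest-tier rule (ties going to the shorter tier) and the `duration` field name are assumptions made for illustration, since the exact server-side mapping is not documented here.

```python
# Mode values and duration tiers are taken from the description above.
MODES = {
    "t2v": "text-to-video",
    "r2v": "reference-based generation",
    "edit": "video editing",
}
DURATION_TIERS = (4, 6, 8, 10)


def nearest_tier(seconds: float) -> int:
    """Pick the closest tier; ties go to the shorter one (an assumed rule)."""
    return min(DURATION_TIERS, key=lambda tier: (abs(tier - seconds), tier))


def gemini_omni_params(mode: str, seconds: float) -> dict:
    """Validate the mode and preview the tier a requested duration might land on."""
    if mode not in MODES:
        raise ValueError(f"mode must be one of {sorted(MODES)}, got {mode!r}")
    # "duration" is a hypothetical field name used only for this sketch.
    return {"model": "gemini-omni", "mode": mode, "duration": nearest_tier(seconds)}


print(gemini_omni_params("t2v", 7))
```

Because the service performs its own mapping, a helper like this is only useful for previewing or validating input; the tier actually applied is whatever the server returns.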

HappyHorse

Supports the official Alibaba DashScope video generation format, and also supports compatible integration via /v1/videos.

| Model | Primary Integration Method |
|---|---|
| happyhorse-1.0 | /api/v1/services/aigc/video-generation/video-synthesis, /v1/videos |
| happyhorse-1.0-i2v | /api/v1/services/aigc/video-generation/video-synthesis, /v1/videos |
| happyhorse-1.0-t2v | /api/v1/services/aigc/video-generation/video-synthesis, /v1/videos |
| happyhorse-1.0-r2v | /api/v1/services/aigc/video-generation/video-synthesis, /v1/videos |
| happyhorse-1.0-video-edit | /api/v1/services/aigc/video-generation/video-synthesis, /v1/videos |

Veo

For the main Veo models, prefer the /v1/videos compatible format; the /v1beta/... entry is kept as a reference for the Veo native protocol. gemini-omni does not use the Google native path; use /v1/videos or the Seedance task format instead.

| Model | Primary Integration Method |
|---|---|
| veo3 | /v1/videos, /v1beta/... |
| veo3-fast | /v1/videos, /v1beta/... |
| veo3-fast-frames | /v1/videos, /v1beta/... |
| veo3-frames | /v1/videos, /v1beta/... |
| veo3-pro | /v1/videos, /v1beta/... |
| veo3-pro-frames | /v1/videos, /v1beta/... |
| veo3.1 | /v1/videos, /v1beta/... |
| veo3.1-4k | /v1/videos, /v1beta/... |
| veo3.1-components | /v1/videos, /v1beta/... |
| veo3.1-fast | /v1/videos, /v1beta/... |
| veo3.1-pro | /v1/videos, /v1beta/... |
| veo3.1-pro-4k | /v1/videos, /v1beta/... |

Grok

The currently released models are all video generation models; integrating them through the unified video interface is recommended.

| Model | Primary Integration Method |
|---|---|
| grok-imagine-1.0-video | /v1/videos |
| grok-imagine-video-1.5-preview | /v1/videos |
| grok-video-3 | /v1/videos |

Seedance

Supports the native /api/v3/contents/generations/tasks format, and also supports compatible integration via /v1/videos. gemini-omni can reuse this task format, with mode set to t2v, r2v, or edit to select the capability.

| Model | Primary Integration Method |
|---|---|
| doubao-seedance-1-0-lite-i2v-250428 | /api/v3/contents/generations/tasks, /v1/videos |
| doubao-seedance-1-0-lite-t2v-250428 | /api/v3/contents/generations/tasks, /v1/videos |
| doubao-seedance-1-0-pro-250528 | /api/v3/contents/generations/tasks, /v1/videos |
| doubao-seedance-1-0-pro-fast-251015 | /api/v3/contents/generations/tasks, /v1/videos |
| doubao-seedance-1-5-pro-251215 | /api/v3/contents/generations/tasks, /v1/videos |
| doubao-seedance-2-0-260128 | /api/v3/contents/generations/tasks, /v1/videos |
| doubao-seedance-2-0-fast-260128 | /api/v3/contents/generations/tasks, /v1/videos |

Kling Images

Kling image generation, Omni images, and virtual try-on are all exposed via /v1/images/generations.

| Model | Primary Integration Method |
|---|---|
| kling-image | /v1/images/generations |
| kling-omni-image | /v1/images/generations |
| kolors-virtual-try-on-v1 | /v1/images/generations |
| kolors-virtual-try-on-v1-5 | /v1/images/generations |

Kling Videos

Both the model-specific interface and the unified video entry point are available.

| Model | Primary Integration Method |
|---|---|
| kling-video | /kling/v1/..., /v1/videos |
| kling-* | /kling/v1/..., /v1/videos |
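
Because the `kling-*` wildcard would also match the Kling image models listed earlier, a client-side lookup probably needs to check exact names before patterns. The sketch below shows one way to do that with the standard library; the `resolve_endpoint` helper and the sample name `kling-example-model` are hypothetical, and only the unified routes from this page are used.

```python
from fnmatch import fnmatchcase
from typing import Optional

# Exact names are checked first so that image models such as kling-image
# are not swallowed by the broader kling-* video pattern.
EXACT_ROUTES = {
    "kling-image": "/v1/images/generations",
    "kling-omni-image": "/v1/images/generations",
    "kling-video": "/v1/videos",
}
PATTERN_ROUTES = [
    ("kling-*", "/v1/videos"),
]


def resolve_endpoint(model: str) -> Optional[str]:
    """Return the unified endpoint for a Kling model name, or None if unknown."""
    if model in EXACT_ROUTES:
        return EXACT_ROUTES[model]
    for pattern, endpoint in PATTERN_ROUTES:
        if fnmatchcase(model, pattern):
            return endpoint
    return None


# "kling-example-model" is a made-up name used only to exercise the wildcard.
for name in ("kling-image", "kling-video", "kling-example-model"):
    print(name, "->", resolve_endpoint(name))
```

`fnmatchcase` is used instead of `fnmatch` so that matching stays case-sensitive on every platform.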

Hailuo

Integrated through the unified video interface; suitable for text-to-video scenarios.

| Model | Primary Integration Method |
|---|---|
| hailuo* | /v1/videos |

Vidu

Supports the official Vidu video generation format, and also supports compatible integration via /v1/videos. Suitable for text-to-video, single-image, first-and-last-frame, and multi-reference-image scenarios.

| Model | Primary Integration Method |
|---|---|
| viduq3-turbo | /ent/v2/..., /v1/videos |
| vidu* | /ent/v2/..., /v1/videos |

For pricing information, see the Pricing page.
