Create Video

curl --request POST \
  --url https://octopusx.ai/v1/video/create \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "<string>",
  "images": [
    {}
  ],
  "prompt": "<string>",
  "aspect_ratio": "<string>",
  "audio": true,
  "voice_id": "<string>",
  "is_rec": true,
  "bgm": true,
  "duration": 123,
  "seed": 123,
  "resolution": "<string>",
  "payload": "<string>",
  "off_peak": true,
  "watermark": true,
  "wm_position": 123,
  "wm_url": "<string>",
  "meta_data": "<string>"
}
'

{
  "created_at": "2025-11-08T23:07:57.510141923+08:00",
  "status": "processing",
  "task_id": "48038932-0ff5-4251-8b4b-7a76c09fd114"
}

POST

https://octopusx.ai

video

create

Create Video

curl --request POST \
  --url https://octopusx.ai/v1/video/create \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "<string>",
  "images": [
    {}
  ],
  "prompt": "<string>",
  "aspect_ratio": "<string>",
  "audio": true,
  "voice_id": "<string>",
  "is_rec": true,
  "bgm": true,
  "duration": 123,
  "seed": 123,
  "resolution": "<string>",
  "payload": "<string>",
  "off_peak": true,
  "watermark": true,
  "wm_position": 123,
  "wm_url": "<string>",
  "meta_data": "<string>"
}
'

{
  "created_at": "2025-11-08T23:07:57.510141923+08:00",
  "status": "processing",
  "task_id": "48038932-0ff5-4251-8b4b-7a76c09fd114"
}

Create Video

The Vidu unified video entry uses POST /v1/video/create, and the request body is JSON. Unlike the official Vidu format, this endpoint uses fields such as images, aspect_ratio, resolution, and duration.

The route endpoint is POST /v1/video/create.
Reference images are passed as a URL list through the images array; text-to-video can pass an empty array.
A common model example is viduq3-pro; use the actually available model for the current channel as the reference.
After a successful submission, the task id and status are returned, and you can later use Query Task to poll for results.

Supported Models

viduq3-pro: Efficiently generates high-quality audio and video content, making video content more vivid, more lifelike, and more three-dimensional
viduq2-pro-fast: Lower price floor, stable results, generation speed 2-3 times faster than viduq2-turbo
viduq2-pro: New model, good results, rich details
viduq2-turbo: New model, good results, fast generation
viduq1: Clear visuals, smooth transitions, stable camera movement
viduq1-classic: Clear visuals, richer transitions and camera movement
vidu2.0: Fast generation speed

Method and Path

POST /v1/video/create

Request Example

curl -X POST https://octopusx.ai/v1/video/create \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "viduq3-pro",
    "prompt": "一只猫在草地上追逐蝴蝶",
    "aspect_ratio": "16:9",
    "duration": 5,
    "audio": false,
    "images": []
  }'

Response Example

{
  "created_at": "2025-11-08T23:07:57.510141923+08:00",
  "status": "processing",
  "task_id": "48038932-0ff5-4251-8b4b-7a76c09fd114"
}

Authentication

Authorization: Bearer YOUR_API_KEY

Body

model

string

required

Model name. Supports viduq3-pro, viduq2-pro-fast, viduq2-pro, viduq2-turbo, viduq1, viduq1-classic, and vidu2.0.

images

array

required

Image array. Supports passing image Base64-encoded data or image URLs (ensure they are accessible). For text-to-video, pass an empty array []; for image-to-video, pass 1 image; for first-and-last-frame video generation, pass 2 images. Supported image formats are png, jpeg, jpg, and webp. The aspect ratio must be less than 1:4 or greater than 4:1, and the size must not exceed 50 MB.

prompt

string

Text prompt. The text description for the generated video, with a maximum length of 2000 characters. If you use the is_rec recommended prompt parameter, the model will ignore the prompt input in this parameter.

aspect_ratio

string

Aspect ratio. Default is 16:9. Optional values: 16:9, 9:16, 3:4, 4:3, 1:1. Note: 3:4 and 4:3 are supported only by q2 and q3 models.

audio

boolean

Whether to use direct audio-video output capability. Default is false. false: audio-video direct output is not needed, and a silent video is output; true: audio-video direct output is needed, and a video with dialogue and background sound is output.

voice_id

string

Voice ID. Does not take effect for q3 models. Used to determine the voice timbre in the video; if empty, the system will recommend one automatically.

is_rec

boolean

Whether to use recommended prompts. true: yes, the system automatically recommends prompts and generates the video using the prompt content, with the number of recommended prompts = 1; false: no, generate the video based on the input prompt.

bgm

boolean

Whether to add background music to the generated video. Default is false. true: the system will automatically select suitable music from the preset BGM library and add it; false: no BGM is added.

duration

integer

Video duration (seconds). The default for the viduq2 series is 5. Optional: 1-10.

seed

integer

Random seed. When omitted by default or set to 0, a random number will be used instead. If manually set, the specified seed will be used.

resolution

string

Resolution parameter. The default value depends on the model and video duration. Optional values: 540p, 720p, 1080p, 360p. viduq2 (1-10 seconds): default 720p, optional: 540p, 720p, 1080p.

payload

string

Passthrough parameter. No processing is performed; data transmission only. Maximum length: 1048576 characters.

off_peak

boolean

Off-peak mode. Default is false. true: generate video during off-peak periods; false: generate video immediately.

watermark

boolean

Whether to add a watermark. Default is false. true: add watermark; false: do not add watermark.

wm_position

integer

Watermark position. Default is 3 (bottom right). Optional values: 1: top left; 2: top right; 3: bottom right; 4: bottom left.

wm_url

string

Watermark content, where this is an image URL. If not provided, the default watermark is used: content generated by AI.

meta_data

string

Metadata identifier. A JSON-formatted string, passed through as-is.

Response

created_at

string

Creation time (ISO 8601 format).

status

string

Task status. Common values include processing, failed, and completed.

task_id

string

Task ID, passed as the id parameter in subsequent queries.

Vidu Task Status Query Query Task

​Create Video

​Supported Models

​Method and Path

​Request Example

​Response Example

​Authentication

​Body

​Response

​Related APIs

Create Video

Supported Models

Method and Path

Request Example

Response Example

Authentication

Body

Response

Related APIs