Create Video
Seedance-2
Create Video
Submit a Seedance 2.0 multimodal video generation task using POST /v1/video/generations.
POST
Create Video
Create Video
Submit a Seedance 2.0 video generation task. It supports text-to-video, first-frame/first-and-last-frame, reference image/video/audio, video continuation, video editing, and multimodal composition modes. For assets in the media library, it is recommended to reference them incontent using asset://{assetId} (see Upload Assets).
Method and Path
Request Examples
Response Examples
task_id.
Authentication
Body
Model name:
doubao-seedance-2-0-260128: Standard version, optimized for the best visual quality and complex shot planningdoubao-seedance-2-0-fast-260128: Fast version, optimized for low latency and cost-sensitive scenarios
Multimodal input array; the order affects role assignment.
Content type:
text, image_url, video_url, audio_url, draft_task.Required when
type=text; prompt text.Used when
type=image_url; must include url.Public image URL or asset reference
asset://{assetId}.Used when
type=video_url; must include url.Public video URL or
asset://{assetId}.Used when
type=audio_url; must include url.Public audio URL or
asset://{assetId}.Used when
type=draft_task; must include id, and it must be the only element in content.Draft task ID, used to continue generation from a draft.
Media role:
first_frame: first frame (image)last_frame: last frame (image)reference_image: reference imagereference_video: reference/source video (continuation, editing)reference_audio: reference audio (requiresmetadata.generate_audio=true)
Video generation parameters; all are optional.
Video duration in seconds. Valid range
[4, 15] or -1 (automatically determined by the model), default 5.Resolution:
480p, 720p, 1080p, default 720p.Aspect ratio:
16:9, 9:16, 1:1, 4:3, adaptive, default 16:9.Total number of video frames. Mutually exclusive with
duration; if frames is provided, it takes precedence over duration.Random seed. The same
seed plus the same input can produce similar results.Whether to keep the camera fixed (suppress camera movement), default
false.Whether to add a watermark in the bottom-right corner of the video, default
true.Whether to generate or synthesize audio. Must be
true when using reference_audio, default false.Whether to return the final frame image URL for subsequent continuation, default
false.Draft mode: faster generation with slightly lower quality, suitable for previews, default
false.Service tier, default
default.Maximum task execution time in seconds, range
[3600, 259200] (1 hour to 3 days), default 172800.Callback URL when the task is completed.
content Mixing Rules
Violating the following rules may return400:
reference_imagecannot appear together withfirst_frame/last_frameaudio_urlcannot be the only input incontent; it must be paired with at least an image or videodraft_taskmust be the only element in thecontentarray
Generation Mode Comparison
| Mode | Request Example Label | content Key Points |
|---|---|---|
| Text to Video | Text to Video | text + optional reference image/video/audio |
| First-Frame Image to Video | First-Frame Image to Video | text + first_frame |
| First-and-Last-Frame Image to Video | First-and-Last-Frame Image to Video | text + first_frame + last_frame |
| Reference Image to Video | Reference Image to Video | text + reference_image |
| Video Continuation | Video Continuation | text + reference_video |
| Video Editing | Video Editing | text + reference_video + reference_image |
| Multimodal Composition | Multimodal Composition | text + multiple types of references |
| Reference Assets | Reference Assets | Each URL uses asset://{assetId} |
Response
Task ID, used for Query Task.