Text Models
General Chat Completions API (Default Non-Streaming)
Use the OpenAI Chat Completions-compatible format to initiate a conversation and return the full result in one response.
POST
General Chat Completions API (Default Non-Streaming)
Suitable for background tasks, structured output, short Q&A, and scenarios where real-time display of the generation process is not required. Whenstream is omitted or set to false, the API returns a complete chat.completion object in a single response.
Request Body
Model name. Can be queried via the model list.
An array of conversation messages. Each message must contain at least
role and content.Non-streaming response when omitted or set to
false.Specifies the output format. Commonly used for JSON output or JSON Schema structured output.
A list of function calling tools.
Controls the tool calling strategy.
Sampling temperature. The default value is determined by the upstream model.
Nucleus sampling parameter.
Maximum number of generated tokens.
Random seed. When supported by the upstream model, this can improve reproducibility.
Request Example
Structured Output
Response Example
Response Fields
The text content generated by the model. May be
null when a tool call occurs.The function tools requested by the model.
Token usage for this request.