Qwen Series

by Alibaba Cloud

Qwen is Alibaba Cloud's in-house large language model. This page currently features the Qwen3.6 and Qwen3.5 series, which cover common production scenarios such as long text, function calling, code generation, and complex reasoning. Text models can be accessed via three public protocols: OpenAI Chat, Anthropic Messages, and Responses API.

Protocols

OpenAI Chat

/v1/chat/completions

Core Advantages

🌐

Multilingual

Native-level understanding

📚

Long text

1M context

⚡

Cost-effective

Affordable pricing

🔗

Ecosystem

Seamless with Alibaba Cloud

Available Models

Qwen3.7 Max

NEW

qwen3.7-max

FlagshipNewThinking ModeAgent

Qwen 3.7-generation flagship model built for the agent era, with comprehensive improvements in coding, office work, and long-horizon autonomous execution. Supports thinking mode toggle, function calling, and web search. 1M context.

Context Window

1,000,000

Max Output

65,536

Input Price

$12/M

Output Price

$36/M

Function callingThinking modeWeb search1M context

Qwen3.7 Plus

NEW

qwen3.7-plus

Cost-effectiveNewMultimodalAgent

Cost-effective Plus model in the Qwen3.7 series. On top of strong text capabilities, it comprehensively upgrades vision-language abilities: it can read the screen and operate a GUI, generate code from visual references, and retain full coding, tool-use, and productivity-workflow agent capabilities. 1M context.

Context Window

1,000,000

Max Output

65,536

Input Price

$2/M

Output Price

$8/M

Image inputFunction callingThinking modeWeb search1M context

Qwen3.6 Max Preview

NEW

qwen3.6-max-preview

FlagshipThinking Mode

The most powerful preview model in the Qwen3.6 series, ideal for complex reasoning, multi-step code generation, and tool-based tasks.

Context Window

262,144

Max Output

65,536

Input Price

$9/M

Output Price

$54/M

Function callingComplex reasoningCode generation

Qwen3.6 Plus

NEW

qwen3.6-plus

RecommendedBalanced

Balanced flagship model supporting a 1M-token context window, function calling, and built-in tools, ideal for most production scenarios.

Context Window

1,000,000

Max Output

65,536

Input Price

$2/M

Output Price

$12/M

Image inputFunction callingCode generation1M context

Qwen3.5 Plus

NEW

qwen3.5-plus

RecommendedBalanced

Balanced performance model, ideal for most production scenarios. Strong language ability and fast responses.

Context Window

1,000,000

Max Output

65,536

Input Price

$0.8/M

Output Price

$4.8/M

Image inputFunction callingCode generation

Qwen3.5 Flash

qwen3.5-flash

FastEconomical

High-speed response model, ideal for latency-sensitive scenarios. Highly cost-effective.

Context Window

1,000,000

Max Output

65,536

Input Price

$0.2/M

Output Price

$2/M

Function callingLow cost

Qwen3.5 Omni Plus

NEW

qwen3.5-omni-plus

FlagshipNewOmni-modal

Flagship omni-modal model supporting any combination of text, image, audio, and video input, with text and speech output. 113 input languages, 55 voices, with web search and voice cloning.

Context Window

262,144

Max Output

65,536

Input Price

$7/M

Output Price

$40/M

Audio inputAudio outputVideo inputImage inputWeb search

Qwen3.5 Omni Flash

NEW

qwen3.5-omni-flash

RecommendedNewOmni-modal

Lightweight omni-modal model supporting any combination of text, image, audio, and video input with text + speech output. A cost-effective omni-modal choice.

Context Window

262,144

Max Output

65,536

Input Price

$2.2/M

Output Price

$13.3/M

Audio inputAudio outputVideo inputImage inputWeb search

Usage Example

from openai import OpenAI

client = OpenAI(
    api_key="sk-air-your-key",
    base_url="https://nexusflow.hk/v1",
)

# Use Qwen to process long text
response = client.chat.completions.create(
    model="qwen3.6-plus",
    messages=[
        {"role": "system", "content": "You are a professional document analysis assistant."},
        {"role": "user", "content": "Summarize the main points of the following long document... (you can paste very long text here)"}
    ],
    max_tokens=4096,
)

print(response.choices[0].message.content)

Related Docs

Chat Completions API

How to call the API

DeepSeek Series

Compare reasoning models

Playground

Try it online