Model Catalog
35 modelsBrowse the full range of AI models, covering text, reasoning, vision, coding, image, video, embedding, and more
qwen3.7-plusQwen3.7 series cost-effective Plus model, with fully upgraded visual-language capabilities on top of strong text abilities, retaining complete agent capabilities for coding, tool use, and productivity workflows. Supports multimodal interactive hybrid agents: perceiving real-world scenes, reading screens and operating GUIs, generating code based on visual references, and end-to-end navigation of mobile apps. Equivalent to snapshot qwen3.7-plus-2026-05-26.
qwen3.7-maxQwen3.7 flagship model, built for the agent era with comprehensive improvements in coding, office productivity, and long-cycle autonomous execution. Supports thinking mode toggle, function calling, and web search. 1M token context window.
qwen3-maxQwen3 most powerful flagship model, supports thinking mode toggle, excels in complex reasoning, code generation, and mathematics. 262K context window.
qwen3.6-max-previewQwen3.6 most powerful preview model, designed for complex reasoning, code generation, and multi-step tool tasks, ideal for scenarios requiring stronger thinking capabilities.
qwen3.6-plusQwen3.6 balanced flagship model, supports 1M context window, function calling, and built-in tools, ideal for large codebases and general production scenarios.
qwen3.5-plusQwen3.5 enhanced model, best balance of quality, speed, and cost. Supports 1M context window, ideal for large-scale application scenarios.
qwen3.6-flashQwen3.6 flash model, ideal for simple tasks with fast speed and low cost. Supports 1M context window and context caching.
qwen3.5-flashQwen3.5 flash model, ideal for simple tasks with fast speed and low cost. Supports 1M context window and context caching.
qwen-plusQwen enhanced model, classic balance of quality and speed, ideal for large-scale application scenarios.
qwen-turboQwen high-speed model, extremely fast response and lowest cost, ideal for latency-sensitive application scenarios.
qwen3-235b-a22bQwen3 open-source flagship, 235B parameter MoE architecture (22B active), supports dynamic switching between thinking and non-thinking modes.
qwen3.6-35b-a3bQwen3.6 open-source MoE model, 35B total parameters with only 3B active, excels at agent coding, STEM, and reasoning tasks. Apache 2.0 licensed. Supports thinking mode toggle.
qwen3-32bQwen3 open-source 32B parameter dense model, excels among medium-scale models.
qwq-plusQwen reasoning model, trained on Qwen2.5, excels at mathematics, logical reasoning, and complex problem analysis, displaying complete chain-of-thought.
qwen-vl-maxQwen vision flagship model, supports image understanding, visual-text dialogue, document OCR, and other multimodal tasks.
qwen-vl-plusQwen vision enhanced model, balanced performance and cost multimodal model.
qwen3-vl-plusQwen3 vision-language model, significantly improved image understanding, supports high-resolution image input. 262K context window.
qwen3-vl-flashQwen3 vision flash model, fast image understanding, ideal for real-time scenarios.
qwen3-omni-flashQwen3 omni model, accepts text, image, video and other multimodal inputs, ideal for complex multimodal understanding scenarios.
qwen3-coder-plusQwen3 exceptional code model, excels at tool calling and environment interaction, with outstanding code generation, completion, debugging, and refactoring capabilities. 1M context window.
qwen3-coder-flashQwen3 code flash model, fast code completion and generation, ideal for IDE integration scenarios.
text-embedding-v4Qwen latest text embedding model, supports 100+ languages and multiple programming languages, vector dimensions selectable from 2048, 1536, 1024, 768, 512, 256, 128, 64, suitable for semantic retrieval, clustering, recommendation, and RAG.
wan2.6-t2iLatest generation text-to-image flagship model, supports mixed text-image output and image editing. Can process complex instructions, render Chinese and English text, and generate high-definition realistic images. Supports multiple resolutions and aspect ratios.
wan2.6-t2vLatest generation text-to-video flagship model, supports multi-shot narrative and intelligent storyboard. Can generate 2-15 second 1080P HD video, supports prompt rewriting. Generation time approximately 1-5 minutes.
wan2.6-i2vImage-driven video generation model, uses the input image as the first frame to generate coherent video. Supports multi-shot narrative, automatic dubbing, 720P/1080P resolution, 2-15 seconds duration. Excellent frame coherence and motion consistency.
pixverse-v6PixVerse latest flagship video generation model, supports text-to-video and image-to-video, with significantly improved visual quality and motion consistency. Supports 1-15 seconds duration, 360p/540p/720p/1080p multiple resolutions, various aspect ratios.
happyhorse-1.0-t2vAlibaba's 2026 latest AI video generation model, ranked #1 on benchmarks. Generates high-quality video from text, supports 720P/1080P, 3-15 seconds duration, various aspect ratios. Default audio included.
happyhorse-1.0-i2vGenerates coherent video using the input image as the first frame, supports 720P/1080P, 3-15 seconds duration. Excellent frame coherence and motion consistency. Default audio included.
happyhorse-1.0-r2vSupports 1-9 reference images input, can fuse characters/objects/scenes from images to generate video. Supports 720P/1080P, 3-15 seconds, various aspect ratios. Default audio included.
happyhorse-1.0-video-editAI video editing based on input video, supports 0-5 reference images for assisted editing. Input video 3-60 seconds (truncated beyond 15s), supports 720P/1080P, can preserve original audio.
deepseek-v4-flashDeepSeek V4 Flash high-speed model via DashScope, ideal for low-latency and high-concurrency online dialogue scenarios.
deepseek-v4-proDeepSeek V4 Pro flagship model via DashScope, designed for complex reasoning, code generation, and multi-step tasks.
deepseek-v3.2DeepSeek latest general-purpose LLM, MoE architecture, strong bilingual Chinese-English capabilities, powerful coding abilities.
glm-5.1Zhipu AI GLM-5.1 enhanced flagship model, further optimized over GLM-5, stronger complex reasoning and code generation capabilities.
qwen3-8bQwen3 open-source 8B parameter lightweight model, ideal for edge deployment and low-cost inference scenarios.