Quick Start
Start with your first request. Choose OpenAI, Anthropic Messages, or Gemini-compatible protocol, and understand when to switch to async task mode.
Choose a Compatible Protocol
https://nexusflow.vip/v1/chat/completionshttps://nexusflow.vip/v1/messageshttps://nexusflow.vip/v1beta/models/{model}:generateContentPrerequisites
- Already registered and logged in to the NexusFlow platform
- Created at least one API Key on the API Keys page
- Added balance on the Billing page
Install SDK
Use the OpenAI SDK by default. If you already have an Anthropic or Gemini client, check the corresponding protocol documentation directly.
Configure API
Point base_url to NexusFlow and use a unified API Key to access all models.
Send a Chat Request
Text and reasoning models should prefer the synchronous API. The following example uses Qwen3.5 Plus.
Integrate Async Tasks
When integrating image or video generation, we recommend uniformly using /v1/tasks. This pipeline is better suited for high-latency models, background batch tasks, and high-concurrency queuing.
High-Concurrency Tips
- Replace the
modelparameter with another model ID to switch models - All models share the same API Key - no need to apply separately
- OpenAI, Anthropic Messages, and Gemini-compatible protocols share the same balance and usage records
- Streaming output is supported - just set
stream: true - Images and videos should use
/v1/tasksto avoid synchronous blocking
For repeated system prompts or long document contexts, enable enable_context_caching: true (OpenAI protocol) or cache_control annotations (Anthropic protocol). Cached hit portions are billed at only 10% of the input price. See Billing Details.