NexusFlow Developer Documentation
One-stop integration for Qwen, DeepSeek, GLM, Kimi, HappyHorse, and more. Supports OpenAI, Anthropic Messages, and Gemini-compatible protocols with unified billing, keys, and monitoring.
Model Services
Unified model list, pricing, and capabilities
Three-Protocol Integration
OpenAI / Anthropic / Gemini compatible endpoints
Monitoring & Evaluation
Monitoring dashboard, error codes, and performance metrics
High-Concurrency Readiness
Rate limits, queues, task pipelines, and capacity scaling
Three Compatible Protocols
OpenAI-compatible
/v1/chat/completionsRecommended default integration method, covering chat, reasoning, tool calling, and most language SDKs.
Anthropic Messages
/v1/messagesSuitable for reusing Anthropic SDK, Claude Code-style clients, and Messages request format.
Gemini-compatible
/v1beta/models/{model}:generateContentSuitable for migrating from existing Gemini SDK or GenerateContent HTTP calls.
Quick Start
From your first chat request to async task integration
Model Overview
Browse model services, topic pages, and pricing
API Reference
OpenAI, Anthropic, and Gemini compatible protocols
Monitoring & Rate Limits
Rate limits, queues, and observability for production traffic
Popular Models
API Endpoints
Production Traffic Tips
Chat via Synchronous API
Chat and reasoning models should prefer `/v1/chat/completions` to reduce unnecessary polling complexity.
Multimedia via Async Tasks
Images and videos should use `/v1/tasks` uniformly, handling high latency and peak processing via task status.
Check Rate Limits & Monitoring Before Launch
Confirm peak request strategies on the rate limits, error codes, and monitoring pages to avoid discovering bottlenecks during traffic spikes.
Quick Example
Use the standard OpenAI SDK - just change the base_url to connect to NexusFlow.View full tutorial →