Provider Routing
Smart routing and automatic fault tolerance across multiple providers for the same model
NexusFlow configures multiple provider endpoints for the same model. The system intelligently selects the optimal endpoint based on real-time performance and automatically switches when a provider is unavailable - the entire process is transparent to the caller.
Smart Routing
When you make an API request, the system filters all matching provider endpoints based on model ID and request protocol, then selects the optimal endpoint considering stability, latency, and other factors. When an endpoint encounters issues, the system automatically lowers its priority; upon recovery, it returns to normal priority without manual intervention.
Automatic Fault Tolerance
When the selected endpoint returns a server error (5xx) or rate limit (429), the system automatically retries the next available endpoint until the request succeeds or all endpoints have been tried. 400-class client errors do not trigger retries.
Important Notes
We recommend implementing streaming request error handling logic on the client side to handle stream interruptions.
If fault tolerance is needed, we recommend using the protocol with the broadest model support (typically the OpenAI protocol), or combining withModel Fallbackto configure backup models.