from openai import OpenAI

client = OpenAI(
    api_key="sk-air-your-key",
    base_url="https://nexusflow.hk/v1",
)

response = client.chat.completions.create(
    model="qwen3.5-plus",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, please introduce yourself."}
    ],
    temperature=0.7,
    max_tokens=1000
)

print(response.choices[0].message.content)

Integrate Async Tasks

When you start integrating image or video generation, we recommend using /v1/tasks. This pipeline is better suited for high-latency models, background batch jobs, and high-concurrency queuing.

curl -X POST https://nexusflow.hk/v1/tasks \
  -H "Authorization: Bearer sk-air-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "happyhorse-1.0-t2v",
    "prompt": "City coastline at dusk, slow dolly-in, cinematic natural light",
    "duration": 10,
    "resolution": "720P"
  }'

# Poll task status
curl https://nexusflow.hk/v1/tasks/task_xxx \
  -H "Authorization: Bearer sk-air-your-key"

High-Concurrency Integration Tips

Split chat requests and multimedia tasks into separate queues to avoid throughput contention.

A single API Key works across the OpenAI, Anthropic Messages, and Responses API protocols.

Poll tasks every 3-5 seconds and use exponential backoff for failed retries.

Confirm the RPM / TPM and concurrency strategy on the rate limits page before load testing.

Tips

Switch models by replacing the model parameter with another model ID
All models share a single API Key — no separate applications needed
The OpenAI, Anthropic Messages, and Responses API protocols share the same balance and usage records
Streaming is supported — just set stream: true
Use /v1/tasks for image and video to avoid synchronous blocking

💡 Cost-saving tip: Context Cache

For repeated system prompts or long document context, enable enable_context_caching: true (OpenAI protocol) or the cache_control annotation (Anthropic protocol). Cache hits are billed at only 10% of the input price. See billing details.

Next Steps

Browse Models

View all 45+ available models

Three-Protocol Access

OpenAI / Anthropic / Responses compatibility guide

Async Tasks

Unified task API for image / video

Rate Limits & Concurrency

Limits and optimization tips for high concurrency