TokenLX — One API for 235+ AI Models

Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.

Key strengths

Strong Chinese
Good English
Multiple size tiers
Tool calling

Use cases

Chinese assistants
Bilingual content
Enterprise chat
Cross-border apps

chinesebilingual

Alibaba's alibaba/qwen3-coder-plus is a frontier text generation model in the Qwen family. It excels at complex reasoning, agentic workflows, code generation, and long-form writing tasks, with native support for streaming, tool calling, JSON mode, and multi-turn conversations.

The model handles long-context inputs gracefully and is particularly effective for software engineering, multi-step research, and end-to-end project execution. Its tokenizer and pricing are optimized for high-throughput production workloads, with a competitive cost profile relative to other models in its tier.

alibaba/qwen3-coder-plus is fully OpenAI-compatible — drop in your existing OpenAI Python or Node SDK and switch `baseURL` to `https://api.tokenlx.ai`. TokenLX transparently routes your requests to the optimal provider endpoint while preserving streaming, function-calling, and structured-output semantics.

Performance

Compare different providers across TokenLX · All locations.

Throughput

tok/s

Latency

124

E2E Latency

198

Tool Call Errors

0.06

Output Errors

0.39

Time to First Token

Effective Pricing

Actual cost per million tokens across providers over the past 7 days.

Input

$0.59

per 1M tokens

7d agotoday

Output

$2.37

per 1M tokens

7d agotoday

Input tiers

0M - 0.032M$0.59per 1M tokens

0.032M - 0.128M$0.89per 1M tokens

0.128M - 0.256M$1.48per 1M tokens

0.256M - 1M$2.96per 1M tokens

Output tiers

0M - 0.032M$2.37per 1M tokens

0.032M - 0.128M$3.55per 1M tokens

0.128M - 0.256M$5.91per 1M tokens

0.256M - 1M$29.57per 1M tokens

Recent activity

Total usage per day on TokenLX (last 30 days).

Prompt

1.50B

Completion

3.83B

30d ago15d agotoday

Sample code & API

TokenLX normalizes requests and responses across providers. Use any OpenAI SDK or our native SDK.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.tokenlx.ai/v1",
    api_key="sk-tokenlx-...",
)

# Non-streaming
response = client.chat.completions.create(
    model="qwen3-coder-plus",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
)
print(response.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="qwen3-coder-plus",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True,
)
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="", flush=True)

Replace sk-aihubrouter-… with your key from the dashboard.

API Endpoints

Chat Completions

Sends a request for a model response for the given chat conversation. Supports both streaming and non-streaming modes.

POSThttps://api.tokenlx.ai/v1/chat/completions

AuthorizationBearer $TOKENLX_API_KEY

Content-Typeapplication/json

Modelqwen3-coder-plus

Responses API

Creates a streaming or non-streaming response using the OpenAI Responses API format.

POSThttps://api.tokenlx.ai/v1/responses

AuthorizationBearer $TOKENLX_API_KEY

Content-Typeapplication/json

Modelqwen3-coder-plus

Anthropic Messages

Creates a message using the Anthropic Messages API format. Supports text, images, tools, and extended thinking.

POSThttps://api.tokenlx.ai/v1/messages

Authorizationx-api-key: $TOKENLX_API_KEY

Content-Typeapplication/json

Modelqwen3-coder-plus

Effective Pricing

Actual cost per million tokens across providers over the past 7 days.

Input

$0.59

per 1M tokens

7d agotoday

Output

$2.37

per 1M tokens

7d agotoday

Input tiers

0M - 0.032M$0.59per 1M tokens

0.032M - 0.128M$0.89per 1M tokens

0.128M - 0.256M$1.48per 1M tokens

0.256M - 1M$2.96per 1M tokens

Output tiers

0M - 0.032M$2.37per 1M tokens

0.032M - 0.128M$3.55per 1M tokens

0.128M - 0.256M$5.91per 1M tokens

0.256M - 1M$29.57per 1M tokens

Sample code & API

TokenLX normalizes requests and responses across providers. Use any OpenAI SDK or our native SDK.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.tokenlx.ai/v1",
    api_key="sk-tokenlx-...",
)

# Non-streaming
response = client.chat.completions.create(
    model="qwen3-coder-plus",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
)
print(response.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="qwen3-coder-plus",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True,
)
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="", flush=True)

Replace sk-aihubrouter-… with your key from the dashboard.

API Endpoints

Chat Completions

Sends a request for a model response for the given chat conversation. Supports both streaming and non-streaming modes.

POSThttps://api.tokenlx.ai/v1/chat/completions

AuthorizationBearer $TOKENLX_API_KEY

Content-Typeapplication/json

Modelqwen3-coder-plus

Responses API

Creates a streaming or non-streaming response using the OpenAI Responses API format.

POSThttps://api.tokenlx.ai/v1/responses

AuthorizationBearer $TOKENLX_API_KEY

Content-Typeapplication/json

Modelqwen3-coder-plus

Anthropic Messages

Creates a message using the Anthropic Messages API format. Supports text, images, tools, and extended thinking.

POSThttps://api.tokenlx.ai/v1/messages

Authorizationx-api-key: $TOKENLX_API_KEY

Content-Typeapplication/json

Modelqwen3-coder-plus

alibaba/qwen3-coder-plus

Key strengths

Use cases

Performance

Effective Pricing

Recent activity

Sample code & API

API Endpoints

More models from Alibaba

alibaba/qwen3-coder-plus

Key strengths

Use cases

Performance

Effective Pricing

Recent activity

Sample code & API

API Endpoints

More models from Alibaba