TokenLX
QWEN

alibaba/qwen3.5-omni

$1.03/M tokens input$5.91/M tokens output4.76B  tokens servedAudio

Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.

Key strengths

  • Strong Chinese
  • Good English
  • Multiple size tiers
  • Tool calling

Use cases

  • Chinese assistants
  • Bilingual content
  • Enterprise chat
  • Cross-border apps
chinesebilingual

Alibaba's alibaba/qwen3.5-omni is a high-quality speech model. It generates natural-sounding speech across multiple voices and languages, with low-latency streaming output suitable for real-time voice applications.

Supports SSML-style controls, configurable voices, speaking rate, and pitch. Compatible with the OpenAI `/audio/speech` and `/audio/transcriptions` endpoint shapes.

alibaba/qwen3.5-omni is fully OpenAI-compatible — drop in your existing OpenAI Python or Node SDK and switch `baseURL` to `https://api.tokenlx.ai`. TokenLX transparently routes your requests to the optimal provider endpoint while preserving streaming, function-calling, and structured-output semantics.

Performance

Compare different providers across TokenLX · All locations.

Throughput
56
tok/s
Latency
157
ms
E2E Latency
200
ms
Tool Call Errors
0.08
%
Output Errors
0.36
%
Time to First Token
120
ms

Effective Pricing

Actual cost per million tokens across providers over the past 7 days.

Input
$1.03
per 1M tokens
7d agotoday
Output
$5.91
per 1M tokens
7d agotoday

Recent activity

Total usage per day on TokenLX (last 30 days).

Prompt
1.38B
Completion
3.38B
30d ago15d agotoday

Sample code & API

TokenLX normalizes requests and responses across providers. Use any OpenAI SDK or our native SDK.

# Python — use HTTP client directly
# Endpoint: POST https://api.tokenlx.ai/v1/videos/generations
# Headers:  Authorization: Bearer $TOKENLX_API_KEY
# Body:     { "model": "qwen3.5-omni", "prompt": "...", "duration": 5 }

Replace sk-aihubrouter-… with your key from the dashboard.