TokenLX — One API for 202+ AI Models

MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.

Key strengths

Very long context
Hybrid attention architecture
Good reasoning
Chinese + English

Use cases

Book-length inputs
Long conversations
RAG over huge corpora
Research

long-contextchinese

MiniMax's minimax/minimax-music-2.0 is a high-quality speech model. It generates natural-sounding speech across multiple voices and languages, with low-latency streaming output suitable for real-time voice applications.

Supports SSML-style controls, configurable voices, speaking rate, and pitch. Compatible with the OpenAI `/audio/speech` and `/audio/transcriptions` endpoint shapes.

minimax/minimax-music-2.0 is fully OpenAI-compatible — drop in your existing OpenAI Python or Node SDK and switch `baseURL` to `https://api.tokenlx.ai`. TokenLX transparently routes your requests to the optimal provider endpoint while preserving streaming, function-calling, and structured-output semantics.

Performance

Compare different providers across TokenLX · All locations.

Throughput

tok/s

Latency

E2E Latency

132

Tool Call Errors

0.07

Output Errors

0.40

Time to First Token

Effective Pricing

Pricing is shown by the model billing method, using per-call or per-second prices and resolution tiers.

Per call

$0.03

Per call

Recent activity

Total usage per day on TokenLX (last 30 days).

Prompt

24.25M

Completion

37.59M

30d ago15d agotoday

Sample code & API

TokenLX normalizes requests and responses across providers. Use any OpenAI SDK or our native SDK.

# Python — use HTTP client directly
# Endpoint: POST https://api.tokenlx.ai/v1/videos/generations
# Headers:  Authorization: Bearer $TOKENLX_API_KEY
# Body:     { "model": "minimax-music-2.0", "prompt": "...", "duration": 5 }

Replace sk-aihubrouter-… with your key from the dashboard.