Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Key strengths
- Very low cost
- Long context
- Multimodal input
- Fast first token
Use cases
- Bulk classification
- Long-doc RAG
- Video input
- High QPS
Google's google/gemini-2.5-flash-tts is a high-quality speech model. It generates natural-sounding speech across multiple voices and languages, with low-latency streaming output suitable for real-time voice applications.
Supports SSML-style controls, configurable voices, speaking rate, and pitch. Compatible with the OpenAI `/audio/speech` and `/audio/transcriptions` endpoint shapes.
google/gemini-2.5-flash-tts is fully OpenAI-compatible — drop in your existing OpenAI Python or Node SDK and switch `baseURL` to `https://api.tokenlx.ai`. TokenLX transparently routes your requests to the optimal provider endpoint while preserving streaming, function-calling, and structured-output semantics.
Performance
Compare different providers across TokenLX · All locations.
Effective Pricing
Actual cost per million tokens across providers over the past 7 days.
Recent activity
Total usage per day on TokenLX (last 30 days).
Sample code & API
TokenLX normalizes requests and responses across providers. Use any OpenAI SDK or our native SDK.
# Python — use HTTP client directly
# Endpoint: POST https://api.tokenlx.ai/v1/videos/generations
# Headers: Authorization: Bearer $TOKENLX_API_KEY
# Body: { "model": "gemini-2.5-flash-tts", "prompt": "...", "duration": 5 }Replace sk-aihubrouter-… with your key from the dashboard.