TokenLX
GEMINI

google/gemini-3.1-flash-tts

$0.5/M tokens input$10.00/M tokens output3.82B  tokens servedAudio

Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.

Key strengths

  • Very low cost
  • Long context
  • Multimodal input
  • Fast first token

Use cases

  • Bulk classification
  • Long-doc RAG
  • Video input
  • High QPS
cheapmultimodal

Google's google/gemini-3.1-flash-tts is a high-quality speech model. It generates natural-sounding speech across multiple voices and languages, with low-latency streaming output suitable for real-time voice applications.

Supports SSML-style controls, configurable voices, speaking rate, and pitch. Compatible with the OpenAI `/audio/speech` and `/audio/transcriptions` endpoint shapes.

google/gemini-3.1-flash-tts is fully OpenAI-compatible — drop in your existing OpenAI Python or Node SDK and switch `baseURL` to `https://api.tokenlx.ai`. TokenLX transparently routes your requests to the optimal provider endpoint while preserving streaming, function-calling, and structured-output semantics.

Performance

Compare different providers across TokenLX · All locations.

Throughput
67
tok/s
Latency
102
ms
E2E Latency
171
ms
Tool Call Errors
0.08
%
Output Errors
0.36
%
Time to First Token
80
ms

Effective Pricing

Actual cost per million tokens across providers over the past 7 days.

Input
$0.5
per 1M tokens
7d agotoday
Output
$10.00
per 1M tokens
7d agotoday

Recent activity

Total usage per day on TokenLX (last 30 days).

Prompt
1.56B
Completion
2.26B
30d ago15d agotoday

Sample code & API

TokenLX normalizes requests and responses across providers. Use any OpenAI SDK or our native SDK.

# Python — use HTTP client directly
# Endpoint: POST https://api.tokenlx.ai/v1/videos/generations
# Headers:  Authorization: Bearer $TOKENLX_API_KEY
# Body:     { "model": "gemini-3.1-flash-tts", "prompt": "...", "duration": 5 }

Replace sk-aihubrouter-… with your key from the dashboard.