Models
202 models · 12 providers
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Text generation model. Compatible with the OpenAI Chat Completions API.
DeepSeek — open-weight Chinese LLM family. Strong cost-to-quality ratio and good code generation.
Text generation model. Compatible with the OpenAI Chat Completions API.
DeepSeek — open-weight Chinese LLM family. Strong cost-to-quality ratio and good code generation.
Moonshot Kimi — long-context Chinese model known for strong document reading and comprehension.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Text-to-image model. Generates original images from natural-language prompts.
Text-to-image model. Generates original images from natural-language prompts.
Video generation model. Produces video clips from text or images.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Zhipu GLM — Chinese LLM from Tsinghua. Solid bilingual support with academic training roots.
Text generation model. Compatible with the OpenAI Chat Completions API.
MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.
Text generation model. Compatible with the OpenAI Chat Completions API.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Latest-generation frontier model with expanded reasoning and faster tool execution. Top choice when quality trumps cost.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Gemini Pro — Google's higher-quality Gemini tier. Strong reasoning with large context windows.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Text-to-image model. Generates original images from natural-language prompts.
Moonshot's Kimi K2.5 — Chinese-first model with exceptional long-context ability. Known for strong reading comprehension.
Text-to-image model. Generates original images from natural-language prompts.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Zhipu GLM — Chinese LLM from Tsinghua. Solid bilingual support with academic training roots.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Zhipu GLM — Chinese LLM from Tsinghua. Solid bilingual support with academic training roots.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text generation model. Compatible with the OpenAI Chat Completions API.
MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text-to-image model. Generates original images from natural-language prompts.
Video generation model. Produces video clips from text or images.
Video generation model. Produces video clips from text or images.
Upgraded GPT-5 with longer context and improved latency. Production default for demanding agentic workloads.
Codex variant of GPT-5.2 tuned for software engineering. Specialized for repo-aware coding agents.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Claude Opus — AWS's most capable (and expensive) tier. Reserved for the hardest problems.
Gemini Pro — Google's higher-quality Gemini tier. Strong reasoning with large context windows.
Gemini Pro — Google's higher-quality Gemini tier. Strong reasoning with large context windows.
MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.
Zhipu GLM — Chinese LLM from Tsinghua. Solid bilingual support with academic training roots.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Claude Haiku — fast, affordable AWS model. Best for high-volume real-time tasks.
Claude Haiku — fast, affordable AWS model. Best for high-volume real-time tasks.
Video generation model. Produces video clips from text or images.
Video generation model. Produces video clips from text or images.
MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.
Text-to-image model. Generates original images from natural-language prompts.
Image generation model. Creates or edits images from text prompts.
Video generation model. Produces video clips from text or images.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Zhipu GLM — Chinese LLM from Tsinghua. Solid bilingual support with academic training roots.
Zhipu GLM — Chinese LLM from Tsinghua. Solid bilingual support with academic training roots.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Text generation model. Compatible with the OpenAI Chat Completions API.
DeepSeek — open-weight Chinese LLM family. Strong cost-to-quality ratio and good code generation.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Text-to-image model. Generates original images from natural-language prompts.
Video generation model. Produces video clips from text or images.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Upgraded DeepSeek V3.1 with improved reasoning and better tool calling. Pareto-optimal on cost vs quality.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Text-to-image model. Generates original images from natural-language prompts.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Video generation model. Produces video clips from text or images.
Video generation model. Produces video clips from text or images.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
DeepSeek — open-weight Chinese LLM family. Strong cost-to-quality ratio and good code generation.
Claude 4 Sonnet — balance of speed, quality, and cost for agentic workflows and production coding.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Video generation model. Produces video clips from text or images.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Alibaba Qwen series — Chinese-first LLMs with strong bilingual support. Wide range from turbo to max tiers.
Latest Azure image model with improved realism and editing. Supports inpainting, outpainting, and mask-guided edits.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Next-generation reasoning model succeeding o1. Solves problems that stumped previous models, at a reasonable cost.
Azure o-series — reasoning-first models that think before answering. Best for hard math, science, and code.
Code-focused GPT-4 successor with stronger instruction following and 1M+ context. Great for long-document analysis and agentic coding.
Tiny model optimized for classification and structured output. Cheapest in the GPT-4 family.
Smaller GPT-4.1 with the same 1M context at a fraction of the cost. The new default for long-context RAG and bulk processing.
MiniMax — Chinese LLM family with hybrid attention for extreme-length contexts.
ByteDance Doubao — Chinese LLM family tuned for the Volcano Engine cloud and ByteDance ecosystem.
Gemini 2.5 Pro — Google's top reasoning model with thinking mode. Frontier performance on coding and math.
Gemini Pro — Google's higher-quality Gemini tier. Strong reasoning with large context windows.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
DeepSeek R1 — a reasoning-first model trained with reinforcement learning. Competes with o1-class models at much lower cost.
Text-to-image model. Generates original images from natural-language prompts.
Text-to-image model. Generates original images from natural-language prompts.
Open-weight DeepSeek V3 — MoE architecture delivering frontier-adjacent quality at a fraction of the cost.
Next-gen Gemini Flash with improved reasoning and native tool use. Drop-in upgrade to 1.5 Flash.
Gemini Flash — fast Google multimodal model with long context. Best value for volume tasks.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text-to-image model. Generates original images from natural-language prompts.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Balanced Qwen tier with strong Chinese + reasonable cost. The pragmatic default for production Chinese apps.
Alibaba's flagship Qwen model. Strong bilingual (Chinese / English) performance, especially tuned for enterprise scenarios.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Reasoning-first model that thinks before answering. Best for math, science, and multi-step problem solving.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Text generation model. Compatible with the OpenAI Chat Completions API.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Text-to-image model. Generates original images from natural-language prompts.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Text-to-video or image-to-video model. Generates short video clips with configurable duration and resolution.
Video generation model. Produces video clips from text or images.
Video generation model. Produces video clips from text or images.
Video generation model. Produces video clips from text or images.
Text generation model. Compatible with the OpenAI Chat Completions API.
Flagship multimodal model from Azure with native text, vision, and voice understanding. Strong at general-purpose reasoning and instruction following.
Cheap and fast sibling of GPT-4o. Best value for high-volume classification, extraction, and routing tasks.
Qwen variant with very long context (10M+ tokens). Purpose-built for long-document analysis and codebase-level tasks.
Smallest, cheapest Qwen. Good for classification, routing, and high-volume light tasks in Chinese.
Claude Haiku — fast, affordable AWS model. Best for high-volume real-time tasks.
Claude Sonnet — AWS's balanced model. Strong coding, writing, and tool use with 200K context.
Gemini Pro — Google's higher-quality Gemini tier. Strong reasoning with large context windows.
Google's fast multimodal model with 1M context. Remarkable value for long-document and video input tasks.
Azure's production image generator. Known for strong prompt adherence and coherent in-image text rendering.