Supported Models

HuosuAI aggregates 40+ leading AI models from around the world. Switch between any model by changing the model parameter — no need to change your API Key or Base URL.

OpenAI

Model	Identifier	Context	Description
GPT-4o	`gpt-4o`	128K	Latest flagship multimodal model
GPT-4o mini	`gpt-4o-mini`	128K	Cost-effective model for high-volume use
GPT-4 Turbo	`gpt-4-turbo`	128K	High-performance with vision support
o1	`o1`	200K	Reasoning-enhanced model
o3-mini	`o3-mini`	200K	Compact reasoning model

Anthropic (Claude)

Model	Identifier	Context	Description
Claude Opus 4	`claude-opus-4-20250514`	200K	Most capable model for complex tasks
Claude Sonnet 4	`claude-sonnet-4-20250514`	200K	Balanced intelligence and speed
Claude 3.5 Haiku	`claude-3-5-haiku-20241022`	200K	Fast response for real-time use

Google (Gemini)

Model	Identifier	Context	Description
Gemini 2.5 Pro	`gemini-2.5-pro`	1M	Latest flagship with ultra-long context
Gemini 2.0 Flash	`gemini-2.0-flash`	1M	Fast model balancing speed and quality
Gemini 1.5 Pro	`gemini-1.5-pro`	1M	Million-token context for document analysis

DeepSeek

Model	Identifier	Context	Description
DeepSeek-V3	`deepseek-chat`	128K	Flagship chat model
DeepSeek-R1	`deepseek-reasoner`	128K	Reasoning-enhanced model
DeepSeek-Coder-V2	`deepseek-coder`	128K	Code-specialized model

Qwen (Alibaba Cloud)

Model	Identifier	Context	Description
Qwen-Max	`qwen-max`	128K	Flagship model
Qwen-Plus	`qwen-plus`	128K	Enhanced balanced model
Qwen-Turbo	`qwen-turbo`	128K	High-speed model

Zhipu AI (GLM)

Model	Identifier	Context	Description
GLM-4-Plus	`glm-4-plus`	128K	Latest flagship model
GLM-4	`glm-4`	128K	High-performance general model
GLM-4-Flash	`glm-4-flash`	128K	Free fast inference model

Moonshot (Kimi)

Model	Identifier	Context	Description
Moonshot-v1-128K	`moonshot-v1-128k`	128K	Long context model
Moonshot-v1-32K	`moonshot-v1-32k`	32K	Medium context model
Moonshot-v1-8K	`moonshot-v1-8k`	8K	Standard context, faster response

Model Selection Guide

General chat: GPT-4o, Claude Sonnet 4, DeepSeek-V3
Code generation: Claude Opus 4, DeepSeek-Coder-V2, GPT-4o
Long documents: Gemini 2.5 Pro (1M context), Qwen-Long
Budget-friendly: GPT-4o mini, DeepSeek-V3, GLM-4-Flash (free)