Skip to content

Supported Models

HuosuAI aggregates 40+ leading AI models from around the world. Switch between any model by changing the model parameter — no need to change your API Key or Base URL.

OpenAI

ModelIdentifierContextDescription
GPT-4ogpt-4o128KLatest flagship multimodal model
GPT-4o minigpt-4o-mini128KCost-effective model for high-volume use
GPT-4 Turbogpt-4-turbo128KHigh-performance with vision support
o1o1200KReasoning-enhanced model
o3-minio3-mini200KCompact reasoning model

Anthropic (Claude)

ModelIdentifierContextDescription
Claude Opus 4claude-opus-4-20250514200KMost capable model for complex tasks
Claude Sonnet 4claude-sonnet-4-20250514200KBalanced intelligence and speed
Claude 3.5 Haikuclaude-3-5-haiku-20241022200KFast response for real-time use

Google (Gemini)

ModelIdentifierContextDescription
Gemini 2.5 Progemini-2.5-pro1MLatest flagship with ultra-long context
Gemini 2.0 Flashgemini-2.0-flash1MFast model balancing speed and quality
Gemini 1.5 Progemini-1.5-pro1MMillion-token context for document analysis

DeepSeek

ModelIdentifierContextDescription
DeepSeek-V3deepseek-chat128KFlagship chat model
DeepSeek-R1deepseek-reasoner128KReasoning-enhanced model
DeepSeek-Coder-V2deepseek-coder128KCode-specialized model

Qwen (Alibaba Cloud)

ModelIdentifierContextDescription
Qwen-Maxqwen-max128KFlagship model
Qwen-Plusqwen-plus128KEnhanced balanced model
Qwen-Turboqwen-turbo128KHigh-speed model

Zhipu AI (GLM)

ModelIdentifierContextDescription
GLM-4-Plusglm-4-plus128KLatest flagship model
GLM-4glm-4128KHigh-performance general model
GLM-4-Flashglm-4-flash128KFree fast inference model

Moonshot (Kimi)

ModelIdentifierContextDescription
Moonshot-v1-128Kmoonshot-v1-128k128KLong context model
Moonshot-v1-32Kmoonshot-v1-32k32KMedium context model
Moonshot-v1-8Kmoonshot-v1-8k8KStandard context, faster response

Model Selection Guide

  • General chat: GPT-4o, Claude Sonnet 4, DeepSeek-V3
  • Code generation: Claude Opus 4, DeepSeek-Coder-V2, GPT-4o
  • Long documents: Gemini 2.5 Pro (1M context), Qwen-Long
  • Budget-friendly: GPT-4o mini, DeepSeek-V3, GLM-4-Flash (free)

Intelligent Model Aggregation Platform