Explore Models

All Models

Text Models

gpt-oss-120b

gpt-oss-120b

OpenAI’s open-weight model (Big model smell)

LLM
FP8
New
gpt-oss-20b

gpt-oss-20b

OpenAI’s open-weight model (Small model smell)

LLM
FP8
New
Qwen3-Coder-480B-A35B-Instruct

Qwen3-Coder-480B-A35B-Instruct

The latest and most powerful coder model from the Qwen Team.

LLM
FP8
New
Qwen3-235B-A22B-Instruct-2507

Qwen3-235B-A22B-Instruct-2507

Qwen latest non-thinking model with significant improvements in general capabilities.

LLM
FP8
New
Kimi-K2

Kimi-K2

Kimi's latest 1T LLM, good at coding and tool-calling.

LLM
FP8
New
DeepSeek-R1-0528

DeepSeek-R1-0528

The latest open-source reasoner LLM released by DeepSeek.

LLM
FP8
New
Qwen3-235B-A22B

Qwen3-235B-A22B

A mixture-of-experts (MoE) model by Qwen, demonstrating strong reasoning ability and agent tool-calling capabilities.

LLM
FP8
DeepSeek-V3-0324

DeepSeek-V3-0324

DeepSeek's updated V3 model released on 03/24/2025.

LLM
FP8
QwQ-32B

QwQ-32B

The lastest Qwen reasoning model.

LLM
FP8
DeepSeek-R1

DeepSeek-R1

The best open-source reasoner LLM released by DeepSeek.

LLM
FP8
Popular
DeepSeek-V3

DeepSeek-V3

The best open-source LLM released by DeepSeek.

LLM
FP8
Llama-3.3-70B

Llama-3.3-70B

Meta's latest 70B LLM with performance comparable to llama 3.1 405B

LLM
FP8
Popular
Qwen2.5-Coder-32B

Qwen2.5-Coder-32B

The best coder from the Qwen Team.

LLM
FP8
Llama-3.2-3B

Llama-3.2-3B

The latest Llama 3.2 instruction-tuned model by Meta.

LLM
FP8
Qwen2.5-72B

Qwen2.5-72B

The latest Qwen LLM with more knowledge in coding and math.

LLM
FP8
Llama-3-70B

Llama-3-70B

A highly efficient and powerful model designed for a variety of tasks.

LLM
FP8
Hermes-3-70B

Hermes-3-70B

The latest flagship model in the Hermes series and the first full parameter fine-tune since the release of Llama 3.1.

LLM
FP8
Llama-3.1-405B

Llama-3.1-405B

The Biggest and Best open-source AI model trained by Meta, beating GPT-4o across most benchmarks.

LLM
FP8
Llama-3.1-8B

Llama-3.1-8B

The smallest and fastest member of the Llama 3.1 family.

LLM
FP8
Llama-3.1-70B

Llama-3.1-70B

The best LLM at its size with faster response times compared to the 405B model.

LLM
FP8