GLM 5

GLM 5 is Z.ai's GLM-5 generation model, released February 12, 2026. It features multiple thinking modes, enhanced long-range planning and memory, and improved handling of complex multi-step agent tasks, and it supports agentic coding and structured data extraction workflows.

Reasoning · Tool Use · Implicit Caching
index.ts
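import { streamText } from 'ai';

const result = streamText({
  model: 'zai/glm-5',
  prompt: 'Why is the sky blue?',
});
// Print the response as it streams in.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart);
}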

More models by Z.ai

| Model | Context | Latency | Throughput | Input | Output | Cache read | Capabilities | Providers | Release Date |
|---|---|---|---|---|---|---|---|---|---|
|  | 1M | 7.8s | 78 tps | $1.40/M | $4.40/M | $0.26/M | +1 | zai | 06/16/2026 |
|  | 205K | 0.2s | 150 tps | $1.30/M | $4.30/M | $0.26/M | +1 | baseten, deepinfra, fireworks, +3 | 04/07/2026 |
|  | 203K | 3.4s | 71 tps | $1.20/M | $4.00/M | $0.24/M | +1 | zai | 03/15/2026 |
|  | 205K | 0.1s | 414 tps | $2.25/M | $2.75/M | $2.25/M | +1 | baseten, bedrock, cerebras, +3 | 12/22/2025 |
|  | 200K |  |  | $0.06/M | $0.40/M | $0.01/M | +1 | zai | 01/01/2025 |
|  | 200K | 0.3s | 124 tps | $0.07/M | $0.40/M | $0.01/M |  | bedrock, zai |  |
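The per-million-token prices in the listing above can be turned into a rough per-request cost estimate. The sketch below is illustrative only: the `Pricing` and `Usage` types and the `estimateCostUsd` helper are hypothetical names, not part of any SDK, and it assumes that cached prompt tokens are billed at the cache read rate instead of the regular input rate. Actual billing may differ.

```typescript
// Prices in USD per one million tokens, copied by hand from one listing row.
interface Pricing {
  inputPerM: number;
  outputPerM: number;
  cacheReadPerM: number;
}

// Token counts for a single request (hypothetical shape, for illustration).
interface Usage {
  inputTokens: number;       // all prompt tokens, including cached ones
  cachedInputTokens: number; // portion of inputTokens served from cache
  outputTokens: number;
}

// Assumption: cached tokens are charged at the cache read rate,
// the remaining prompt tokens at the input rate.
function estimateCostUsd(p: Pricing, u: Usage): number {
  const uncached = u.inputTokens - u.cachedInputTokens;
  const total =
    uncached * p.inputPerM +
    u.cachedInputTokens * p.cacheReadPerM +
    u.outputTokens * p.outputPerM;
  return total / 1e6;
}

// Example using the row priced at $1.20/M input, $4.00/M output, $0.24/M cache read.
const examplePricing: Pricing = { inputPerM: 1.2, outputPerM: 4.0, cacheReadPerM: 0.24 };
const exampleUsage: Usage = { inputTokens: 10_000, cachedInputTokens: 4_000, outputTokens: 2_000 };
const exampleCost = estimateCostUsd(examplePricing, exampleUsage);
console.log(exampleCost.toFixed(5)); // roughly 0.01616 USD
```

With these example numbers, output tokens account for about half of the total even though they are a small share of the token count, which is typical when the output price is several times the input price.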