glm-4.6-thinking
GLM · Per Token
Available · Cache 86% off · Chat · Reasoning · Tool Use
Context Window: 128K · Max Output Tokens: 8K
Pricing
| Per 1M tokens | Official Price | LemonData Price |
|---|---|---|
| Input | $0.80 | $0.56 |
| Output | $1.60 | $1.12 |
| Cache Read | $0.11 | $0.11 |
| Cache Write | Free | Free |
Parameters
Context Window: 128K tokens
Max Output Tokens: 8K tokens
Best For
Chat: Conversational AI, customer support, and Q&A
Reasoning: Complex reasoning tasks, analytical work, and research
Cost Calculator
Input tokens: 1M
Output tokens: 0.5M
Estimated Monthly Cost: $1.12
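The estimate above can be reproduced by hand from the pricing table. The following is a minimal sketch, not LemonData's billing logic: the function name `estimate_cost` and the `cached_input_tokens` parameter are hypothetical, and it assumes cached tokens are a subset of the input tokens billed at the cache-read rate instead of the input rate.

```python
from decimal import Decimal

# LemonData prices in USD per 1M tokens, copied from the pricing table.
PRICE_PER_MILLION = {
    "input": Decimal("0.56"),
    "output": Decimal("1.12"),
    "cache_read": Decimal("0.11"),
}


def estimate_cost(input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate cost in USD for a given token volume.

    Assumption: cached_input_tokens is the part of input_tokens that is
    billed at the cache-read rate; cache writes are free per the table.
    """
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")
    fresh_input = input_tokens - cached_input_tokens
    total = (
        Decimal(fresh_input) * PRICE_PER_MILLION["input"]
        + Decimal(cached_input_tokens) * PRICE_PER_MILLION["cache_read"]
        + Decimal(output_tokens) * PRICE_PER_MILLION["output"]
    )
    return total / Decimal(1_000_000)


# The calculator's scenario: 1M input tokens and 0.5M output tokens.
print(estimate_cost(1_000_000, 500_000))
```

With 1M input tokens and 0.5M output tokens this gives 0.56 + 0.56 = 1.12 USD, matching the calculator. `Decimal` is used so that small per-token prices do not pick up floating-point rounding noise.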
API Code Example
POST /v1/chat/completions
curl https://api.lemondata.cc/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "glm-4.6-thinking",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
FAQ
How much does glm-4.6-thinking cost?
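On LemonData, glm-4.6-thinking costs $0.56 per 1M input tokens and $1.12 per 1M output tokens, which is 30% off the official prices of $0.80 and $1.60. Cache reads cost $0.11 per 1M tokens at both the official and LemonData rates, and cache writes are free.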
What is glm-4.6-thinking best for?
glm-4.6-thinking is suited to chat, reasoning, and tool use. You can access it through LemonData's unified API with a single API key.
How do I use the glm-4.6-thinking API?
Get your API key from LemonData, set the base URL to https://api.lemondata.cc/v1, and use any OpenAI-compatible SDK. See the curl example above.
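For readers who prefer Python to curl, the same request can be sketched with the standard library alone. This is an illustration under stated assumptions: the helper `build_chat_request` and the environment variable `LEMONDATA_API_KEY` are hypothetical names, and the response is assumed to follow the OpenAI-compatible chat completions shape (`choices[0].message.content`).

```python
import json
import os
import urllib.request

BASE_URL = "https://api.lemondata.cc/v1"


def build_chat_request(prompt, api_key, model="glm-4.6-thinking"):
    """Build (but do not send) a POST request to /chat/completions."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    # LEMONDATA_API_KEY is a hypothetical variable name; nothing is sent
    # unless it is set.
    api_key = os.environ.get("LEMONDATA_API_KEY")
    if api_key:
        request = build_chat_request("Hello!", api_key)
        with urllib.request.urlopen(request, timeout=60) as response:
            body = json.load(response)
        # Assumed OpenAI-compatible response shape.
        print(body["choices"][0]["message"]["content"])
```

Keeping request construction separate from sending makes it easy to inspect the URL, headers, and JSON body before spending any tokens.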