glm-4.6-thinking

GLM · Per Token

glm-4.6-thinkingAvailableCache 86% off
ChatReasoningTool UseContext Window: 128KMax Output Tokens: 8K

Pricing

Official PriceLemonData Price
Input$0.80$0.56
Output$1.60$1.12
Cache Read$0.11$0.11
Cache WriteFreeFree

Parameters

Context Window
128K tokens
Max Output Tokens
8K tokens

Best For

Chat

Conversational AI, customer support, and Q&A

Reasoning

Complex reasoning tasks, analytical work, and research

Cost Calculator

1M
0.5M
Estimated Monthly Cost$1.12

API Code Example

POST/v1/chat/completions
curl https://api.lemondata.cc/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "glm-4.6-thinking",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

FAQ

How much does glm-4.6-thinking cost?

On LemonData, glm-4.6-thinking costs $0.5600 per 1M input tokens and $1.1200 per 1M output tokens, which is up to 30% off the official pricing.

What is glm-4.6-thinking best for?

glm-4.6-thinking excels at Chat, Reasoning, Tool Use. Access it through LemonData's unified API with a single API key.

How to use glm-4.6-thinking API?

Get your API key from LemonData, set the base URL to api.lemondata.cc/v1, and use any OpenAI-compatible SDK. See the code examples above.

Related Models