glm-4.6-thinking

Name: glm-4.6-thinking API
Brand: zhipu
Price: 0.560000 USD
Availability: InStock

GLM · Per Token

glm-4.6-thinkingAvailable⚡Cache 86% off

ChatReasoningTool UseContext Window: 128KMax Output Tokens: 8K

Pricing

	Official Price	LemonData Price
Input	$0.80	$0.56
Output	$1.60	$1.12
Cache Read	$0.11	$0.11
Cache Write	Free	Free

Parameters

Context Window

128K tokens

Max Output Tokens

8K tokens

Best For

Chat

Conversational AI, customer support, and Q&A

Reasoning

Complex reasoning tasks, analytical work, and research

Cost Calculator

Monthly Input Tokens1M

Monthly Output Tokens0.5M

Estimated Monthly Cost$1.12

API Code Example

POST/v1/chat/completions

curl https://api.lemondata.cc/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "glm-4.6-thinking",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

FAQ

How much does glm-4.6-thinking cost?

On LemonData, glm-4.6-thinking costs $0.5600 per 1M input tokens and $1.1200 per 1M output tokens, which is up to 30% off the official pricing.

What is glm-4.6-thinking best for?

glm-4.6-thinking excels at Chat, Reasoning, Tool Use. Access it through LemonData's unified API with a single API key.

How to use glm-4.6-thinking API?

Get your API key from LemonData, set the base URL to api.lemondata.cc/v1, and use any OpenAI-compatible SDK. See the code examples above.

Related Models

Qwen3 / QwQ

Alibaba Cloud · 37 models

Doubao

ByteDance · 17 models