llama-3.1-8b-instant

Llama 3 · Per Token

llama-3.1-8b-instantAvailable
ChatTool UseFastContext Window: 128KMax Output Tokens: 8K

Pricing

Official PriceLemonData Price
Input$0.05$0.035
Output$0.08$0.056

Parameters

Context Window
128K tokens
Max Output Tokens
8K tokens

Best For

Chat

Conversational AI, customer support, and Q&A

Cost Calculator

1M
0.5M
Estimated Monthly Cost$0.06

API Code Example

POST/v1/chat/completions
curl https://api.lemondata.cc/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "llama-3.1-8b-instant",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

FAQ

How much does llama-3.1-8b-instant cost?

On LemonData, llama-3.1-8b-instant costs $0.0350 per 1M input tokens and $0.0560 per 1M output tokens, which is up to 30% off the official pricing.

What is llama-3.1-8b-instant best for?

llama-3.1-8b-instant excels at Chat, Tool Use, Fast. Access it through LemonData's unified API with a single API key.

How to use llama-3.1-8b-instant API?

Get your API key from LemonData, then call https://api.lemondata.cc/v1/chat/completions using any compatible SDK. See the code examples above for detailed integration.

Related Models